Saliency prediction in the deep learning era: Successes and limitations

A Borji - IEEE transactions on pattern analysis and machine …, 2019 - ieeexplore.ieee.org
Visual saliency models have enjoyed a big leap in performance in recent years, thanks to
advances in deep learning and large scale annotated data. Despite enormous effort and …

Electromagnetic nanocommunication networks: Principles, applications, and challenges

MH Kabir, SMR Islam, AP Shrestha, F Ali… - IEEE …, 2021 - ieeexplore.ieee.org
Nanoscale devices, also called nanomachines, form communication networks and
cooperate with each other so that they can be used to perform complicated tasks. Such …

Yourefit: Embodied reference understanding with language and gesture

Y Chen, Q Li, D Kong, YL Kei, SC Zhu… - Proceedings of the …, 2021 - openaccess.thecvf.com
We study the machine's understanding of embodied reference: One agent uses both
language and gesture to refer to an object to another agent in a shared physical …

Tidying deep saliency prediction architectures

N Reddy, S Jain, P Yarlagadda… - 2020 IEEE/RSJ …, 2020 - ieeexplore.ieee.org
Learning computational models for visual attention (saliency estimation) is an effort to inch
machines/robots closer to human visual cognitive abilities. Data-driven efforts have …

Eqa-mx: Embodied question answering using multimodal expression

MM Islam, A Gladstone, R Islam… - The Twelfth International …, 2023 - openreview.net
Humans predominantly use verbal utterances and nonverbal gestures (e.g., eye gaze and
pointing gestures) in their natural interactions. For instance, pointing gestures and verbal …

Interpreting multimodal referring expressions in real time

D Whitney, M Eldon, J Oberlin… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Humans communicate about objects using language, gesture, and context, fusing
information from multiple modalities over time. Robots need to interpret this communication …

CAESAR: An embodied simulator for generating multimodal referring expression datasets

MM Islam, R Mirzaiee, A Gladstone… - Advances in Neural …, 2022 - proceedings.neurips.cc
Humans naturally use verbal utterances and nonverbal gestures to refer to various objects
(known as referring expressions) in different interactional scenarios. As collecting …

Joint attention by gaze interpolation and saliency

Z Yücel, AA Salah, Ç Meriçli, T Meriçli… - IEEE Transactions …, 2013 - ieeexplore.ieee.org
Joint attention, the ability to coordinate a common point of reference with the
communicating party, emerges as a key factor in various interaction scenarios. This paper …

Complementary effects of gaze direction and early saliency in guiding fixations during free viewing

A Borji, D Parks, L Itti - Journal of vision, 2014 - jov.arvojournals.org
Gaze direction provides an important and ubiquitous communication channel in daily
behavior and social interaction of humans and some animals. While several studies have …

Attentional mechanisms for socially interactive robots–a survey

JF Ferreira, J Dias - IEEE Transactions on Autonomous Mental …, 2014 - ieeexplore.ieee.org
This review intends to provide an overview of the state of the art in the modeling and
implementation of automatic attentional mechanisms for socially interactive robots. Humans …