Anticipating human activities using object affordances for reactive robotic response

N Gupta, SK Gupta, RK Pathak, V Jain… - Artificial intelligence …, 2022 - Springer

Human activity recognition (HAR) has multifaceted applications due to its worldly usage of
acquisition devices such as smartphones, video cameras, and its ability to capture human …

被引用次数：188 相关文章所有 10 个版本

[PDF] academia.edu

A survey of robot learning strategies for human-robot collaboration in industrial settings

D Mukherjee, K Gupta, LH Chang, H Najjaran - Robotics and Computer …, 2022 - Elsevier

Increased global competition has placed a premium on customer satisfaction, and there is a
greater demand for manufacturers to be flexible with their products and services. This …

被引用次数：193 相关文章所有 4 个版本

[PDF] thecvf.com

Ego4d: Around the world in 3,000 hours of egocentric video

K Grauman, A Westbury, E Byrne… - Proceedings of the …, 2022 - openaccess.thecvf.com

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It
offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household …

被引用次数：665 相关文章所有 13 个版本

[PDF] thecvf.com

Affordances from human videos as a versatile representation for robotics

S Bahl, R Mendonca, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com

Building a robot that can understand and learn to interact by watching humans has inspired
several vision problems. However, despite some successful results on static datasets, it …

被引用次数：64 相关文章所有 9 个版本

[PDF] thecvf.com

Anticipative video transformer

R Girdhar, K Grauman - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Abstract We propose Anticipative Video Transformer (AVT), an end-to-end attention-based
video modeling architecture that attends to the previously observed video in order to …

被引用次数：202 相关文章所有 6 个版本

Vision-based holistic scene understanding towards proactive human–robot collaboration

J Fan, P Zheng, S Li - Robotics and Computer-Integrated Manufacturing, 2022 - Elsevier

Recently human–robot collaboration (HRC) has emerged as a promising paradigm for mass
personalization in manufacturing owing to the potential to fully exploit the strength of human …

被引用次数：119 相关文章所有 3 个版本

[PDF] springer.com

Rescaling egocentric vision: Collection, pipeline and challenges for epic-kitchens-100

D Damen, H Doughty, GM Farinella, A Furnari… - International Journal of …, 2022 - Springer

This paper introduces the pipeline to extend the largest dataset in egocentric vision, EPIC-
KITCHENS. The effort culminates in EPIC-KITCHENS-100, a collection of 100 hours, 20M …

被引用次数：415 相关文章所有 13 个版本

[PDF] thecvf.com

Vln bert: A recurrent vision-and-language bert for navigation

Y Hong, Q Wu, Y Qi… - Proceedings of the …, 2021 - openaccess.thecvf.com

Accuracy of many visiolinguistic tasks has benefited significantly from the application of
vision-and-language (V&L) BERT. However, its application for the task of vision-and …

被引用次数：233 相关文章所有 5 个版本

[PDF] thecvf.com

Dynamic multiscale graph neural networks for 3d skeleton based human motion prediction

M Li, S Chen, Y Zhao, Y Zhang… - Proceedings of the …, 2020 - openaccess.thecvf.com

We propose novel dynamic multiscale graph neural networks (DMGNN) to predict 3D
skeleton-based human motions. The core idea of DMGNN is to use a multiscale graph to …

被引用次数：323 相关文章所有 10 个版本

[PDF] arxiv.org

Human action recognition and prediction: A survey

Y Kong, Y Fu - International Journal of Computer Vision, 2022 - Springer

Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …

被引用次数：652 相关文章所有 6 个版本

高级搜索

QQ 群