A survey of robot learning strategies for human-robot collaboration in industrial settings

D Mukherjee, K Gupta, LH Chang, H Najjaran - Robotics and Computer …, 2022 - Elsevier
Increased global competition has placed a premium on customer satisfaction, and there is a
greater demand for manufacturers to be flexible with their products and services. This …

Sign language recognition: A deep survey

R Rastgoo, K Kiani, S Escalera - Expert Systems with Applications, 2021 - Elsevier
Sign language, as a different form of the communication language, is important to large
groups of people in society. There are different signs in each sign language with variability …

[PDF][PDF] 图像理解中的卷积神经网络

常亮, 邓小明, 周明全, 武仲科, 袁野, 杨硕, 王宏安 - 自动化学报, 2016 - faculty.csu.edu.cn
摘要近年来, 卷积神经网络(Convolutional neural networks, CNN) 已在图像理解领域得到了
广泛的应用, 引起了研究者的关注. 特别是随着大规模图像数据的产生以及计算机硬件(特别是 …

Tapir: Tracking any point with per-frame initialization and temporal refinement

C Doersch, Y Yang, M Vecerik… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried
point on any physical surface throughout a video sequence. Our approach employs two …

Tap-vid: A benchmark for tracking any point in a video

C Doersch, A Gupta, L Markeeva… - Advances in …, 2022 - proceedings.neurips.cc
Generic motion understanding from video involves not only tracking objects, but also
perceiving how their surfaces deform and move. This information is useful to make …

Interhand2. 6m: A dataset and baseline for 3d interacting hand pose estimation from a single rgb image

G Moon, SI Yu, H Wen, T Shiratori, KM Lee - Computer Vision–ECCV 2020 …, 2020 - Springer
Abstract Analysis of hand-hand interactions is a crucial step towards better understanding
human behavior. However, most researches in 3D hand pose estimation have focused on …

A review of single-source deep unsupervised visual domain adaptation

S Zhao, X Yue, S Zhang, B Li, H Zhao… - … on Neural Networks …, 2020 - ieeexplore.ieee.org
Large-scale labeled training datasets have enabled deep neural networks to excel across a
wide range of benchmark vision tasks. However, in many applications, it is prohibitively …

Virtual-reality interpromotion technology for metaverse: A survey

D Wu, Z Yang, P Zhang, R Wang… - IEEE Internet of Things …, 2023 - ieeexplore.ieee.org
The metaverse aims to build an immersive virtual reality world to support the daily life, work,
and recreation of people. In this survey, the status quo of the metaverse is investigated, and …

Whole-body human pose estimation in the wild

S Jin, L Xu, J Xu, C Wang, W Liu, C Qian… - Computer Vision–ECCV …, 2020 - Springer
This paper investigates the task of 2D human whole-body pose estimation, which aims to
localize dense landmarks on the entire human body including face, hands, body, and feet …

Learning joint reconstruction of hands and manipulated objects

Y Hasson, G Varol, D Tzionas… - Proceedings of the …, 2019 - openaccess.thecvf.com
Estimating hand-object manipulations is essential for in-terpreting and imitating human
actions. Previous work has made significant progress towards reconstruction of hand poses …