An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Challenges and solutions for vision-based hand gesture interpretation: A review

K Gao, H Zhang, X Liu, X Wang, L Xie, B Ji… - Computer Vision and …, 2024 - Elsevier
Hand gesture is one of the most efficient and natural interfaces in current human–computer
interaction (HCI) systems. Despite the great progress achieved in hand gesture-based HCI …

Transformer-based unified recognition of two hands manipulating objects

H Cho, C Kim, J Kim, S Lee… - Proceedings of the …, 2023 - openaccess.thecvf.com
Understanding hand-object interactions from egocentric video has received great
attention recently. So far, most approaches are based on the convolutional neural network …

On the utility of 3D hand poses for action recognition

MS Shamil, D Chatterjee, F Sener, S Ma… - European Conference on …, 2025 - Springer
3D hand pose is an underexplored modality for action recognition. Poses are
compact yet informative and can greatly benefit applications with limited compute budgets …

A survey of deep learning methods and datasets for hand pose estimation from hand-object interaction images

T Woo, W Park, W Jeong, J Park - Computers & Graphics, 2023 - Elsevier
The research topic of estimating hand pose from the images of hand-object interaction has
the potential for replicating natural hand behavior in many practical applications of virtual …

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions

H Xu, H Li, Y Wang, S Liu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Reconstructing 3D hand mesh robustly from a single image is very challenging due to the
lack of diversity in existing real-world datasets. While data synthesis helps relieve the issue …

EAGLE: Egocentric AGgregated Language-video Engine

J Bi, Y Tang, L Song, A Vosoughi, N Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid evolution of egocentric video analysis brings new insights into understanding
human activities and intentions from a first-person perspective. Despite this progress, the …

Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation

R Liu, T Ohkawa, M Zhang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
The pursuit of accurate 3D hand pose estimation stands as a keystone for understanding
human activity in the realm of egocentric vision. The majority of existing estimation methods …

SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-View Adaptation

Y Wang, H Xu, PA Heng, CW Fu - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Estimating 3D hand mesh from RGB images is a longstanding track, in which occlusion is
one of the most challenging problems. Existing attempts towards this task often fail when the …

Progressively global–local fusion with explicit guidance for accurate and robust 3D hand pose reconstruction

K Gao, X Liu, P Ren, H Chen, T Zhen, L Xie, Z Li… - Knowledge-Based …, 2024 - Elsevier
Parametric and non-parametric methods are two commonly used strategies in current 3D
hand pose reconstruction. Parametric methods predict low-dimensional parameters to fit a …