Ego4D: Around the world in 3,000 hours of egocentric video

K Grauman, A Westbury, E Byrne… - Proceedings of the …, 2022 - openaccess.thecvf.com
We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It
offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household …

Decoupling human and camera motion from videos in the wild

V Ye, G Pavlakos, J Malik… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a method to reconstruct global human trajectories from videos in the wild. Our
optimization method decouples the camera and human motion, which allows us to place …

InterGen: Diffusion-based multi-human motion generation under complex interactions

H Liang, W Zhang, W Li, J Yu, L Xu - International Journal of Computer …, 2024 - Springer
We have recently seen tremendous progress in diffusion-based generation of realistic
human motions. Yet, it largely disregards multi-human interactions. In this paper, we …

Ego-body pose estimation via ego-head pose estimation

J Li, K Liu, J Wu - Proceedings of the IEEE/CVF Conference …, 2023 - openaccess.thecvf.com
Estimating 3D human motion from an egocentric video sequence plays a critical role in
human behavior understanding and has various applications in VR/AR. However, naively …

IMUPoser: Full-body pose estimation using IMUs in phones, watches, and earbuds

V Mollyn, R Arakawa, M Goel, C Harrison… - Proceedings of the 2023 …, 2023 - dl.acm.org
Tracking body pose on-the-go could have powerful uses in fitness, mobile gaming, context-
aware virtual assistants, and rehabilitation. However, users are unlikely to buy and wear …

GIMO: Gaze-informed human motion prediction in context

Y Zheng, Y Yang, K Mo, J Li, T Yu, Y Liu, CK Liu… - … on Computer Vision, 2022 - Springer
Predicting human motion is critical for assistive robots and AR/VR applications, where the
interaction with humans needs to be safe and comfortable. Meanwhile, an accurate …

EgoBody: Human body shape and motion of interacting people from head-mounted devices

S Zhang, Q Ma, Y Zhang, Z Qian, T Kwon… - European conference on …, 2022 - Springer
Understanding social interactions from egocentric views is crucial for many applications,
ranging from assistive robotics to AR/VR. Key to reasoning about interactions is to …

Learning state-aware visual representations from audible interactions

H Mittal, P Morgado, U Jain… - Advances in Neural …, 2022 - proceedings.neurips.cc
We propose a self-supervised algorithm to learn representations from egocentric video data.
Recently, significant efforts have been made to capture humans interacting with their own …

My view is the best view: Procedure learning from egocentric videos

S Bansal, C Arora, CV Jawahar - European Conference on Computer …, 2022 - Springer
Procedure learning involves identifying the key-steps and determining their logical order to
perform a task. Existing approaches commonly use third-person videos for learning the …

EgoTaskQA: Understanding human tasks in egocentric videos

B Jia, T Lei, SC Zhu, S Huang - Advances in Neural …, 2022 - proceedings.neurips.cc
Understanding human tasks through video observations is an essential capability of
intelligent agents. The challenges of such capability lie in the difficulty of generating a …