Human motion generation: A survey

W Zhu, X Ma, D Ro, H Ci, J Zhang, J Shi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Human motion generation aims to generate natural human pose sequences and shows
immense potential for real-world applications. Substantial progress has been made recently …

A survey on 3d skeleton-based action recognition using learning method

B Ren, M Liu, R Ding, H Liu - Cyborg and Bionic Systems, 2024 - spj.science.org
Three-dimensional skeleton-based action recognition (3D SAR) has gained important
attention within the computer vision community, owing to the inherent advantages offered by …

A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation

Q Peng, C Zheng, C Chen - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Abstract 3D human pose data collected in controlled laboratory settings present challenges
for pose estimators that generalize across diverse scenarios. To address this domain …

Wham: Reconstructing world-grounded humans with accurate 3d motion

S Shin, J Kim, E Halilaj… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
The estimation of 3D human motion from video has progressed rapidly but current methods
still have several key limitations. First most methods estimate the human in camera …

3d human pose perception from egocentric stereo videos

H Akada, J Wang, V Golyanik… - Proceedings of the …, 2024 - openaccess.thecvf.com
While head-mounted devices are becoming more compact they provide egocentric views
with significant self-occlusions of the device user. Hence existing methods often fail to …

Skeleton-in-context: Unified skeleton sequence modeling with in-context learning

X Wang, Z Fang, X Li, X Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
In-context learning provides a new perspective for multi-task modeling for vision and NLP.
Under this setting the model can perceive tasks from prompts and accomplish them without …

Motionagformer: Enhancing 3d human pose estimation with a transformer-gcnformer network

S Mehraban, V Adeli, B Taati - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Recent transformer-based approaches have demonstrated excellent performance in 3D
human pose estimation. However, they have a holistic view and by encoding global …

AvatarGPT: All-in-One Framework for Motion Understanding Planning Generation and Beyond

Z Zhou, Y Wan, B Wang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Abstract Large Language Models (LLMs) have shown remarkable emergent abilities in
unifying almost all (if not every) NLP tasks. In the human motion-related realm however …

Llms are good action recognizers

H Qu, Y Cai, J Liu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Skeleton-based action recognition has attracted lots of research attention. Recently to build
an accurate skeleton-based action recognizer a variety of works have been proposed …

Crossglg: Llm guides one-shot skeleton-based 3d action recognition in a cross-level manner

T Yan, W Zeng, Y Xiao, X Tong, B Tan, Z Fang… - … on Computer Vision, 2024 - Springer
Most existing one-shot skeleton-based action recognition focuses on raw low-level
information (eg., joint location), and may suffer from local information loss and low …