Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org
Human pose estimation aims to locate the human body parts and build human body
representation (e.g., body skeleton) from input data such as images and videos. It has drawn …

Deep 3D human pose estimation: A review

J Wang, S Tan, X Zhen, S Xu, F Zheng, Z He… - Computer Vision and …, 2021 - Elsevier
Three-dimensional (3D) human pose estimation involves estimating the articulated
3D joint locations of a human body from an image or video. Due to its widespread …

Magic123: One image to high-quality 3D object generation using both 2D and 3D diffusion priors

G Qian, J Mai, A Hamdi, J Ren, A Siarohin, B Li… - arXiv preprint arXiv …, 2023 - arxiv.org
We present Magic123, a two-stage coarse-to-fine approach for high-quality, textured 3D
meshes generation from a single unposed image in the wild using both 2D and 3D priors. In …

MHFormer: Multi-hypothesis transformer for 3D human pose estimation

W Li, H Liu, H Tang, P Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Estimating 3D human poses from monocular videos is a challenging task due to depth
ambiguity and self-occlusion. Most existing works attempt to solve both issues by exploiting …

MixSTE: Seq2seq mixed spatio-temporal encoder for 3D human pose estimation in video

J Zhang, Z Tu, J Yang, Y Chen… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Recent transformer-based solutions have been introduced to estimate 3D human pose from
2D keypoint sequence by considering body joints among all frames globally to learn spatio …

Revisiting skeleton-based action recognition

H Duan, Y Zhao, K Chen, D Lin… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Human skeleton, as a compact representation of human action, has received increasing
attention in recent years. Many skeleton-based action recognition methods adopt GCNs to …

3D human pose estimation with spatial and temporal transformers

C Zheng, S Zhu, M Mendieta, T Yang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Transformer architectures have become the model of choice in natural language processing
and are now being introduced into computer vision tasks such as image classification, object …

DiffPose: Toward more reliable 3D pose estimation

J Gong, LG Foo, Z Fan, Q Ke… - Proceedings of the …, 2023 - openaccess.thecvf.com
Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity
and occlusion, which often lead to high uncertainty and indeterminacy. On the other hand …

StyleGAN-Human: A data-centric odyssey of human generation

J Fu, S Li, Y Jiang, KY Lin, C Qian, CC Loy… - … on Computer Vision, 2022 - Springer
Unconditional human image generation is an important task in vision and graphics, enabling
various applications in the creative industry. Existing studies in this field mainly focus on …

Diffusion-based 3D human pose estimation with multi-hypothesis aggregation

W Shan, Z Liu, X Zhang, Z Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, a novel Diffusion-based 3D Pose estimation (D3DP) method with Joint-wise
reProjection-based Multi-hypothesis Aggregation (JPMA) is proposed for probabilistic 3D …