Recent advances of monocular 2d and 3d human pose estimation: a deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

A review of deep learning techniques for 2D and 3D human pose estimation

MB Gamra, MA Akhloufi - Image and Vision Computing, 2021 - Elsevier
Inferring human pose from a monocular RGB image remains an interesting field of research
in computer vision. It serves as a fundamental key for many real-world applications …

Conditional detr for fast training convergence

D Meng, X Chen, Z Fan, G Zeng, H Li… - Proceedings of the …, 2021 - openaccess.thecvf.com
The recently-developed DETR approach applies the transformer encoder and decoder
architecture to object detection and achieves promising performance. In this paper, we …

Revealing the dark secrets of masked image modeling

Z Xie, Z Geng, J Hu, Z Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Masked image modeling (MIM) as pre-training is shown to be effective for numerous vision
downstream tasks, but how and where MIM works remain unclear. In this paper, we compare …

Yolo-pose: Enhancing yolo for multi person pose estimation using object keypoint similarity loss

D Maji, S Nagori, M Mathew… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We introduce YOLO-pose, a novel heatmap-free approach for joint detection, and 2D multi-
person pose estimation in an image based on the popular YOLO object detection …

End-to-end multi-person pose estimation with transformers

D Shi, X Wei, L Li, Y Ren, W Tan - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Current methods of multi-person pose estimation typically treat the localization and
association of body joints separately. In this paper, we propose the first fully end-to-end multi …

Lite pose: Efficient architecture design for 2d human pose estimation

Y Wang, M Li, H Cai, WM Chen… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Pose estimation plays a critical role in human-centered vision applications. However, it is
difficult to deploy state-of-the-art HRNet-based pose estimation models on resource …

Human pose as compositional tokens

Z Geng, C Wang, Y Wei, Z Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human pose is typically represented by a coordinate vector of body joints or their heatmap
embeddings. While easy for data processing, unrealistic pose estimates are admitted due to …

Single-stage is enough: Multi-person absolute 3D pose estimation

L Jin, C Xu, X Wang, Y Xiao, Y Guo… - Proceedings of the …, 2022 - openaccess.thecvf.com
The existing multi-person absolute 3D pose estimation methods are mainly based on two-
stage paradigm, ie, top-down or bottom-up, leading to redundant pipelines with high …

Rethinking keypoint representations: Modeling keypoints and poses as objects for multi-person human pose estimation

W McNally, K Vats, A Wong, J McPhee - European Conference on …, 2022 - Springer
In keypoint estimation tasks such as human pose estimation, heatmap-based regression is
the dominant approach despite possessing notable drawbacks: heatmaps intrinsically suffer …