Recent advances of monocular 2d and 3d human pose estimation: A deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

Perceiver: General perception with iterative attention

A Jaegle, F Gimeno, A Brock… - International …, 2021 - proceedings.mlr.press
Biological systems understand the world by simultaneously processing high-dimensional
inputs from modalities as diverse as vision, audition, touch, proprioception, etc. The …

Deep high-resolution representation learning for visual recognition

J Wang, K Sun, T Cheng, B Jiang… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
High-resolution representations are essential for position-sensitive vision problems, such as
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …

Deep high-resolution representation learning for human pose estimation

K Sun, B Xiao, D Liu, J Wang - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
In this paper, we are interested in the human pose estimation problem with a focus on
learning reliable high-resolution representations. Most existing methods recover high …

Integral human pose regression

X Sun, B Xiao, F Wei, S Liang… - Proceedings of the …, 2018 - openaccess.thecvf.com
State-of-the-art human pose estimation methods are based on heat map representation. In
spite of the good performance, the representation has a few issues in nature, such as non …

Vnect: Real-time 3d human pose estimation with a single rgb camera

D Mehta, S Sridhar, O Sotnychenko, H Rhodin… - Acm transactions on …, 2017 - dl.acm.org
We present the first real-time method to capture the full global 3D skeletal pose of a human
in a stable, temporally consistent manner using a single RGB camera. Our method combines …

Monocular 3d human pose estimation in the wild using improved cnn supervision

D Mehta, H Rhodin, D Casas, P Fua… - … conference on 3D …, 2017 - ieeexplore.ieee.org
We propose a CNN-based approach for 3D human body pose estimation from single RGB
images that addresses the issue of limited generalizability of models trained solely on the …

Associative embedding: End-to-end learning for joint detection and grouping

A Newell, Z Huang, J Deng - Advances in neural …, 2017 - proceedings.neurips.cc
We introduce associative embedding, a novel method for supervising convolutional neural
networks for the task of detection and grouping. A number of computer vision problems can …

Graph stacked hourglass networks for 3d human pose estimation

T Xu, W Takano - Proceedings of the IEEE/CVF conference …, 2021 - openaccess.thecvf.com
In this paper, we propose a novel graph convolutional network architecture, Graph Stacked
Hourglass Networks, for 2D-to-3D human pose estimation tasks. The proposed architecture …

Multi-context attention for human pose estimation

X Chu, W Yang, W Ouyang, C Ma… - Proceedings of the …, 2017 - openaccess.thecvf.com
In this paper, we propose to incorporate convolutional neural networks with a multi-context
attention mechanism into an end-to-end framework for human pose estimation. We adopt …