Recent advances of monocular 2d and 3d human pose estimation: A deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

Human pose estimation and its application to action recognition: A survey

L Song, G Yu, J Yuan, Z Liu - Journal of Visual Communication and Image …, 2021 - Elsevier
Human pose estimation aims at predicting the poses of human body parts in images or
videos. Since pose motions are often driven by some specific human actions, knowing the …

Alphapose: Whole-body regional multi-person pose estimation and tracking in real-time

HS Fang, J Li, H Tang, C Xu, H Zhu… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Accurate whole-body multi-person pose estimation and tracking is an important yet
challenging topic in computer vision. To capture the subtle actions of humans for complex …

Multi-animal pose estimation, identification and tracking with DeepLabCut

J Lauer, M Zhou, S Ye, W Menegas, S Schneider… - Nature …, 2022 - nature.com
Estimating the pose of multiple animals is a challenging computer vision problem: frequent
interactions cause occlusions and complicate the association of detected keypoints to the …

Vibe: Video inference for human body pose and shape estimation

M Kocabas, N Athanasiou… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Human motion is fundamental to understanding behavior. Despite progress on single-image
3D pose and shape estimation, existing video-based state-of-the-art methods fail to produce …

Deep high-resolution representation learning for human pose estimation

K Sun, B Xiao, D Liu, J Wang - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
In this paper, we are interested in the human pose estimation problem with a focus on
learning reliable high-resolution representations. Most existing methods recover high …

Video action transformer network

R Girdhar, J Carreira, C Doersch… - Proceedings of the …, 2019 - openaccess.thecvf.com
Abstract We introduce the Action Transformer model for recognizing and localizing human
actions in video clips. We repurpose a Transformer-style architecture to aggregate features …

Simple baselines for human pose estimation and tracking

B Xiao, H Wu, Y Wei - Proceedings of the European …, 2018 - openaccess.thecvf.com
There has been significant progress on pose estimation and increasing interests on pose
tracking in recent years. At the same time, the overall algorithm and system complexity …

Learning 3d human dynamics from video

A Kanazawa, JY Zhang, P Felsen… - Proceedings of the …, 2019 - openaccess.thecvf.com
From an image of a person in action, we can easily guess the 3D motion of the person in the
immediate past and future. This is because we have a mental model of 3D human dynamics …

RGB-D salient object detection via 3D convolutional neural networks

Q Chen, Z Liu, Y Zhang, K Fu, Q Zhao… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
RGB-D salient object detection (SOD) recently has attracted increasing research interest
and many deep learning methods based on encoder-decoder architectures have emerged …