3D human pose estimation: A review of the literature and analysis of covariates

N Sarafianos, B Boteanu, B Ionescu… - Computer Vision and …, 2016 - Elsevier
Estimating the pose of a human in 3D given an image or a video has recently received
significant attention from the scientific community. The main reasons for this trend are the …

TEMOS: Generating Diverse Human Motions from Textual Descriptions

M Petrovich, MJ Black, G Varol - European Conference on Computer …, 2022 - Springer
We address the problem of generating diverse 3D human motions from textual descriptions.
This challenging task requires joint modeling of both modalities: understanding and …

Human pose estimation from monocular images: A comprehensive survey

W Gong, X Zhang, J Gonzàlez, A Sobral, T Bouwmans… - Sensors, 2016 - mdpi.com
Human pose estimation refers to the estimation of the location of body parts and how they
are connected in an image. Human pose estimation from monocular images has wide …

MixSTE: Seq2seq mixed spatio-temporal encoder for 3D human pose estimation in video

J Zhang, Z Tu, J Yang, Y Chen… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Recent transformer-based solutions have been introduced to estimate 3D human pose from
2D keypoint sequence by considering body joints among all frames globally to learn spatio …

Action-conditioned 3D human motion synthesis with transformer VAE

M Petrovich, MJ Black, G Varol - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We tackle the problem of action-conditioned generation of realistic and diverse human
motion sequences. In contrast to methods that complete, or extend, motion sequences, this …

Balanced MSE for imbalanced visual regression

J Ren, M Zhang, C Yu, Z Liu - Proceedings of the IEEE/CVF …, 2022 - openaccess.thecvf.com
Data imbalance exists ubiquitously in real-world visual regressions, e.g., age estimation and
pose estimation, hurting the model's generalizability and fairness. Thus, imbalanced …

3D human pose estimation in video with temporal convolutions and semi-supervised training

D Pavllo, C Feichtenhofer… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this work, we demonstrate that 3D poses in video can be effectively estimated with a fully
convolutional model based on dilated temporal convolutions over 2D keypoints. We also …

3D human pose estimation with spatio-temporal criss-cross attention

Z Tang, Z Qiu, Y Hao, R Hong… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recent transformer-based solutions have shown great success in 3D human pose
estimation. Nevertheless, to calculate the joint-to-joint affinity matrix, the computational cost …

Semantic graph convolutional networks for 3D human pose regression

L Zhao, X Peng, Y Tian, M Kapadia… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this paper, we study the problem of learning Graph Convolutional Networks (GCNs) for
regression. Current architectures of GCNs are limited to the small receptive field of …

ScanNet: Richly-annotated 3D reconstructions of indoor scenes

A Dai, AX Chang, M Savva, M Halber… - Proceedings of the …, 2017 - openaccess.thecvf.com
A key requirement for leveraging supervised deep learning methods is the availability of
large, labeled datasets. Unfortunately, in the context of RGB-D scene understanding, very …