Recovering 3D human mesh from monocular images: A survey

Y Tian, H Zhang, Y Liu, L Wang - IEEE transactions on pattern …, 2023 - ieeexplore.ieee.org
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …

A review on 2D instance segmentation based on deep neural networks

W Gu, S Bai, L Kong - Image and Vision Computing, 2022 - Elsevier
Image instance segmentation involves labeling pixels of images with classes and instances,
which is one of the pivotal technologies in many domains, such as natural scenes …

ViTPose: Simple vision transformer baselines for human pose estimation

Y Xu, J Zhang, Q Zhang, D Tao - Advances in Neural …, 2022 - proceedings.neurips.cc
Although no specific domain knowledge is considered in the design, plain vision
transformers have shown excellent performance in visual recognition tasks. However, little …

Putting people in their place: Monocular regression of 3D people in depth

Y Sun, W Liu, Q Bao, Y Fu, T Mei… - Proceedings of the …, 2022 - openaccess.thecvf.com
Given an image with multiple people, our goal is to directly regress the pose and shape of all
the people as well as their relative depth. Inferring the depth of a person in an image …

SMPLer-X: Scaling up expressive human pose and shape estimation

Z Cai, W Yin, A Zeng, C Wei, Q Sun… - Advances in …, 2024 - proceedings.neurips.cc
Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion
capture with numerous applications. Despite encouraging progress, current state-of-the-art …

Human pose as compositional tokens

Z Geng, C Wang, Y Wei, Z Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human pose is typically represented by a coordinate vector of body joints or their heatmap
embeddings. While easy for data processing, unrealistic pose estimates are admitted due to …

Real-time high-resolution background matting

S Lin, A Ryabtsev, S Sengupta… - Proceedings of the …, 2021 - openaccess.thecvf.com
We introduce a real-time, high-resolution background replacement technique which
operates at 30fps in 4K resolution, and 60fps for HD on a modern GPU. Our technique is …

AGORA: Avatars in geography optimized for regression analysis

P Patel, CHP Huang, J Tesch… - Proceedings of the …, 2021 - openaccess.thecvf.com
While the accuracy of 3D human pose estimation from images has steadily improved on
benchmark datasets, the best methods still fail in many real-world scenarios. This suggests …

UniHCP: A unified model for human-centric perceptions

Y Ci, Y Wang, M Chen, S Tang, L Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human-centric perceptions (e.g., pose estimation, human parsing, pedestrian detection,
person re-identification, etc.) play a key role in industrial applications of visual models. While …

Exemplar fine-tuning for 3D human model fitting towards in-the-wild 3D human pose estimation

H Joo, N Neverova, A Vedaldi - 2021 International Conference …, 2021 - ieeexplore.ieee.org
Differently from 2D image datasets such as COCO, large-scale human datasets with 3D
ground-truth annotations are very difficult to obtain in the wild. In this paper, we address this …