A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation

Q Peng, C Zheng, C Chen - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Abstract 3D human pose data collected in controlled laboratory settings present challenges
for pose estimators that generalize across diverse scenarios. To address this domain …

Imagpose: A unified conditional framework for pose-guided person generation

F Shen, J Tang - The Thirty-eighth Annual Conference on Neural …, 2024 - openreview.net
Diffusion models represent a promising avenue for image generation, having demonstrated
competitive performance in pose-guided person image generation. However, existing …

GraphMLP: A graph MLP-like architecture for 3D human pose estimation

W Li, M Liu, H Liu, T Guo, T Wang, H Tang, N Sebe - Pattern Recognition, 2025 - Elsevier
Modern multi-layer perceptron (MLP) models have shown competitive results in learning
visual representations without self-attention. However, existing MLP models are not good at …

DBMHT: A double-branch multi-hypothesis transformer for 3D human pose estimation in video

X Xiang, X Li, W Bao, Y Qiao, A El Saddik - Computer Vision and Image …, 2024 - Elsevier
The estimation of 3D human poses from monocular videos presents a significant challenge.
The existing methods face the problems of deep ambiguity and self-occlusion. To overcome …

Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation

W Li, M Liu, H Liu, P Wang, J Cai… - Proceedings of the …, 2024 - openaccess.thecvf.com
Transformers have been successfully applied in the field of video-based 3D human pose
estimation. However the high computational costs of these video pose transformers (VPTs) …

RT-Pose: A 4D Radar Tensor-Based 3D Human Pose Estimation and Localization Benchmark

YH Ho, JH Cheng, SY Kuan, Z Jiang, W Chai… - … on Computer Vision, 2025 - Springer
Traditional methods for human localization and pose estimation (HPE), which mainly rely on
RGB images as an input modality, confront substantial limitations in real-world applications …

Refined temporal pyramidal compression-and-amplification transformer for 3D human pose estimation

H Liu, W Xiang, JY He, ZQ Cheng, B Luo… - arXiv preprint arXiv …, 2023 - arxiv.org
Accurately estimating the 3D pose of humans in video sequences requires both accuracy
and a well-structured architecture. With the success of transformers, we introduce the …

Exploring Learning-based Motion Models in Multi-Object Tracking

HW Huang, CY Yang, W Chai, Z Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
In the field of multi-object tracking (MOT), traditional methods often rely on the Kalman Filter
for motion prediction, leveraging its strengths in linear motion scenarios. However, the …

Towards Practical Human Motion Prediction with LiDAR Point Clouds

X Han, Y Ren, Y Yao, Y Sun, Y Ma - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Human motion prediction is crucial for human-centric multimedia understanding and
interacting. Current methods typically rely on ground truth human poses as observed input …

Person in Place: Generating Associative Skeleton-Guidance Maps for Human-Object Interaction Image Editing

CH Yang, CH Kang, K Kong, H Oh… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently there were remarkable advances in image editing tasks in various ways.
Nevertheless existing image editing models are not designed for Human-Object Interaction …