An enhanced real-time human pose estimation method based on modified YOLOv8 framework

C Dong, G Du - Scientific Reports, 2024 - nature.com
The objective of human pose estimation (HPE) derived from deep learning aims to
accurately estimate and predict the human body posture in images or videos via the …

Unipose: Detecting any keypoints

J Yang, A Zeng, R Zhang, L Zhang - arXiv preprint arXiv:2310.08530, 2023 - arxiv.org
This work proposes a unified framework called UniPose to detect keypoints of any
articulated (eg, human and animal), rigid, and soft objects via visual or textual prompts for …

Capturing Closely Interacted Two-Person Motions with Reaction Priors

Q Fang, Y Fan, Y Li, J Dong, D Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper we focus on capturing closely interacted two-person motions from monocular
videos an important yet understudied topic. Unlike less-interacted motions closely interacted …

Human pose-based estimation, tracking and action recognition with deep learning: A survey

L Zhou, X Meng, Z Liu, M Wu, Z Gao… - arXiv preprint arXiv …, 2023 - arxiv.org
Human pose analysis has garnered significant attention within both the research community
and practical applications, owing to its expanding array of uses, including gaming, video …

Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning

J Jeong, D Park, KJ Yoon - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Human pose forecasting garners attention for its diverse applications. However challenges
in modeling the multi-modal nature of human motion and intricate interactions among agents …

DiffusionRegPose: Enhancing Multi-Person Pose Estimation using a Diffusion-Based End-to-End Regression Approach

D Tan, H Chen, W Tian, L Xiong - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
This paper presents the DiffusionRegPose a novel approach to multi-person pose
estimation that converts a one-stage end-to-end keypoint regression model into a diffusion …

DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation

Y Luo, S Cui, Z Li - arXiv preprint arXiv:2406.16072, 2024 - arxiv.org
Accurate 3D lane estimation is crucial for ensuring safety in autonomous driving. However,
prevailing monocular techniques suffer from depth loss and lighting variations, hampering …

Efficient Sampling of Two-Stage Multi-Person Pose Estimation and Tracking from Spatiotemporal

S Lin, W Hou - Applied Sciences, 2024 - mdpi.com
Tracking the articulated poses of multiple individuals in complex videos is a highly
challenging task due to a variety of factors that compromise the accuracy of estimation and …

DHRNet: A Dual-path Hierarchical Relation Network for multi-person pose estimation

Y Dang, J Yin, L Liu, P Ding, Y Sun, Y Hu - Knowledge-Based Systems, 2024 - Elsevier
Multi-person pose estimation (MPPE) presents a challenging yet crucial task in computer
vision. Most existing methods predominantly concentrate on isolated interaction either …

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

J Wu, M Zhong, S Xing, Z Lai, Z Liu, W Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
We present VisionLLM v2, an end-to-end generalist multimodal large model (MLLM) that
unifies visual perception, understanding, and generation within a single framework. Unlike …