Posynda: Multi-hypothesis pose synthesis domain adaptation for robust 3d human pose estimation

H Liu, JY He, ZQ Cheng, W Xiang, Q Yang… - Proceedings of the 31st …, 2023 - dl.acm.org
The current 3D human pose estimators face challenges in adapting to new datasets due to
the scarcity of 2D-3D pose pairs in target domain training sets. We present the Multi …

A comprehensive review of 3d object detection in autonomous driving: Technological advances and future directions

Y Wang, S Wang, Y Li, M Liu - arXiv preprint arXiv:2408.16530, 2024 - arxiv.org
In recent years, 3D object perception has become a crucial component in the development
of autonomous driving systems, providing essential environmental awareness. However, as …

Procontext: Exploring progressive context transformer for tracking

JP Lan, ZQ Cheng, JY He, C Li, B Luo… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Existing Visual Object Tracking (VOT) only takes the target area in the first frame as a
template. This causes tracking to inevitably fail in fast-changing and crowded scenes, as it …

Improving anomaly segmentation with multi-granularity cross-domain alignment

J Zhang, X Wu, ZQ Cheng, Q He, W Li - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Anomaly segmentation plays a crucial role in identifying anomalous objects within images,
which facilitates the detection of road anomalies for autonomous driving. Although existing …

Damo-streamnet: Optimizing streaming perception in autonomous driving

JY He, ZQ Cheng, C Li, W Xiang, B Chen, B Luo… - arXiv preprint arXiv …, 2023 - arxiv.org
Real-time perception, or streaming perception, is a crucial aspect of autonomous driving that
has yet to be thoroughly explored in existing research. To address this gap, we present …

Future Feature-based Supervised Contrastive Learning for Streaming Perception

T Wang, H Huang - IEEE Transactions on Circuits and Systems …, 2024 - ieeexplore.ieee.org
Streaming perception, a critical task in computer vision, involves the real-time prediction of
object locations within video sequences based on prior frames. While current methods like …

Fast Fourier inception networks for occluded video prediction

P Li, C Zhang, X Xu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Video prediction is a pixel-level task that generates future frames by employing the historical
frames. There often exist continuous complex motions, such as object overlapping and …

[HTML][HTML] VN-MADDPG: A variable-noise-based multi-agent reinforcement learning algorithm for autonomous vehicles at unsignalized intersections

H Zhang, Y Du, S Zhao, Y Yuan, Q Gao - Electronics, 2024 - mdpi.com
The decision-making performance of autonomous vehicles tends to be unstable at
unsignalized intersections, making it difficult for them to make optimal decisions. We …

KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration

X Bao, ZQ Cheng, JY He, W Xiang, C Li, J Sun… - Proceedings of the 31st …, 2023 - dl.acm.org
In the realm of facial analysis, accurate landmark detection is crucial for various applications,
ranging from face recognition and expression analysis to animation. Conventional heatmap …

StreamTrack: real-time meta-detector for streaming perception in full-speed domain driving scenarios

W Ge, X Wang, Z Mao, J Ren, J Shen - Applied Intelligence, 2024 - Springer
Streaming perception is a crucial task in the field of autonomous driving, which aims to
eliminate the inconsistency between the perception results and the real environment due to …