3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

Planning-oriented autonomous driving

Y Hu, J Yang, L Chen, K Li, C Sima… - Proceedings of the …, 2023 - openaccess.thecvf.com
Modern autonomous driving system is characterized as modular tasks in sequential order,
i.e., perception, prediction, and planning. In order to perform a wide diversity of tasks and …

TransFuser: Imitation with transformer-based sensor fusion for autonomous driving

K Chitta, A Prakash, B Jaeger, Z Yu… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
How should we integrate representations from complementary sensors for autonomous
driving? Geometry-based fusion has shown promise for perception (e.g., object detection …

Multi-modal fusion transformer for end-to-end autonomous driving

A Prakash, K Chitta, A Geiger - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
How should representations from complementary sensors be integrated for autonomous
driving? Geometry-based sensor fusion has shown great promise for perception tasks such …

BEVerse: Unified perception and prediction in birds-eye-view for vision-centric autonomous driving

Y Zhang, Z Zhu, W Zheng, J Huang, G Huang… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we present BEVerse, a unified framework for 3D perception and prediction
based on multi-camera systems. Unlike existing studies focusing on the improvement of …

Learning lane graph representations for motion forecasting

M Liang, B Yang, R Hu, Y Chen, R Liao, S Feng… - Computer Vision–ECCV …, 2020 - Springer
We propose a motion forecasting model that exploits a novel structured map representation
as well as actor-map interactions. Instead of encoding vectorized maps as raster images, we …

ReasonNet: End-to-end driving with temporal and global reasoning

H Shao, L Wang, R Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
The large-scale deployment of autonomous vehicles is yet to come, and one of the major
remaining challenges lies in urban dense traffic scenarios. In such cases, it remains …

ViP3D: End-to-end visual trajectory prediction via 3D agent queries

J Gu, C Hu, T Zhang, X Chen, Y Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Perception and prediction are two separate modules in the existing autonomous driving
systems. They interact with each other via hand-picked features such as agent bounding …

MP3: A unified model to map, perceive, predict and plan

S Casas, A Sadat, R Urtasun - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
High-definition maps (HD maps) are a key component of most modern self-driving systems
due to their valuable semantic and geometric information. Unfortunately, building HD maps …

Standing between past and future: Spatio-temporal modeling for multi-camera 3D multi-object tracking

Z Pang, J Li, P Tokmakov, D Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
This work proposes an end-to-end multi-camera 3D multi-object tracking (MOT) framework. It
emphasizes spatio-temporal continuity and integrates both past and future reasoning for …