Tri-perspective view for vision-based 3D semantic occupancy prediction

Y Huang, W Zheng, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Modern methods for vision-centric autonomous driving perception widely adopt the
bird's-eye-view (BEV) representation to describe a 3D scene. Despite its better efficiency than …

VoxFormer: Sparse voxel transformer for camera-based 3D semantic scene completion

Y Li, Z Yu, C Choy, C Xiao, JM Alvarez… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans can easily imagine the complete 3D geometry of occluded objects and scenes. This
appealing ability is vital for recognition and understanding. To enable such capability in AI …

Occ3D: A large-scale 3D occupancy prediction benchmark for autonomous driving

X Tian, T Jiang, L Yun, Y Mao, H Yang… - Advances in …, 2024 - proceedings.neurips.cc
Robotic perception requires the modeling of both 3D geometry and semantics. Existing
methods typically focus on estimating 3D bounding boxes, neglecting finer geometric details …

SurroundOcc: Multi-camera 3D occupancy prediction for autonomous driving

Y Wei, L Zhao, W Zheng, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
3D scene understanding plays a vital role in vision-based autonomous driving.
While most existing methods focus on 3D object detection, they have difficulty describing …

OpenOccupancy: A large scale benchmark for surrounding semantic occupancy perception

X Wang, Z Zhu, W Xu, Y Zhang, Y Wei… - Proceedings of the …, 2023 - openaccess.thecvf.com
Semantic occupancy perception is essential for autonomous driving, as automated vehicles
require a fine-grained perception of the 3D urban structures. However, existing relevant …

OccFormer: Dual-path transformer for vision-based 3D semantic occupancy prediction

Y Zhang, Z Zhu, D Du - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
The vision-based perception for autonomous driving has undergone a transformation from
the bird-eye-view (BEV) representations to the 3D semantic occupancy. Compared with the …

Grid-centric traffic scenario perception for autonomous driving: A comprehensive review

Y Shi, K Jiang, J Li, Z Qian, J Wen, M Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Grid-centric perception is a crucial field for mobile robot perception and navigation.
Nonetheless, grid-centric perception is less prevalent than object-centric perception as …

PoinTr: Diverse point cloud completion with geometry-aware transformers

X Yu, Y Rao, Z Wang, Z Liu, J Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Point clouds captured in real-world applications are often incomplete due to the limited
sensor resolution, single viewpoint, and occlusion. Therefore, recovering the complete point …

OccWorld: Learning a 3D occupancy world model for autonomous driving

W Zheng, W Chen, Y Huang, B Zhang, Y Duan… - European Conference on …, 2025 - Springer
Understanding how the 3D scene evolves is vital for making decisions in autonomous
driving. Most existing methods achieve this by predicting the movements of object boxes …

MonoScene: Monocular 3D semantic scene completion

AQ Cao, R De Charette - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where the dense
geometry and semantics of a scene are inferred from a single monocular RGB image …