Multi-view stereo in the deep learning era: A comprehensive review

X Wang, C Wang, B Liu, X Zhou, L Zhang, J Zheng… - Displays, 2021 - Elsevier
Multi-view stereo infers the 3D geometry from a set of images captured from several known
positions and viewpoints. It is one of the most important components of 3D reconstruction …

Panoptic neural fields: A semantic object-aware neural scene representation

A Kundu, K Genova, X Yin, A Fathi… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present PanopticNeRF, an object-aware neural scene representation that decomposes
a scene into a set of objects (things) and background (stuff). Each object is represented by a …

Surroundocc: Multi-camera 3d occupancy prediction for autonomous driving

Y Wei, L Zhao, W Zheng, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D scene understanding plays a vital role in vision-based autonomous driving.
While most existing methods focus on 3D object detection, they have difficulty describing …

Nice-slam: Neural implicit scalable encoding for slam

Z Zhu, S Peng, V Larsson, W Xu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Neural implicit representations have recently shown encouraging results in various
domains, including promising progress in simultaneous localization and mapping (SLAM) …

Cross-view transformers for real-time map-view semantic segmentation

B Zhou, P Krähenbühl - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
We present cross-view transformers, an efficient attention-based model for map-view
semantic segmentation from multiple cameras. Our architecture implicitly learns a mapping …

Nerfingmvs: Guided optimization of neural radiance fields for indoor multi-view stereo

Y Wei, S Liu, Y Rao, W Zhao, J Lu… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this work, we present a new multi-view depth estimation method that utilizes both
conventional SfM reconstruction and learning-based priors over the recently proposed …

Neural 3d scene reconstruction with the manhattan-world assumption

H Guo, S Peng, H Lin, Q Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper addresses the challenge of reconstructing 3D indoor scenes from multi-view
images. Many previous works have shown impressive reconstruction results on textured …

Neumesh: Learning disentangled neural mesh-based implicit field for geometry and texture editing

B Yang, C Bao, J Zeng, H Bao, Y Zhang, Z Cui… - … on Computer Vision, 2022 - Springer
Very recently neural implicit rendering techniques have been rapidly evolved and shown
great advantages in novel view synthesis and 3D scene reconstruction. However, existing …

Eslam: Efficient dense slam system based on hybrid representation of signed distance fields

MM Johari, C Carta, F Fleuret - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We present ESLAM, an efficient implicit neural representation method for Simultaneous
Localization and Mapping (SLAM). ESLAM reads RGB-D frames with unknown camera …

MBEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation

E Xie, Z Yu, D Zhou, J Philion, A Anandkumar… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we propose M $^ 2$ BEV, a unified framework that jointly performs 3D object
detection and map segmentation in the Birds Eye View~(BEV) space with multi-camera …