Vision-centric BEV perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

Building lane-level maps from aerial images

J Yao, X Pan, T Wu, X Zhang - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Detecting lane lines from sensors is becoming an increasingly significant part of
autonomous driving systems. However, less development has been made on high-definition …

DepthSSC: Depth-spatial alignment and dynamic voxel resolution for monocular 3D semantic scene completion

J Yao, J Zhang - arXiv preprint arXiv:2311.17084, 2023 - arxiv.org
The task of 3D semantic scene completion with monocular cameras is gaining increasing
attention in the field of autonomous driving. Its objective is to predict the occupancy status of …

Enhancing aerial object detection with selective frequency interaction network

W Weng, M Wei, J Ren, F Shen - IEEE Transactions on Artificial …, 2024 - ieeexplore.ieee.org
Aerial object detection is a crucial task in computer vision because it plays a pivotal role in
understanding remote images. However, most Convolutional Neural Network (CNN) …

Rethinking Human Motion Prediction with Symplectic Integral

H Chen, K Lyu, Z Liu, Y Yin… - Proceedings of the …, 2024 - openaccess.thecvf.com
Long-term and accurate forecasting is the long-standing pursuit of the human motion
prediction task. Existing methods typically suffer from dramatic degradation in prediction …

SimPB: A single model for 2D and 3D object detection from multiple cameras

Y Tang, Z Meng, G Chen, E Cheng - European Conference on Computer …, 2025 - Springer
The field of autonomous driving has attracted considerable interest in approaches that
directly infer 3D objects in the Bird's Eye View (BEV) from multiple cameras. Some attempts …

Learning High-Resolution Vector Representation from Multi-camera Images for 3D Object Detection

Z Chen, S Xu, M Ye, Z Qian, X Zou, DY Yeung… - … on Computer Vision, 2025 - Springer
The Bird's-Eye-View (BEV) representation is a critical factor that directly impacts the
3D object detection performance, but the traditional BEV grid representation induces …

Count, decompose and correct: A new approach to handwritten Chinese character error correction

P Hu, J Ma, Z Zhang, J Du, J Zhang - Pattern Recognition, 2025 - Elsevier
Recently, handwritten Chinese character error correction has been greatly improved by
employing encoder–decoder methods to decompose a Chinese character into an …

OAM modes classification and demultiplexing via Fourier optical neural network

J Ye, B Jahannia, H Kang, H Wang… - Complex Light and …, 2024 - spiedigitallibrary.org
Here, we present a free-space optical communication system, adept at managing alignment
deviations and the challenges posed by the atmospheric turbulence in the transmission of …

Video Object Segmentation with Dynamic Query Modulation

H Zhou, R Hu, X Li - arXiv preprint arXiv:2403.11529, 2024 - arxiv.org
Storing intermediate frame segmentations as memory for long-range context modeling,
spatial-temporal memory-based methods have recently showcased impressive results in …