Emerging Trends in Autonomous Vehicle Perception: Multimodal Fusion for 3D Object Detection

SY Alaba, AC Gurbuz, JE Ball - World Electric Vehicle Journal, 2024 - mdpi.com
World Electric Vehicle Journal, 2024mdpi.com
The pursuit of autonomous driving relies on developing perception systems capable of
making accurate, robust, and rapid decisions to interpret the driving environment effectively.
Object detection is crucial for understanding the environment at these systems' core. While
2D object detection and classification have advanced significantly with the advent of deep
learning (DL) in computer vision (CV) applications, they fall short in providing essential
depth information, a key element in comprehending driving environments. Consequently, 3D …
The pursuit of autonomous driving relies on developing perception systems capable of making accurate, robust, and rapid decisions to interpret the driving environment effectively. Object detection is crucial for understanding the environment at these systems’ core. While 2D object detection and classification have advanced significantly with the advent of deep learning (DL) in computer vision (CV) applications, they fall short in providing essential depth information, a key element in comprehending driving environments. Consequently, 3D object detection becomes a cornerstone for autonomous driving and robotics, offering precise estimations of object locations and enhancing environmental comprehension. The CV community’s growing interest in 3D object detection is fueled by the evolution of DL models, including Convolutional Neural Networks (CNNs) and Transformer networks. Despite these advancements, challenges such as varying object scales, limited 3D sensor data, and occlusions persist in 3D object detection. To address these challenges, researchers are exploring multimodal techniques that combine information from multiple sensors, such as cameras, radar, and LiDAR, to enhance the performance of perception systems. This survey provides an exhaustive review of multimodal fusion-based 3D object detection methods, focusing on CNN and Transformer-based models. It underscores the necessity of equipping fully autonomous vehicles with diverse sensors to ensure robust and reliable operation. The survey explores the advantages and drawbacks of cameras, LiDAR, and radar sensors. Additionally, it summarizes autonomy datasets and examines the latest advancements in multimodal fusion-based methods. The survey concludes by highlighting the ongoing challenges, open issues, and potential directions for future research.
MDPI
以上显示的是最相近的搜索结果。 查看全部搜索结果