Learning accurate monocular 3D voxel representation via bilateral voxel transformer

T Cheng, H Jiang, S Chen, B Liao, Q Zhang… - Image and Vision …, 2024 - Elsevier
Vision-based methods for 3D scene perception have been widely explored for autonomous
vehicles. However, inferring complete 3D semantic scenes from monocular 2D images is still …

Trajectory Prediction for Multiple Classes of Road User with Social-Goal Attention Networks

L Astuti, CH Chiu, YC Lin, MC Lin - Available at SSRN 4894095 - papers.ssrn.com
The upcoming prediction of road agent positions is a crucial task in intelligent systems,
requiring consideration of various factors such as the multiple categories of road agents …