Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

S Luo, W Chen, W Tian, R Liu, L Hou… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Foundation models have indeed made a profound impact on various fields, emerging as
pivotal components that significantly shape the capabilities of intelligent systems. In the …

Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual Odometry

T Kanai, I Vasiljevic, V Guizilini, K Shintani - arXiv preprint arXiv …, 2024 - arxiv.org
Monocular visual odometry is a key technology in a wide variety of autonomous systems.
Relative to traditional feature-based methods, that suffer from failures due to poor lighting …