F Liu, K Fang, P Abbeel, S Levine - arXiv e-prints, 2024 - ui.adsabs.harvard.edu
Open-vocabulary generalization requires robotic systems to perform tasks involving complex
and diverse environments and task goals. While the recent advances in vision language …