Moka: Open-vocabulary robotic manipulation through mark-based visual prompting

F Liu, K Fang, P Abbeel, S Levine - arXiv preprint arXiv:2403.03174, 2024 - arxiv.org
Open-vocabulary generalization requires robotic systems to perform tasks involving complex
and diverse environments and task goals. While the recent advances in vision language …

MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting

F Liu, K Fang, P Abbeel, S Levine - arXiv e-prints, 2024 - ui.adsabs.harvard.edu
Open-vocabulary generalization requires robotic systems to perform tasks involving complex
and diverse environments and task goals. While the recent advances in vision language …

MOKA: Open-Vocabulary Robotic Manipulation through Mark-Based Visual Prompting

F Liu, K Fang, P Abbeel, S Levine - First Workshop on Vision-Language … - openreview.net
Open-vocabulary generalization requires robotic systems to perform tasks involving complex
and diverse environments and task goals. While the recent advances in vision language …