T Ma, J Zhou, Z Wang, R Qiu, J Liang - arXiv preprint arXiv:2406.09738, 2024 - arxiv.org
Developing robots capable of executing various manipulation tasks, guided by natural language instructions and visual observations of intricate real-world environments, remains …