M Li, C Wang, W Feng, S Lyu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual Grounding (VG) aims to localize target objects in an image based on given
expressions and has made significant progress with the development of detection and vision …