T Iki, A Aizawa - Findings of the Association for Computational …, 2020 - aclanthology.org
Referring expression comprehension, which is the ability to ground a language expression to an object in
an image, plays an important role in creating common ground. Many models that fuse visual …