Learning better visual dialog agents with pretrained visual-linguistic representation

T Tu, Q Ping, G Thattai, G Tur… - Proceedings of the …, 2021 - openaccess.thecvf.com
GuessWhat?! is a visual dialog guessing game which incorporates a Questioner agent that
generates a sequence of questions, while an Oracle agent answers the respective questions …

Guessing state tracking for visual dialogue

W Pang, X Wang - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
The Guesser is a task of visual grounding in GuessWhat?!-like visual dialogue. It locates the
target object in an image supposed by an Oracle oneself over a question-answer based …

Object Category-Based Visual Dialog for Effective Question Generation

F Xu, Y Zhou, Z Zhong, G Li - International Conference on Computational …, 2024 - Springer
GuessWhat?! is a visual dialog dataset that consists of a series of goal-oriented questions
and answers between a questioner and an answerer. The purpose of the task is to enable …