Authors
Aida Nematzadeh, Kaylee Burns, Erin Grant, Alison Gopnik, Thomas L Griffiths
Publication date
2018/8/28
Journal
arXiv preprint arXiv:1808.09352
Description
We propose a new dataset for evaluating question answering models with respect to their capacity to reason about beliefs. Our tasks are inspired by theory-of-mind experiments that examine whether children are able to reason about the beliefs of others, in particular when those beliefs differ from reality. We evaluate a number of recent neural models with memory augmentation. We find that all fail on our tasks, which require keeping track of inconsistent states of the world; moreover, the models' accuracy decreases notably when random sentences are introduced to the tasks at test.
Total citations
[Bar chart: citations per year, 2018 to 2024; per-year counts are fused in the extracted text and cannot be separated reliably]
Scholar articles
A Nematzadeh, K Burns, E Grant, A Gopnik, TL Griffiths - arXiv preprint arXiv:1808.09352, 2018