作者
Yan Yan, Rómer Rosales, Glenn Fung, Mark Schmidt, Gerardo Hermosillo, Luca Bogoni, Linda Moy, Jennifer Dy
发表日期
2010/3/31
研讨会论文
Proceedings of the thirteenth international conference on artificial intelligence and statistics
页码范围
932-939
出版商
JMLR Workshop and Conference Proceedings
简介
Supervised learning from multiple labeling sources is an increasingly important problem in machine learning and data mining. This paper develops a probabilistic approach to this problem when annotators may be unreliable (labels are noisy), but also their expertise varies depending on the data they observe (annotators may have knowledge about different parts of the input space). That is, an annotator may not be consistently accurate (or inaccurate) across the task domain. The presented approach produces classification and annotator models that allow us to provide estimates of the true labels and annotator variable expertise. We provide an analysis of the proposed model under various scenarios and show experimentally that annotator expertise can indeed vary in real tasks and that the presented approach provides clear advantages over previously introduced multi-annotator methods, which only consider general annotator characteristics.
引用总数
2011201220132014201520162017201820192020202120222023202410193628212723151713199134
学术搜索中的文章
Y Yan, R Rosales, G Fung, M Schmidt, G Hermosillo… - Proceedings of the thirteenth international conference …, 2010