作者
Katrin Tomanek, Joachim Wermter, Udo Hahn
发表日期
2007/6
研讨会论文
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)
页码范围
486-495
简介
We consider the impact Active Learning (AL) has on effective and efficient text corpus annotation, and report on reduction rates for annotation efforts ranging up until 72%. We also address the issue whether a corpus annotated by means of AL–using a particular classifier and a particular feature set–can be re-used to train classifiers different from the ones employed by AL, supplying alternative feature sets as well. We, finally, report on our experience with the AL paradigm under real-world conditions, ie, the annotation of large-scale document corpora for the life sciences.
引用总数
2007200820092010201120122013201420152016201720182019202020212022202320241141618671183834353561
学术搜索中的文章