作者
Markus Weimer, Iryna Gurevych, Max Mühlhäuser
发表日期
2007/6
研讨会论文
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions
页码范围
125-128
简介
Assessing the quality of user generated content is an important problem for many web forums. While quality is currently assessed manually, we propose an algorithm to assess the quality of forum posts automatically and test it on data provided by Nabble. com. We use state-of-the-art classification techniques and experiment with five feature classes: Surface, Lexical, Syntactic, Forum specific and Similarity features. We achieve an accuracy of 89% on the task of automatically assessing post quality in the software domain using forum specific features. Without forum specific features, we achieve an accuracy of 82%.
引用总数
20072008200920102011201220132014201520162017201820192020202120222023414132115111071055655255
学术搜索中的文章
M Weimer, I Gurevych, M Mühlhäuser - Proceedings of the 45th Annual Meeting of the …, 2007