作者
David Fisher, Ashish Jain, Mostafa Keikha, W Bruce Croft, Nedim Lipka
发表日期
2015
期刊
University of Massachusetts, Boston, MA, Technical Report. https://pdfs. semanticscholar. org/9f0c/53afcc5e33b8b22722add0812bf14ccf875b. pdf
简介
Diversification techniques for web search have recently been developed that assume that, for each query, there is a set of underlying aspects or subtopics that address specific user intents. These techniques attempt to balance the relevance of the retrieved documents with the coverage of the aspects. Evaluation of diversification techniques requires some way of defining a set of aspects for each test query and a “gold standard” assignment of documents to aspects. This has made the study of diversification difficult for new data such as microblogs. A related task, keyword-based summarization, is important for microblogs but also has problems in evaluation. In this paper, we describe an approach to evaluating ranking diversity and summarization in microblogs by assuming hashtags correspond to subtopics. We show the viability of this approach to evaluation, and validate the assumption that hashtags are subtopics. The results show that, despite the differences in content, the best techniques for search diversification with microblogs are the same as with web pages. The summarization results confirm that the DSPapprox technique is effective and that phrase-based summarization techniques perform somewhat worse than single words in terms of covering the underlying aspects.
引用总数
201620172018201920202021202220231121
学术搜索中的文章
D Fisher, A Jain, M Keikha, WB Croft, N Lipka - University of Massachusetts, Boston, MA, Technical …, 2015