作者
Parvaz Mahdabi, Linda Andersson, Mostafa Keikha, Fabio Crestani
发表日期
2012/8/12
图书
Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
页码范围
505-514
简介
Patent prior art queries are full patent applications which are much longer than standard web search topics. Such queries are composed of hundreds of terms and do not represent a focused information need. One way to make the queries more focused is to select a group of key terms as representatives. Existing works show that such a selection to reduce patent queries is a challenging task mainly because of the presence of ambiguous terms. Given this setup, we present a query modeling approach where we utilize patent-specific characteristics to generate more precise queries. We propose to automatically disambiguate query terms by employing noun phrases that are extracted using the global analysis of the patent collection. We further introduce a method for predicting whether expansion using noun phrases would improve the retrieval effectiveness.
Our experiments show that we can obtain almost 20 …
引用总数
201220132014201520162017201820192020202120222023202425987515222
学术搜索中的文章
P Mahdabi, L Andersson, M Keikha, F Crestani - Proceedings of the 35th international ACM SIGIR …, 2012