Web information organization using keyword distillation based clustering

T Shibata, Y Bamba, K Shinzato… - 2009 IEEE/WIC/ACM …, 2009 - ieeexplore.ieee.org
2009 IEEE/WIC/ACM International Joint Conference on Web …, 2009ieeexplore.ieee.org
This paper describes a system that conducts search result clustering for several thousands
of Web pages, and elaborates cluster labels through keyword distillation. Keyword
distillation is a method that properly handles spelling variations, transliterations, synonyms,
inclusion relations and word ambiguity, using linguistic resources and contexts of a user's
query. The system provides a clustering result from 1,000 pages in less than one minute by
taking advantage of a search engine infrastructure and grid computing environment …
This paper describes a system that conducts search result clustering for several thousands of Web pages, and elaborates cluster labels through keyword distillation. Keyword distillation is a method that properly handles spelling variations, transliterations, synonyms, inclusion relations and word ambiguity, using linguistic resources and contexts of a user's query. The system provides a clustering result from 1,000 pages in less than one minute by taking advantage of a search engine infrastructure and grid computing environment. Experimental results show that the system correctly merged synonymous keywords and is useful for finding topics hidden in the lower-ranked pages in a search result.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果