Authors
Budi Yuwono, Dik Lun Lee
Publication date
1996/2/26
Conference
Proceedings of the Twelfth International Conference on Data Engineering
Pages
164-171
Publisher
IEEE
Description
Applying information retrieval techniques to the World Wide Web (WWW) environment is a challenge, mostly because of its hypertext/hypermedia nature and the richness of the meta-information it provides. We present four keyword-based search and ranking algorithms for locating relevant WWW pages with respect to user queries. The first algorithm, Boolean Spreading Activation, extends the notion of word occurrence in the Boolean retrieval model by propagating the occurrence of a query word in a page to other pages linked to it. The second algorithm, Most-cited, uses the number of citing hyperlinks between potentially relevant WWW pages to increase the relevance scores of the referenced pages over the referencing pages. The third algorithm, TFxIDF vector space model, is based on word distribution statistics. The last algorithm, Vector Spreading Activation, combines TFxIDF with the spreading activation …
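As a rough illustration of the first two ideas in the description, the sketch below scores a toy link graph in Python. It is not the authors' formulation: the abstract gives no formulas, so the weights `direct` and `spread`, the treatment of links as undirected for propagation, the definition of a "potentially relevant" page as one containing at least one query word, and all names (`PAGES`, `LINKS`, `boolean_spreading_scores`, `most_cited_scores`) are assumptions made only for this example.

```python
from typing import Dict, List, Set

# Hypothetical toy corpus: page id -> set of words appearing on that page.
PAGES: Dict[str, Set[str]] = {
    "a": {"web", "search"},
    "b": {"ranking"},
    "c": {"cooking"},
}

# Hypothetical hyperlinks: page id -> list of pages it links to.
LINKS: Dict[str, List[str]] = {
    "a": ["b"],
    "b": [],
    "c": ["a", "b"],
}


def neighbours(links: Dict[str, List[str]]) -> Dict[str, Set[str]]:
    """Build an undirected neighbour map (assumption: direction is ignored)."""
    result: Dict[str, Set[str]] = {page: set() for page in links}
    for source, targets in links.items():
        for target in targets:
            result.setdefault(source, set()).add(target)
            result.setdefault(target, set()).add(source)
    return result


def boolean_spreading_scores(
    query: List[str],
    pages: Dict[str, Set[str]],
    links: Dict[str, List[str]],
    direct: int = 2,   # assumed weight when the page itself contains the word
    spread: int = 1,   # assumed weight when only a linked page contains it
) -> Dict[str, int]:
    """Score pages by query-word occurrence, propagated to linked pages."""
    linked = neighbours(links)
    scores: Dict[str, int] = {}
    for page, words in pages.items():
        score = 0
        for term in query:
            if term in words:
                score += direct
            elif any(term in pages.get(n, set()) for n in linked.get(page, set())):
                score += spread
        scores[page] = score
    return scores


def most_cited_scores(
    query: List[str],
    pages: Dict[str, Set[str]],
    links: Dict[str, List[str]],
) -> Dict[str, int]:
    """Count incoming links among pages that contain at least one query word."""
    candidates = {p for p, words in pages.items() if words & set(query)}
    scores = {page: 0 for page in candidates}
    for source in candidates:
        for target in links.get(source, []):
            if target in candidates and target != source:
                scores[target] += 1
    return scores


if __name__ == "__main__":
    q = ["web", "ranking"]
    print(boolean_spreading_scores(q, PAGES, LINKS))
    print(most_cited_scores(q, PAGES, LINKS))
```

In this toy graph, page `c` contains neither query word yet still receives a non-zero propagated score because it links to pages that do, while the citation count favours `b`, which is referenced by another candidate page.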
Total citations
[Chart: citations per year, 1995 to 2024]
Scholar articles
B Yuwono, DL Lee - Proceedings of the Twelfth International Conference on …, 1996