Authors
John D King, Yuefeng Li, Xiaohui Tao, Richi Nayak
Publication date
2007/1/1
Journal
Web intelligence and agent systems: An international journal
Volume
5
Issue
3
Pages
233-253
Publisher
IOS Press
Description
Little is known about the content of the major search engines. We present an automatic learning method that trains an ontology with world knowledge of hundreds of different subjects, organized in a three-level taxonomy covering the documents offered in our university library. We then mine this ontology for important classification rules and use these rules to perform an extensive analysis of the content of the largest general-purpose internet search engines in use today. Instead of representing documents and collections as sets of terms, we represent them as sets of subjects, which is a highly efficient and more robust representation of information that also reduces synonymy.
Total citations
[Chart: citations per year, 2006 to 2022]
Scholar articles
JD King, Y Li, X Tao, R Nayak - Web intelligence and agent systems: An international …, 2007