作者
Zhao Xu, Qingcai Chen, Hongzhi Guo
发表日期
2008/12
研讨会论文
11th Joint International Conference on Information Sciences
页码范围
584-589
出版商
Atlantis Press
简介
Visualizing hierarchy of a website is very helpful for both users’ navigating and search engine efficiently presenting results. In this paper, treating webpages as nodes and hyperlinks as directed edges, the link structure is firstly modeled as weighted directed graph. Considering multiple website features, which include directory path, contents and anchor texts etc., the weight is determined by semantic relevance between webpages. The single source shortest path algorithm is finally applied to extract the Topic hierarchy. Conducted experiment on real web to evaluate the proposed algorithm shows the proposed method gets an average pre-cision gain of 11.67% than baseline method.
引用总数
学术搜索中的文章
Z Xu, Q Chen, H Guo - 11th Joint International Conference on Information …, 2008