作者
Sonali Gupta, Komal Kumar Bhatia
发表日期
2013/8/24
研讨会论文
2013 International Symposium on Computational and Business Intelligence
页码范围
152-157
出版商
IEEE
简介
The Hidden Web refers to a huge portion of the WWW that holds numerous freely accessible Web databases, hidden behind search form interfaces which can only be accessed through dynamic web pages that are generated in response to the user queries issued at the search form interface. Thus, the core challenge to implement any crawler for the Hidden Web is to routinely surpass these search form interfaces by automatically generating & issuing queries that help discover these dynamic Web pages. The paper provides a novel approach to guide the crawler in choosing the right query term to be submitted to any search form interface that has been designed to accept keywords or terms as input to it. The system is based on the use of classification hierarchies that might have either been manually or automatically constructed. And for the purposes of illustration, we have considered the search form interfaces in the …
引用总数
2013201420152016201720182019202020212022202311112
学术搜索中的文章
S Gupta, KK Bhatia - 2013 International Symposium on Computational and …, 2013