M Kumar, R Bhatia, D Rattan - Wiley Interdisciplinary Reviews …, 2017 - Wiley Online Library
Performance of any search engine relies heavily on its Web crawler. Web crawlers are the programs that get webpages from the Web by following hyperlinks. These webpages are …
Information retrieval (IR) has changed considerably in recent years with the expansion of the World Wide Web and the advent of modern and inexpensive graphical user interfaces and …
Cultural artifacts of the past have always had an important role in the formation of consciousness and self-understanding of a society and the construction of its future. The …
G Pant, P Srinivasan - ACM Transactions on Information Systems (TOIS), 2005 - dl.acm.org
Topical crawling is a young and creative area of research that holds the promise of benefiting from several sophisticated data mining techniques. The use of classification …
G Pant, P Srinivasan - IEEE Transactions on knowledge and …, 2005 - ieeexplore.ieee.org
Context of a hyperlink or link context is defined as the terms that appear in the text around a hyperlink within a Web page. Link contexts have been applied to a variety of Web …
Sentiments and opinions expressed in Web pages towards objects, entities, and products constitute an important portion of the textual content available in the Web. In the last decade …
Many Web IR and Digital Library applications require a crawling process to collect pages with the ultimate goal of taking advantage of useful information available on Web sites. For …
M Yuvarani, A Kannan - … on Web Intelligence (WI 2006 Main …, 2006 - ieeexplore.ieee.org
The traditional process of focused web crawler is to harvest a collection of web documents that are focused on the topical subspaces. The intricacy of focused crawlers is identifying the …
Focused crawlers are effective tools for applications requiring a high number of pages belonging to a specific topic. Several strategies for implementing these crawlers have been …