A survey of Web crawlers for information retrieval

M Kumar, R Bhatia, D Rattan - Wiley Interdisciplinary Reviews …, 2017 - Wiley Online Library
Performance of any search engine relies heavily on its Web crawler. Web crawlers are the
programs that get webpages from the Web by following hyperlinks. These webpages are …

[BOOK][B] Principles of distributed database systems

MT Özsu, P Valduriez - 1999 - Springer
The first edition of this book appeared in 1991 when the technology was new and there were
not too many products. In the Preface to the first edition, we had quoted Michael Stonebraker …

An efficient deep learning-based scheme for web spam detection in IoT environment

A Makkar, N Kumar - Future Generation Computer Systems, 2020 - Elsevier
From the last few years, Internet of Things has revolutionized the entire world. In this, various
smart objects perform the tasks of sensing and computing to provide uninterrupted services …

Keyword query based focused Web crawler

M Kumar, A Bindal, R Gautam, R Bhatia - Procedia Computer Science, 2018 - Elsevier
Finding information on Web is a difficult and challenging task because of the extremely large
volume of data. Search engine can be used to facilitate this task, but it is still difficult to cover …

Sentiment-focused web crawling

AG Vural, BB Cambazoglu, P Karagoz - ACM Transactions on the Web …, 2014 - dl.acm.org
Sentiments and opinions expressed in Web pages towards objects, entities, and products
constitute an important portion of the textual content available in the Web. In the last decade …

A semantic focused web crawler based on a knowledge representation schema

J Hernandez, HM Marin-Castro, M Morales-Sandoval - Applied Sciences, 2020 - mdpi.com
The Web has become the main source of information in the digital world, expanding to
heterogeneous domains and continuously growing. By means of a search engine, users can …

LSCrawler: a framework for an enhanced focused web crawler based on link semantics

M Yuvarani, A Kannan - … on Web Intelligence (WI 2006 Main …, 2006 - ieeexplore.ieee.org
The traditional process of focused web crawler is to harvest a collection of web documents
that are focused on the topical subspaces. The intricacy of focused crawlers is identifying the …

Architecture of a grid-enabled Web search engine

BB Cambazoglu, E Karaca, T Kucukyilmaz… - Information processing & …, 2007 - Elsevier
Search Engine for South-East Europe (SE4SEE) is a socio-cultural search engine running
on the grid infrastructure. It offers a personalized, on-demand, country-specific, category …

An ontology-based focused crawler

L Kozanidis - Natural Language and Information Systems: 13th …, 2008 - Springer
In this paper we present a novel approach for building a focused crawler. The goal of our
crawler is to effectively identify web pages that relate to a set of pre-defined topics and …

Data replication

MT Özsu, P Valduriez - Principles of Distributed …, 2011 - Springer
As we discussed in previous chapters, distributed databases are typically replicated. The
purposes of replication are multiple: 1. System availability. As discussed in Chapter 1 …