E-FFC: an enhanced form-focused crawler for domain-specific deep web databases

Y Li, Y Wang, J Du - Journal of Intelligent Information Systems, 2013 - Springer
A key problem of retrieving, integrating and mining rich and high quality information from
massive Deep Web Databases (WDBs) online is how to automatically and effectively …

A novel architecture for deep web crawler

DK Sharma, AK Sharma - International Journal of Information …, 2011 - igi-global.com
A traditional crawler picks up a URL, retrieves the corresponding page and extracts various
links, adding them to the queue. A deep Web crawler, after adding links to the queue, checks …

Deep web information retrieval process: A technical survey

DK Sharma, AK Sharma - Models for Capitalizing on Web …, 2012 - igi-global.com
Web crawlers specialize in downloading web content and analyzing and indexing from
surface web, consisting of interlinked HTML pages. Web crawlers have limitations if the data …

A new architecture of an intelligent agent-based crawler for domain-specific deep web databases

Y Li, Y Wang, E Tian - … on Web Intelligence and Intelligent Agent …, 2012 - ieeexplore.ieee.org
A key problem of retrieving, integrating and mining rich and high quality information from
massive Deep Web Databases (WDBs) online is how to automatically and effectively …

Information Retrieval in the Hidden Web

S Ahmed, S Sharma, SL Yadav - New Opportunities for Sentiment …, 2021 - igi-global.com
Abstract Information retrieval is finding material of unstructured nature within large
collections stored on computers. Surface web consists of indexed content accessible by …

Taxonomies and ontologies in web semantic applications: the new emerging semantic lexicon-based model

V Di Lecce, M Calabrese - 2008 International Conference on …, 2008 - ieeexplore.ieee.org
This paper addresses the new emerging approach of Semantic Lexicon-based systems for
modeling semantic Web applications. The paper endeavors to shed lights on some …

An architecture for extracting information from hidden web databases using intelligent agent technology through reinforcement learning

L Singh, DK Sharma - 2013 IEEE conference on Information & …, 2013 - ieeexplore.ieee.org
The web contains enormous amount of information. From that enormous information only
small amount of that information is visible to users and a huge portion of the information is …

A QIIIEP based domain specific hidden web crawler

DK Sharma, AK Sharma - Proceedings of the International Conference & …, 2011 - dl.acm.org
For context based surfing of World Wide Web in a systematic and automatic manner, a web
crawler is required. The World Wide Web consists interlinked documents and resources that …

Challenges and Opportunities in Investigations of Online Sexual Exploitation of Children: Old Networks, Dark Web, and Proactive Response

F Fortin, S Paquette, S Gagné - Criminal Investigations of Sexual Offenses …, 2021 - Springer
The Internet provides private spaces that make it easier to engage in sexual activities,
including with minors, and has also facilitated contact with young victims. This chapter looks …

[PDF][PDF] Fingerprinting Lexical Contexts over the Web.

V Di Lecce, M Calabrese, D Soldo - J. Univers. Comput. Sci., 2009 - researchgate.net
In this paper a novel technique for identifying lexical contexts in web resources is presented.
The basic idea is to consider web site anchortexts as lexicalized descriptions of an …