Smart distributed web crawler

SK Bal, G Geetha - 2016 International Conference on …, 2016 - ieeexplore.ieee.org
Centralized crawlers are not adequate to spider meaningful and relevant portions of the
Web. A crawler with good scalability and load balancing can bring growth to performance …

CrawlPart: Creating crawl partitions in parallel crawlers

S Gupta, KK Bhatia - 2013 International Symposium on …, 2013 - ieeexplore.ieee.org
With the ever proliferating size and scale of the WWW [1], efficient ways of exploring content
are of increasing importance. How can we efficiently retrieve information from it through …

HiCrawl: A Hidden Web Crawler for Medical Domain

S Gupta, KK Bhatia - 2013 International Symposium on …, 2013 - ieeexplore.ieee.org
The Hidden Web refers to a huge portion of the WWW that holds numerous freely accessible
Web databases, hidden behind search form interfaces which can only be accessed through …

A multidomain layered approach in development of industrial ontology to support domain identification for unstructured text

R Kumaravel, S Selvaraj, C Mala - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Due to the emergence of digital revolution and competitiveness in recent decades, almost all
organizations and industries intend to develop solutions to extract information from …

Deep questions in the “deep or hidden” Web

S Gupta, KK Bhatia - Proceedings of the Second International Conference …, 2014 - Springer
Abstract The Hidden Web is a part of the Web that consists mainly of the information inside
databases, ie, anything behind an interactive electronic form (search interfaces), which …

[PDF][PDF] Design and Implementation of a Parallel hidden Web Crawler

S Gupta - 2015 - dspace-jcboseust.refread.com
Abstract World Wide Web (WWW) is the largest repository of information that covers data
from almost all the areas known to mankind. It is a source of information that is most …

A Novel Term Weighing Scheme Towards Efficient Crawl of Textual Databases

S Gupta, KK Bhatia - arXiv preprint arXiv:1311.0339, 2013 - arxiv.org
The Hidden Web is the vast repository of informational databases available only through
search form interfaces, accessible by therein typing a set of keywords in the search forms …

Degenerated Primer Design to Amplify the Heavy Chain Variable Region from Immunoglobulin cDNA

W Ying, C Wei, L Xu, C Bing - First International Multi …, 2006 - ieeexplore.ieee.org
The amplification of variable regions (Fv) of immunoglobulins (Ig) becomes a major
challenge in cloning antibody genes either from hybridoma cell lines or splenic B cells …