An enhanced semantic focused web crawler based on hybrid string matching algorithm

KSS Prabha, C Mahesh, SP Raja - Cybernetics and Information …, 2021 - sciendo.com
Topic precise crawler is a special purpose web crawler, which downloads appropriate web
pages analogous to a particular topic by measuring cosine similarity or semantic similarity …

BOCA: A novel semantic blockchain-based authentication system of educational certificates

MD Nguyen, CH Nguyen-Dinh… - International Journal of …, 2022 - Taylor & Francis
In industrial and academic working environments, illegitimate academic certificates have
been prepared by individuals who aim to be employed without completing regulated training …

Weakly supervised learning for an effective focused web crawler

PRJ Dhanith, K Saeed, G Rohith, SP Raja - Engineering Applications of …, 2024 - Elsevier
Focused crawler traverses the Web to only collect pages that are relevant to a particular
topic, and is increasingly considered as a way to get around the scalability issues with …

Sentiment lexicon for cross-domain adaptation with multi-domain dataset in Indian languages enhanced with BERT classification model

K Suresh Kumar, C Helen Sulochana… - Journal of Intelligent …, 2022 - content.iospress.com
Many websites are attempting to offer a platform for users or customers to leave their reviews
and comments about the products or services in their native languages. The cross-domain …

Amelioration of linguistic semantic classifier with sentiment classifier manacle for the focused web crawler

KSS Prabha, C Mahesh, S Goundar… - International Journal of …, 2023 - Springer
Sentiment relevant information in the web pages concerning products, establishment, and
commodities concentrates principally on the available textual contents. Research on …

An Optimal Topic Centric Crawler for Acquiring Bio-medical Themes Utilizing Gaussian Support Vector Regression

S Rajiv, C Navaneethan - SN Computer Science, 2023 - Springer
Focused crawler (FC) is a web crawler that downloads only relevant web pages for a given
topic. The main source of biomedical information is now the Internet. The volume, pace …

An efficient content extraction method for webpage based on tag-line-block analysis

Z Chen, J Zhou, R Sun - Soft Computing, 2023 - Springer
World Wide Web is a vast information resource that can be used in a broad range of
applications. Web content is an efficient way to derive valuable information from webpages …

An enhanced focused web crawler for biomedical topics using attention enhanced Siamese long short term memory networks

JDPNR Mary, S Balasubramanian… - Brazilian Archives of …, 2021 - SciELO Brasil
The Internet is chosen to be one among the primary source of biomedical information. To
retrieve necessary biomedical information, the search engine needs an efficient, focused …

A Critique Empirical Evaluation of Relevance Computation for Focused Web Crawlers

JDPNR Mary, S Balasubramanian… - Brazilian Archives of …, 2022 - SciELO Brasil
Highlights: This paper presents a survey on focused web crawlers. This paper presents
the challenges in focused crawling research. This paper presents the highlights and …

Weakly supervised learning for an effective focused web crawler

PR Joe Dhanith, K Saeed, G Rohith, SP Raja - 2024 - dl.acm.org
Focused crawler traverses the Web to only collect pages that are relevant to a particular
topic, and is increasingly considered as a way to get around the scalability issues with …