主题网络爬虫研究综述

于娟, 刘强 - 计算机工程与科学, 2015 - joces.nudt.edu.cn
网络信息资源呈指数级增长, 面对用户越来越个性化的需求, 主题网络爬虫应运而生.
主题网络爬虫是一种下载特定主题网页的程序. 利用在采集页面过程获得的特定信息 …

Focused web crawler with revisit policy

S Mali, BB Meshram - Proceedings of the International Conference & …, 2011 - dl.acm.org
Focused crawlers aim to search only the subset of the web related to a specific topic, and
offer a potential solution to the problem. The major problem is how to retrieve the maximal …

iRecomendYou: A design proposal for the development of a pervasive recommendation system based on student's profile for Ecuador's students' candidature to a …

FM Pinto, M Estefania, N Cerón, R Andrade… - New Advances in …, 2016 - Springer
All recognized successful Ecuador's students have the opportunity to apply for a scholarship
abroad within a set of relevant world's universities listed on-line on SENESCYT's website …

FICA: A novel intelligent crawling algorithm based on reinforcement learning

AMZ Bidoki, N Yazdani… - Web Intelligence and …, 2009 - content.iospress.com
The web is a huge and highly dynamic environment which is growing exponentially in
content and developing fast in structure. No search engine can cover the whole web, thus it …

From Intelligent Crawling to Inclusive Fact-Checking: An End-to-End System

EN Sarr, L Faty, MD SARR… - … Computing in Data …, 2021 - ieeexplore.ieee.org
In this article, we present an aggregation platform of journalistic contents based on an
intelligent crawler of articles from Online Press, of an inclusive fact-checking approach and …

Web page importance ranking

W Gaul - Advances in Data Analysis and Classification, 2011 - Springer
An approach is proposed that uses a set of interesting Web pages as starting point for a
minimum walk algorithm to provide recommendations of additionally important Web …

[PDF][PDF] A bibliography of publications about the Google PageRank algorithm

NHF Beebe - Department of Mathematics, University of Utah, 2025 - netlib.sandia.gov
A Bibliography of Publications about the Google PageRank Algorithm Page 1 A Bibliography of
Publications about the Google PageRank Algorithm Nelson HF Beebe University of Utah …

ARAPONGA: uma ferramenta de apoio a recuperação de informação na web voltado a segurança de redes e sistemas

TG RODRIGUES - 2012 - bdtd.ibict.br
A área de segurança de redes de computadores e sistemas apresenta-se como uma das
maiores preocupações atualmente. À medida que o número de usuários de computadores …

Freshness tuning in focused crawler

S Mali, S Ninoriya, BB Meshram - … & Workshop on Emerging Trends in …, 2011 - dl.acm.org
The dynamic web keeps on changing and unnoticing an important event makes the result
incomplete. All of the web pages do not change. Even if some of them change, they do not …

Target oriented network intelligence collection: effective exploration of social networks

R Puzis, L Kachko, B Hagbi, R Stern, A Felner - World Wide Web, 2019 - Springer
Abstract Target Oriented Network Intelligence Collection (TONIC) is a crawling process
whose goal is to find social network profiles that contain information about a given target …