YAKE! Keyword extraction from single documents using multiple local features

R Campos, V Mangaravite, A Pasquali, A Jorge… - Information …, 2020 - Elsevier
As the amount of generated information grows, reading and summarizing texts of large
collections turns into a challenging task. Many documents do not come with descriptive …

Warcbase: Scalable analytics infrastructure for exploring web archives

J Lin, I Milligan, J Wiebe, A Zhou - Journal on Computing and Cultural …, 2017 - dl.acm.org
Web archiving initiatives around the world capture ephemeral Web content to preserve our
collective digital memory. However, unlocking the potential of Web archives for humanities …

Desiderata for exploratory search interfaces to web archives in support of scholarly activities

A Jackson, J Lin, I Milligan, N Ruest - Proceedings of the 16th ACM/IEEE …, 2016 - dl.acm.org
Web archiving initiatives around the world capture ephemeral web content to preserve our
collective digital memory. In this paper, we describe initial experiences in providing an …

Automatic generation of timelines for past-web events

R Campos, A Pasquali, A Jatowt, V Mangaravite… - The Past Web: Exploring …, 2021 - Springer
Despite significant advances in web archive infrastructures, the problem of exploring the
historical heritage preserved by web archives is yet to be solved. Timeline generation …

Interactive system for automatically generating temporal narratives

A Pasquali, V Mangaravite, R Campos… - Advances in Information …, 2019 - Springer
In this demo, we present a tool that allows to automatically generate temporal summarization
of news collections. Conta-me Histórias (Tell me stories) is a friendly user interface that …

Robots still outnumber humans in web archives, but less than before

HR Jayanetti, K Garg, S Alam, ML Nelson… - … Conference on Theory …, 2022 - Springer
To identify robots and humans and analyze their respective access patterns, we used the
Internet Archive's (IA) Wayback Machine access logs from 2012 and 2019, as well as …

Infrastructure for supporting exploration and discovery in web archives

J Lin, M Gholami, J Rao - … of the 23rd international conference on World …, 2014 - dl.acm.org
Web archiving initiatives around the world capture ephemeral web content to preserve our
collective digital memory. However, unlocking the potential of web archives requires tools …

[HTML][HTML] The ARCOMEM architecture for social-and semantic-driven web archiving

T Risse, E Demidova, S Dietze, W Peters, N Papailiou… - Future Internet, 2014 - mdpi.com
The constantly growing amount of Web content and the success of the Social Web lead to
increasing needs for Web archiving. These needs go beyond the pure preservation of Web …

Estimating contemporary relevance of past news

M Sato, A Jatowt, Y Duan, R Campos… - 2021 ACM/IEEE Joint …, 2021 - ieeexplore.ieee.org
Our society generates massive amounts of digital data, significant portion of which is being
archived and made accessible to the public for the current and future use. In addition …

Robots still outnumber humans in web archives in 2019, but less than in 2015 and 2012

HR Jayanetti, K Garg, S Alam, ML Nelson… - International Journal on …, 2024 - Springer
The significance of the web and the crucial role of web archives in its preservation highlight
the necessity of understanding how users, both human and robot, access web archive …