The evolution of web archiving

M Costa, D Gomes, MJ Silva - International Journal on Digital Libraries, 2017 - Springer
Web archives preserve information published on the web or digitized from printed
publications. Much of this information is unique and historically valuable. However, the lack …

Web archives as a data resource for digital scholars

E Vlassenroot, S Chambers, E Di Pretoro… - International Journal of …, 2019 - Springer
The aim of this article is to provide an exploratory analysis of the landscape of web archiving
activities in Europe. Our contribution, based on desk research, and complemented with data …

Web archive search as research: Methodological and theoretical implications

A Ben-David, H Huurdeman - Alexandria, 2014 - journals.sagepub.com
The field of web archiving is at a turning point. In the early years of web archiving, the single
URL has been the dominant unit for preservation and access. Access tools such as the …

Descriptive metadata for web archiving: literature review of user needs

J Venlet, KS Farrell, T Kim, AJ O'Dell, J Dooley - 2018 - apo.org.au
The OCLC Research Library Partnership Web Archiving Metadata Working Group was
formed to recommend descriptive metadata best practices for archived web content that …

Who and what links to the Internet Archive

Y AlNoamany, A AlSum, MC Weigle… - International Journal on …, 2014 - Springer
The Internet Archive's (IA) Wayback Machine is the largest and oldest public Web
archive and has become a significant repository of our recent history and cultural heritage …

The importance of web archives for humanities

D Gomes, M Costa - … Journal of Humanities and Arts Computing, 2014 - euppublishing.com
The web is the primary means of communication in developed societies. It contains
descriptions of recent events generated through distinct perspectives. Thus, the web is a …

Access patterns for robots and humans in web archives

YA AlNoamany, MC Weigle, ML Nelson - … of the 13th ACM/IEEE-CS joint …, 2013 - dl.acm.org
Although user access patterns on the live web are well-understood, there has been no
corresponding study of how users, both humans and robots, access web archives. Based on …

Robots still outnumber humans in web archives, but less than before

HR Jayanetti, K Garg, S Alam, ML Nelson… - … Conference on Theory …, 2022 - Springer
To identify robots and humans and analyze their respective access patterns, we used the
Internet Archive's (IA) Wayback Machine access logs from 2012 and 2019, as well as …

Learning temporal-dependent ranking models

M Costa, F Couto, M Silva - Proceedings of the 37th international ACM …, 2014 - dl.acm.org
Web archives already hold together more than 534 billion files and this number continues to
grow as new initiatives arise. Searching on all versions of these files acquired throughout …

A survey of web archive search architectures

M Costa, D Gomes, F Couto, M Silva - Proceedings of the 22nd …, 2013 - dl.acm.org
Web archives already hold more than 282 billion documents and users demand full-text
search to explore this historical information. This survey provides an overview of web …