The aim of this article is to provide an exploratory analysis of the landscape of web archiving activities in Europe. Our contribution, based on desk research, and complemented with data …
The field of web archiving is at a turning point. In the early years of web archiving, the single URL has been the dominant unit for preservation and access. Access tools such as the …
The OCLC Research Library Partnership Web Archiving Metadata Working Group was formed to recommend descriptive metadata best practices for archived web content that …
Abstract The Internet Archive's (IA) Wayback Machine is the largest and oldest public Web archive and has become a significant repository of our recent history and cultural heritage …
D Gomes, M Costa - … Journal of Humanities and Arts Computing, 2014 - euppublishing.com
The web is the primary means of communication in developed societies. It contains descriptions of recent events generated through distinct perspectives. Thus, the web is a …
Although user access patterns on the live web are well-understood, there has been no corresponding study of how users, both humans and robots, access web archives. Based on …
To identify robots and humans and analyze their respective access patterns, we used the Internet Archive's (IA) Wayback Machine access logs from 2012 and 2019, as well as …
M Costa, F Couto, M Silva - Proceedings of the 37th international ACM …, 2014 - dl.acm.org
Web archives already hold together more than 534 billion files and this number continues to grow as new initiatives arise. Searching on all versions of these files acquired throughout …
Web archives already hold more than 282 billion documents and users demand full-text search to explore this historical information. This survey provides an overview of web …