Workload characterization: A survey revisited

MC Calzarossa, L Massari, D Tessera - ACM Computing Surveys (CSUR …, 2016 - dl.acm.org
Workload characterization is a well-established discipline that plays a key role in many
performance engineering studies. The large-scale social behavior inherent in the …

A survey of web metrics

D Dhyani, WK Ng, SS Bhowmick - ACM Computing Surveys (CSUR), 2002 - dl.acm.org
The unabated growth and increasing significance of the World Wide Web has resulted in a
flurry of research activity to improve its capacity for serving information more effectively. But …

[BOOK][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

[BOOK][B] Web data mining: exploring hyperlinks, contents, and usage data

B Liu - 2011 - Springer
Liu has written a comprehensive text on Web mining, which consists of two parts. The first
part covers the data mining and machine learning foundations, where all the essential …

Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

A large-scale study of the evolution of web pages

D Fetterly, M Manasse, M Najork, J Wiener - Proceedings of the 12th …, 2003 - dl.acm.org
How fast does the web change? Does most of the content remain unchanged once it has
been authored, or are the documents continuously updated? Do pages change a little or a …

What's new on the Web? The evolution of the Web from a search engine perspective

A Ntoulas, J Cho, C Olston - … of the 13th international conference on …, 2004 - dl.acm.org
We seek to gain improved insight into how Web search engines should cope with the
evolving Web, in an attempt to provide users with the most up-to-date results possible. For …

A framework for web science

T Berners-Lee, W Hall, JA Hendler… - … and Trends® in Web …, 2006 - nowpublishers.com
This text sets out a series of approaches to the analysis and synthesis of the World Wide
Web, and other web-like information structures. A comprehensive set of research questions …

Meta algorithms for hierarchical web caches

N Laoutaris, S Syntila… - … and Communications, 2004, 2004 - ieeexplore.ieee.org
Large scale hierarchical caches for Web content have been deployed widely in an attempt to
reduce delivery delays and bandwidth consumption and also to improve the scalability of …

Estimating frequency of change

J Cho, H Garcia-Molina - ACM Transactions on Internet Technology …, 2003 - dl.acm.org
Many online data sources are updated autonomously and independently. In this article, we
make the case for estimating the change frequency of data to improve Web crawlers, Web …