Answering queries using views: A survey

AY Halevy - The VLDB Journal, 2001 - Springer
The problem of answering queries using views is to find efficient methods of answering a
query using a set of previously defined materialized views over the database, rather than …

Data fusion

J Bleiholder, F Naumann - ACM computing surveys (CSUR), 2009 - dl.acm.org
The development of the Internet in recent years has made it possible and useful to access
many different information systems anywhere in the world to obtain information. While there …

[图书][B] Principles of distributed database systems

MT Özsu, P Valduriez - 1999 - Springer
The first edition of this book appeared in 1991 when the technology was new and there were
not too many products. In the Preface to the first edition, we had quoted Michael Stonebraker …

Data integration: A theoretical perspective

M Lenzerini - Proceedings of the twenty-first ACM SIGMOD-SIGACT …, 2002 - dl.acm.org
Data integration is the problem of combining data residing at different sources, and
providing the user with a unified view of these data. The problem of designing data …

The state of the art in distributed query processing

D Kossmann - ACM Computing Surveys (CSUR), 2000 - dl.acm.org
Distributed data processing is becoming a reality. Businesses want to do it for many
reasons, and they often must do it in order to stay competitive. While much of the …

[PDF][PDF] Data integration: The teenage years

A Halevy, A Rajaraman, J Ordille - … conference on Very large data bases, 2006 - cin.ufpe.br
Data integration is a pervasive challenge faced in applications that need to query across
multiple autonomous and heterogeneous data sources. Data integration is crucial in large …

Extracting structured data from web pages

A Arasu, H Garcia-Molina - Proceedings of the 2003 ACM SIGMOD …, 2003 - dl.acm.org
Many web sites contain large sets of pages generated using a common template or layout.
For example, Amazon lays out the author, title, comments, etc. in the same way in all its book …

Querying semi-structured data

S Abiteboul - Database Theory—ICDT'97: 6th International …, 1997 - Springer
The amount of data of all kinds available electronically has increased dramatically in recent
years. The data resides in different forms, ranging from unstructured data in file systems to …

Large-scale live active learning: Training object detectors with crawled data and crowds

S Vijayanarasimhan, K Grauman - International journal of computer vision, 2014 - Springer
Active learning and crowdsourcing are promising ways to efficiently build up training sets for
object recognition, but thus far techniques are tested in artificially controlled settings …

Reconciling schemas of disparate data sources: A machine-learning approach

AH Doan, P Domingos, AY Halevy - Proceedings of the 2001 ACM …, 2001 - dl.acm.org
A data-integration system provides access to a multitude of data sources through a single
mediated schema. A key bottleneck in building such systems has been the laborious manual …