An overview of data warehousing and OLAP technology

S Chaudhuri, U Dayal - ACM Sigmod record, 1997 - dl.acm.org
Data warehousing and on-line analytical processing (OLAP) are essential elements of
decision support, which has increasingly become a focus of the database industry. Many …

A primitive operator for similarity joins in data cleaning

S Chaudhuri, V Ganti, R Kaushik - … International Conference on …, 2006 - ieeexplore.ieee.org
Data cleaning based on similarities involves identification of" close" tuples, where closeness
is evaluated using a variety of similarity functions chosen to suit the domain and application …

Dynamic page generation acceleration using component-level caching

A Datta - US Patent 6,622,168, 2003 - Google Patents
6,026,413 A 2/2000 Challenger et al.... 707/202 The preloader uses a cache replacement
manager to manage 6,055,572 A 4/2000 Saksena............... 709/224 the replacement of …

Hard-core distributions for somewhat hard problems

R Impagliazzo - Proceedings of IEEE 36th Annual Foundations …, 1995 - ieeexplore.ieee.org
Consider a decision problem that cannot be 1-/spl delta/approximated by circuits of a given
size in the sense that any such circuit fails to give the correct answer on at least a/spl …

On computing correlated aggregates over continual data streams

J Gehrke, F Korn, D Srivastava - ACM SIGMOD Record, 2001 - dl.acm.org
In many applications from telephone fraud detection to network management, data arrives in
a stream, and there is a need to maintain a variety of statistical summary information about a …

[PDF][PDF] Rethinking Database System Architecture: Towards a Self-Tuning RISC-Style Database System.

S Chaudhuri, G Weikum - VLDB, 2000 - vldb.org
Database technology is one of the cornerstones for the new millennium's IT landscape.
However, database systems as a unit of code packaging and deployment are at a crossroad …

Mining data streams under block evolution

V Ganti, J Gehrke, R Ramakrishnan - Acm Sigkdd Explorations …, 2002 - dl.acm.org
In this paper we survey recent work on incremental data mining model maintenance and
change detection under block evolution. In block evolution, a dataset is updated periodically …

A method for representing and querying temporal information in OWL

MJ O'Connor, AK Das - International joint conference on biomedical …, 2010 - Springer
Ontologies are becoming a core technology for supporting the sharing, integration, and
management of information sources in Semantic Web applications. As critical as ontologies …

Querying multidimensional databases

L Cabibbo, R Torlone - International Workshop on Database …, 1997 - Springer
Multidimensional databases are large collections of data, often historical, used for
sophisticated analysis oriented to decision making. This activity is supported by an emerging …

Stubby: A transformation-based optimizer for mapreduce workflows

H Lim, H Herodotou, S Babu - arXiv preprint arXiv:1208.0082, 2012 - arxiv.org
There is a growing trend of performing analysis on large datasets using workflows
composed of MapReduce jobs connected through producer-consumer relationships based …