S Chaudhuri, V Ganti, R Kaushik - … International Conference on …, 2006 - ieeexplore.ieee.org
Data cleaning based on similarities involves identification of" close" tuples, where closeness is evaluated using a variety of similarity functions chosen to suit the domain and application …
A Datta - US Patent 6,622,168, 2003 - Google Patents
6,026,413 A 2/2000 Challenger et al.... 707/202 The preloader uses a cache replacement manager to manage 6,055,572 A 4/2000 Saksena............... 709/224 the replacement of …
R Impagliazzo - Proceedings of IEEE 36th Annual Foundations …, 1995 - ieeexplore.ieee.org
Consider a decision problem that cannot be 1-/spl delta/approximated by circuits of a given size in the sense that any such circuit fails to give the correct answer on at least a/spl …
In many applications from telephone fraud detection to network management, data arrives in a stream, and there is a need to maintain a variety of statistical summary information about a …
Database technology is one of the cornerstones for the new millennium's IT landscape. However, database systems as a unit of code packaging and deployment are at a crossroad …
In this paper we survey recent work on incremental data mining model maintenance and change detection under block evolution. In block evolution, a dataset is updated periodically …
MJ O'Connor, AK Das - International joint conference on biomedical …, 2010 - Springer
Ontologies are becoming a core technology for supporting the sharing, integration, and management of information sources in Semantic Web applications. As critical as ontologies …
L Cabibbo, R Torlone - International Workshop on Database …, 1997 - Springer
Multidimensional databases are large collections of data, often historical, used for sophisticated analysis oriented to decision making. This activity is supported by an emerging …
There is a growing trend of performing analysis on large datasets using workflows composed of MapReduce jobs connected through producer-consumer relationships based …