A survey on unsupervised outlier detection in high‐dimensional numerical data

A Zimek, E Schubert, HP Kriegel - Statistical Analysis and Data …, 2012 - Wiley Online Library
High‐dimensional data in Euclidean space pose special challenges to data mining
algorithms. These challenges are often indiscriminately subsumed under the term 'curse of …

Density‐based clustering

HP Kriegel, P Kröger, J Sander… - … reviews: data mining and …, 2011 - Wiley Online Library
Clustering refers to the task of identifying groups or clusters in a data set. In density‐based
clustering, a cluster is a set of data objects spread in the data space over a contiguous …

[图书][B] Data cleaning

IF Ilyas, X Chu - 2019 - books.google.com
This is an overview of the end-to-end data cleaning process. Data quality is one of the most
important problems in data management, since dirty data often leads to inaccurate data …

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

HP Kriegel, P Kröger, A Zimek - … on knowledge discovery from data (tkdd …, 2009 - dl.acm.org
As a prolific research area in data mining, subspace clustering and related problems
induced a vast quantity of proposed solutions. However, many publications compare a new …

Density‐based clustering

RJGB Campello, P Kröger, J Sander… - … Reviews: Data Mining …, 2020 - Wiley Online Library
Clustering refers to the task of identifying groups or clusters in a data set. In density‐based
clustering, a cluster is a set of data objects spread in the data space over a contiguous …

Evaluating clustering in subspace projections of high dimensional data

E Müller, S Günnemann, I Assent, T Seidl - Proceedings of the VLDB …, 2009 - dl.acm.org
Clustering high dimensional data is an emerging research field. Subspace clustering or
projected clustering group similar objects in subspaces, ie projections, of the full space. In …

Clustering high dimensional data

I Assent - Wiley Interdisciplinary Reviews: Data Mining and …, 2012 - Wiley Online Library
High‐dimensional data, ie, data described by a large number of attributes, pose specific
challenges to clustering. The so‐called 'curse of dimensionality', coined originally to …

A survey on enhanced subspace clustering

K Sim, V Gopalkrishnan, A Zimek, G Cong - Data mining and knowledge …, 2013 - Springer
Subspace clustering finds sets of objects that are homogeneous in subspaces of high-
dimensional datasets, and has been successfully applied in many domains. In recent years …

Clustering validation measures

H Xiong, Z Li - Data clustering, 2018 - taylorfrancis.com
Clustering, one of the most important unsupervised learning problems, is the task of dividing
a set of objects into clusters such that objects within the same cluster are similar while …

Statistical selection of relevant subspace projections for outlier ranking

E Müller, M Schiffer, T Seidl - 2011 IEEE 27th international …, 2011 - ieeexplore.ieee.org
Outlier mining is an important data analysis task to distinguish exceptional outliers from
regular objects. For outlier mining in the full data space, there are well established methods …