A comprehensive survey of anomaly detection techniques for high dimensional big data

S Thudumu, P Branch, J Jin, J Singh - Journal of Big Data, 2020 - Springer
Anomaly detection in high dimensional data is becoming a fundamental research problem
that has various applications in the real world. However, many existing anomaly detection …

A survey on unsupervised outlier detection in high‐dimensional numerical data

A Zimek, E Schubert, HP Kriegel - Statistical Analysis and Data …, 2012 - Wiley Online Library
High‐dimensional data in Euclidean space pose special challenges to data mining
algorithms. These challenges are often indiscriminately subsumed under the term 'curse of …

Robust subspace clustering

M Soltanolkotabi, E Elhamifar, EJ Candes - 2014 - projecteuclid.org
Robust subspace clustering Page 1 The Annals of Statistics 2014, Vol. 42, No. 2, 669–699
DOI: 10.1214/13-AOS1199 © Institute of Mathematical Statistics, 2014 ROBUST …

Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering

RC De Amorim, B Mirkin - Pattern Recognition, 2012 - Elsevier
This paper represents another step in overcoming a drawback of K-Means, its lack of
defense against noisy features, using feature weights in the criterion. The Weighted K …

Using multidimensional clustering based collaborative filtering approach improving recommendation diversity

X Li, T Murata - 2012 IEEE/WIC/ACM International Conferences …, 2012 - ieeexplore.ieee.org
In this paper, we present a hybrid recommendation approach for discovering potential
preferences of individual users. The proposed approach provides a flexible solution that …

Scalable anomaly ranking of attributed neighborhoods

B Perozzi, L Akoglu - Proceedings of the 2016 SIAM International …, 2016 - SIAM
Given a graph with node attributes, what neighborhoods are anomalous? To answer this
question, one needs a quality score that utilizes both structure and attributes. Popular …

Clustering high dimensional data

I Assent - Wiley Interdisciplinary Reviews: Data Mining and …, 2012 - Wiley Online Library
High‐dimensional data, ie, data described by a large number of attributes, pose specific
challenges to clustering. The so‐called 'curse of dimensionality', coined originally to …

The role of hubness in clustering high-dimensional data

N Tomasev, M Radovanovic… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
High-dimensional data arise naturally in many domains, and have regularly presented a
great challenge for traditional data mining techniques, both in terms of effectiveness and …

A survey on enhanced subspace clustering

K Sim, V Gopalkrishnan, A Zimek, G Cong - Data mining and knowledge …, 2013 - Springer
Subspace clustering finds sets of objects that are homogeneous in subspaces of high-
dimensional datasets, and has been successfully applied in many domains. In recent years …

Grale: Designing networks for graph learning

J Halcrow, A Mosoi, S Ruth, B Perozzi - Proceedings of the 26th ACM …, 2020 - dl.acm.org
How can we find the right graph for semi-supervised learning? In real world applications, the
choice of which edges to use for computation is the first step in any graph learning process …