[HTML][HTML] A comprehensive survey of anomaly detection techniques for high dimensional big data

S Thudumu, P Branch, J Jin, J Singh - Journal of Big Data, 2020 - Springer
Anomaly detection in high dimensional data is becoming a fundamental research problem
that has various applications in the real world. However, many existing anomaly detection …

[HTML][HTML] Survey on exact kNN queries over high-dimensional data space

N Ukey, Z Yang, B Li, G Zhang, Y Hu, W Zhang - Sensors, 2023 - mdpi.com
k nearest neighbours (kNN) queries are fundamental in many applications, ranging from
data mining, recommendation system and Internet of Things, to Industry 4.0 framework …

[图书][B] Data classification

CC Aggarwal, CC Aggarwal - 2015 - Springer
The classification problem is closely related to the clustering problem discussed in Chaps. 6
and 7. While the clustering problem is that of determining similar groups of data points, the …

[图书][B] An introduction to outlier analysis

CC Aggarwal, CC Aggarwal - 2017 - Springer
Outliers are also referred to as abnormalities, discordants, deviants, or anomalies in the data
mining and statistics literature. In most applications, the data is created by one or more …

A survey of text clustering algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
Clustering is a widely studied data mining problem in the text domains. The problem finds
numerous applications in customer segmentation, classification, collaborative filtering …

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

HP Kriegel, P Kröger, A Zimek - … on knowledge discovery from data (tkdd …, 2009 - dl.acm.org
As a prolific research area in data mining, subspace clustering and related problems
induced a vast quantity of proposed solutions. However, many publications compare a new …

Citypulse: Large scale data analytics framework for smart cities

D Puiu, P Barnaghi, R Tönjes, D Kümper, MI Ali… - IEEE …, 2016 - ieeexplore.ieee.org
Our world and our lives are changing in many ways. Communication, networking, and
computing technologies are among the most influential enablers that shape our lives today …

Discovering similar multidimensional trajectories

M Vlachos, G Kollios… - … international conference on …, 2002 - ieeexplore.ieee.org
We investigate techniques for analysis and retrieval of object trajectories in two or three
dimensional space. Such data usually contain a large amount of noise, that has made …

Outlier detection for high dimensional data

CC Aggarwal, PS Yu - Proceedings of the 2001 ACM SIGMOD …, 2001 - dl.acm.org
The outlier detection problem has important applications in the field of fraud detection,
network robustness analysis, and intrusion detection. Most such applications are high …

Locally adaptive dimensionality reduction for indexing large time series databases

E Keogh, K Chakrabarti, M Pazzani… - Proceedings of the 2001 …, 2001 - dl.acm.org
Similarity search in large time series databases has attracted much research interest
recently. It is a difficult problem because of the typically high dimensionality of the data.. The …