The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

M Ahmed, R Seraj, SMS Islam - Electronics, 2020 - mdpi.com
The k-means clustering algorithm is considered one of the most powerful and popular data
mining algorithms in the research community. However, despite its popularity, the algorithm …

A comprehensive survey of anomaly detection techniques for high dimensional big data

S Thudumu, P Branch, J Jin, J Singh - Journal of Big Data, 2020 - Springer
Anomaly detection in high dimensional data is becoming a fundamental research problem
that has various applications in the real world. However, many existing anomaly detection …

Clustering algorithms: A comparative approach

MZ Rodriguez, CH Comin, D Casanova, OM Bruno… - PloS one, 2019 - journals.plos.org
Many real-world systems can be studied in terms of pattern recognition tasks, so that proper
use (and understanding) of machine learning methods in practical applications becomes …

Survey of state-of-the-art mixed data clustering algorithms

A Ahmad, SS Khan - Ieee Access, 2019 - ieeexplore.ieee.org
Mixed data comprises both numeric and categorical features, and mixed datasets occur
frequently in many domains, such as health, finance, and marketing. Clustering is often …

Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques

C Fan, F Xiao, S Wang - Applied Energy, 2014 - Elsevier
This paper presents a data mining (DM) based approach to developing ensemble models
for predicting next-day energy consumption and peak power demand, with the aim of …

A survey on feature selection approaches for clustering

E Hancer, B Xue, M Zhang - Artificial Intelligence Review, 2020 - Springer
The massive growth of data in recent years has led challenges in data mining and machine
learning tasks. One of the major challenges is the selection of relevant features from the …

[PDF][PDF] 聚类分析研究中的若干问题

王骏, 王士同, 邓赵红 - 控制与决策, 2012 - researchgate.net
聚类分析研究中的若干问题 Page 1 第27 卷第3 期 Vol. 27 No. 3 控制与决策 Control and
Decision 2012 年3 月 Mar. 2012 聚类分析研究中的若干问题 文章编号: 1001-0920 (2012) 03-0321-08 …

Feature selection for clustering: A review

S Alelyani, J Tang, H Liu - Data Clustering, 2018 - taylorfrancis.com
Dimensionality reduction techniques can be categorized mainly into feature extraction and
feature selection. In the feature extraction approach, features are projected into a new space …

[图书][B] Data mining with Rattle and R: The art of excavating data for knowledge discovery

G Williams - 2011 - books.google.com
Data mining is the art and science of intelligent data analysis. By building knowledge from
information, data mining adds considerable value to the ever increasing stores of electronic …

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

HP Kriegel, P Kröger, A Zimek - … on knowledge discovery from data (tkdd …, 2009 - dl.acm.org
As a prolific research area in data mining, subspace clustering and related problems
induced a vast quantity of proposed solutions. However, many publications compare a new …