A comprehensive survey of clustering algorithms: State-of-the-art machine learning applications, taxonomy, challenges, and future research prospects

AE Ezugwu, AM Ikotun, OO Oyelade… - … Applications of Artificial …, 2022 - Elsevier
Clustering is an essential tool in data mining research and applications. It is the subject of
active research in many fields of study, such as computer science, data science, statistics …

Biclustering algorithms for biological data analysis: a survey

SC Madeira, AL Oliveira - IEEE/ACM transactions on …, 2004 - ieeexplore.ieee.org
A large number of clustering approaches have been proposed for the analysis of gene
expression data obtained from microarray experiments. However, the results from the …

[图书][B] Data clustering: theory, algorithms, and applications

G Gan, C Ma, J Wu - 2020 - SIAM
The monograph Data Clustering: Theory, Algorithms, and Applications was published in
2007. Starting with the common ground and knowledge for data clustering, the monograph …

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering

HP Kriegel, P Kröger, A Zimek - … on knowledge discovery from data (tkdd …, 2009 - dl.acm.org
As a prolific research area in data mining, subspace clustering and related problems
induced a vast quantity of proposed solutions. However, many publications compare a new …

Subspace clustering for high dimensional data: a review

L Parsons, E Haque, H Liu - Acm sigkdd explorations newsletter, 2004 - dl.acm.org
Subspace clustering is an extension of traditional clustering that seeks to find clusters in
different subspaces within a dataset. Often in high dimensional data, many dimensions are …

Cluster analysis for gene expression data: a survey

D Jiang, C Tang, A Zhang - IEEE Transactions on knowledge …, 2004 - ieeexplore.ieee.org
DNA microarray technology has now made it possible to simultaneously monitor the
expression levels of thousands of genes during important biological processes and across …

A systematic comparison and evaluation of biclustering methods for gene expression data

A Prelić, S Bleuler, P Zimmermann, A Wille… - …, 2006 - academic.oup.com
Motivation: In recent years, there have been various efforts to overcome the limitations of
standard clustering approaches for the analysis of gene expression data by grouping genes …

An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data

L Jing, MK Ng, JZ Huang - IEEE Transactions on knowledge …, 2007 - ieeexplore.ieee.org
This paper presents a new k-means type algorithm for clustering high-dimensional objects in
sub-spaces. In high-dimensional data, clusters of objects often exist in subspaces rather …

[图书][B] Network anomaly detection: A machine learning perspective

DK Bhattacharyya, JK Kalita - 2013 - books.google.com
With the rapid rise in the ubiquity and sophistication of Internet technology and the
accompanying growth in the number of network attacks, network intrusion detection has …

The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo

R Bonneau, DJ Reiss, P Shannon, M Facciotti, L Hood… - Genome biology, 2006 - Springer
We present a method (the Inferelator) for deriving genome-wide transcriptional regulatory
interactions, and apply the method to predict a large portion of the regulatory network of the …