Clustering by pattern similarity in large data sets

H Wang, W Wang, J Yang, PS Yu - Proceedings of the 2002 ACM …, 2002 - dl.acm.org
Clustering is the process of grouping a set of objects into classes of similar objects. Although
definitions of similarity vary from one clustering model to another, in most of these models …

A systematic comparative evaluation of biclustering techniques

VA Padilha, RJGB Campello - BMC bioinformatics, 2017 - Springer
Background Biclustering techniques are capable of simultaneously clustering rows and
columns of a data matrix. These techniques became very popular for the analysis of gene …

Enhanced soft subspace clustering integrating within-cluster and between-cluster information

Z Deng, KS Choi, FL Chung, S Wang - Pattern recognition, 2010 - Elsevier
While within-cluster information is commonly utilized in most soft subspace clustering
approaches in order to develop the algorithms, other important information such as between …

Biclustering in data mining

S Busygin, O Prokopyev, PM Pardalos - Computers & Operations Research, 2008 - Elsevier
Biclustering consists in simultaneous partitioning of the set of samples and the set of their
attributes (features) into subsets (classes). Samples and features classified together are …

Locally adaptive metrics for clustering high dimensional data

C Domeniconi, D Gunopulos, S Ma, B Yan… - Data Mining and …, 2007 - Springer
Clustering suffers from the curse of dimensionality, and similarity functions that use all input
features with equal relevance may not be effective. We introduce an algorithm that discovers …

Clustering approaches for high‐dimensional databases: A review

M Mittal, LM Goyal, DJ Hemanth… - … Reviews: Data Mining …, 2019 - Wiley Online Library
Data mining is an inevitable task in most of the emerging computing technologies as it
debilitates the complexity of datasets by rendering a better insight. Moreover, it entails the …

Tricluster: an effective algorithm for mining coherent clusters in 3d microarray data

L Zhao, MJ Zaki - Proceedings of the 2005 ACM SIGMOD international …, 2005 - dl.acm.org
In this paper we introduce a novel algorithm called TRICLUSTER, for mining coherent
clusters in three-dimensional (3D) gene expression datasets. TRICLUSTER can mine …

Scalability and sparsity issues in recommender datasets: a survey

M Singh - Knowledge and Information Systems, 2020 - Springer
Recommender systems have been widely used in various domains including movies, news,
music with an aim to provide the most relevant proposals to users from a variety of available …

[图书][B] Protein interaction networks: computational analysis

A Zhang - 2009 - books.google.com
The analysis of protein-protein interactions is fundamental to the understanding of cellular
organization, processes, and functions. Recent large-scale investigations of protein-protein …

A survey on enhanced subspace clustering

K Sim, V Gopalkrishnan, A Zimek, G Cong - Data mining and knowledge …, 2013 - Springer
Subspace clustering finds sets of objects that are homogeneous in subspaces of high-
dimensional datasets, and has been successfully applied in many domains. In recent years …