Are cluster validity measures (in) valid?

M Gagolewski, M Bartoszuk, A Cena - Information Sciences, 2021 - Elsevier
Internal cluster validity measures (such as the Calinski–Harabasz, Dunn, or Davies–Bouldin
indices) are frequently used for selecting the appropriate number of partitions a dataset …

An efficient entropy based dissimilarity measure to cluster categorical data

AK Kar, AC Mishra, SK Mohanty - Engineering Applications of Artificial …, 2023 - Elsevier
Clustering is an unsupervised learning technique that discovers intrinsic groups based on
proximity between data points. Therefore, the performance of clustering techniques mainly …

An entropy-based weighted dissimilarity metric for numerical data clustering using the distribution of intra feature differences

AA Khan, AC Mishra, SK Mohanty - Knowledge-Based Systems, 2023 - Elsevier
Suitable selection of a proximity measure is one of the fundamental requirements of
clustering. With conventional (dis) similarity measures, many clustering algorithms do not …

Rough set theory applied to finite dimensional vector spaces

A Fatima, I Javaid - Information Sciences, 2024 - Elsevier
In this paper, we study finite dimensional vector spaces using rough set theory (RST) by
defining a Boolean information system IB associated with a vector space V for a given basis …

EDMD: An Entropy based Dissimilarity measure to cluster Mixed-categorical Data

AK Kar, MM Akhter, AC Mishra, SK Mohanty - Pattern Recognition, 2024 - Elsevier
The effectiveness of clustering techniques is significantly influenced by proximity measures
irrespective of type of data and categorical data is no exception. Most of the existing …

Cohesive clustering algorithm based on high-dimensional generalized Fermat points

T Li, X Wang, H Zhong - Information Sciences, 2022 - Elsevier
Utilizing high-dimensional generalized Fermat points (F d-points) as cluster centers, we
propose a new method F d-points Linkage (FL) for calculating intra-cluster and inter-cluster …

EDMIX: an entropy-based dissimilarity measure to cluster mixed data comprising of numerical–nominal–ordinal attributes

AK Kar, AC Mishra, SK Mohanty - Knowledge and Information Systems, 2025 - Springer
Existing (dis) similarity measures for mixed data often ignore data distribution and ordering
information along numerical and categorical features, respectively, during distance …