Optimal feature subset selection in high dimensional data clustering

KC Sharmili… - International Journal of …, 2016 - inderscienceonline.com
Feature subset selection is the process of identifying and removing many irrelevant and
redundant features. Initially, the input micro array dataset is selected from the medical …

Clustering for high dimensional categorical data based on text similarity

GS Narayana, D Vasumathi - … of the 2nd international conference on …, 2016 - dl.acm.org
It is a well-known fact that a variety of cluster analysis techniques exist to group objects
which have characteristics related to one another. But the fact of the matter is the …

Differential feature recognition of breast cancer patients based on minimum spanning tree clustering and f-statistics

J Xie, Y Li, Y Zhou, M Wang - … , HIS 2016, Shanghai, China, November 5-7 …, 2016 - Springer
The differential feature recognition algorithm of breast cancer patients is presented in this
paper based on minimum spanning tree (MST) and F-statistics. The algorithm uses the …

A Clustering based Feature Selection Approach using Maximum Spanning Tree

MH Tarek, S Akhter, S Ahmed, MS Islam… - … University Journal of …, 2022 - banglajol.info
Mutual information (MI) based feature selection methods are getting popular as its ability to
capture the nonlinear and linear relationship among random variables and thus it performs …

An Effective Machine Learning Approach for Clustering Categorical Data with High Dimensions

S Umar, TD Deressa, TB Yadesa, GB Beshan… - … Conference on Artificial …, 2021 - Springer
Many modern real world databases include redundant quantities of categorical data that
contribute in data processing and efficient decision-making with their advances in database …

Effective Feature Subset Identification Using Adaptive Bee Colony Algorithm

K Suja, R Ab - Intelligent Systems and Computer Technology, 2020 - ebooks.iospress.nl
The irrelevant features, along with redundant features, severely affect the accuracy of the
learning machines. Feature subset selection as the process of identifying and removing …

Probabilistic Approach for Evaluating Metabolite Sample Integrity

BM Slaff, ST Jensen, AM Weljie - arXiv preprint arXiv:1506.04443, 2015 - arxiv.org
The success of metabolomics studies depends upon the" fitness" of each biological sample
used for analysis: it is critical that metabolite levels reported for a biological sample …

[PDF][PDF] Extraction of Best Attribute Subset using Kruskal's Algorithm

SR Yadav, RP Patki - International Journal of Computer Applications, 2015 - Citeseer
Data mining is the technique by which one can extract efficient and effective data from huge
amount of raw data. There are various techniques for extracting useful data. Attribute …