Data discretization: taxonomy and big data challenge

S Ramírez‐Gallego, S García… - … : Data Mining and …, 2016 - Wiley Online Library
Discretization of numerical data is one of the most influential data preprocessing tasks in
knowledge discovery and data mining. The purpose of attribute discretization is to find …

[HTML][HTML] Network traffic classification for data fusion: A survey

J Zhao, X Jing, Z Yan, W Pedrycz - Information Fusion, 2021 - Elsevier
Traffic classification groups similar or related traffic data, which is one main stream
technique of data fusion in the field of network management and security. With the rapid …

Tutorial on practical tips of the most influential data preprocessing algorithms in data mining

S García, J Luengo, F Herrera - Knowledge-Based Systems, 2016 - Elsevier
Data preprocessing is a major and essential stage whose main goal is to obtain final data
sets that can be considered correct and useful for further data mining algorithms. This paper …

Gene selection for microarray data classification based on Gray Wolf Optimizer enhanced with TRIZ-inspired operators

OA Alomari, SN Makhadmeh, MA Al-Betar… - Knowledge-Based …, 2021 - Elsevier
DNA microarray technology is the fabrication of a single chip to contain a thousand genetic
codes. Each microarray experiment can analyze many thousands of genes in parallel. The …

A survey of discretization techniques: Taxonomy and empirical analysis in supervised learning

S Garcia, J Luengo, JA Sáez, V Lopez… - IEEE transactions on …, 2012 - ieeexplore.ieee.org
Discretization is an essential preprocessing technique used in many knowledge discovery
and data mining tasks. Its main goal is to transform a set of continuous attributes into discrete …

A fitting model for feature selection with fuzzy rough sets

C Wang, Y Qi, M Shao, Q Hu, D Chen… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
A fuzzy rough set is an important rough set model used for feature selection. It uses the fuzzy
rough dependency as a criterion for feature selection. However, this model can merely …

Topological approaches to covering rough sets

W Zhu - Information sciences, 2007 - Elsevier
Rough sets, a tool for data mining, deal with the vagueness and granularity in information
systems. This paper studies covering-based rough sets from the topological view. We …

On three types of covering-based rough sets

W Zhu, FY Wang - IEEE transactions on knowledge and data …, 2007 - ieeexplore.ieee.org
Rough set theory is a useful tool for data mining. It is based on equivalence relations and
has been extended to covering-based generalized rough set. This paper studies three kinds …

Generalized rough sets based on relations

W Zhu - Information Sciences, 2007 - Elsevier
Rough set theory has been proposed by Pawlak as a tool for dealing with the vagueness
and granularity in information systems. The core concepts of classical rough sets are lower …

An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification

H Shi, H Li, D Zhang, C Cheng, X Cao - Computer Networks, 2018 - Elsevier
Substantial recent efforts have been made on the application of Machine Learning (ML)
techniques to flow statistical features for traffic classification. However, the classification …