ROSEFW-RF: the winner algorithm for the ECBDL’14 big data competition: an extremely imbalanced...

JL Leevy, TM Khoshgoftaar, RA Bauder, N Seliya - Journal of Big Data, 2018 - Springer

In a majority–minority classification problem, class imbalance in the dataset (s) can
dramatically skew the performance of classifiers, introducing a prediction bias for the …

被引用次数：698 相关文章所有 9 个版本

[HTML] springer.com

[HTML][HTML] Learning from imbalanced data: open challenges and future directions

B Krawczyk - Progress in artificial intelligence, 2016 - Springer

Despite more than two decades of continuous development learning from imbalanced data
is still a focus of intense research. Starting as a problem of skewed distributions of binary …

被引用次数：2322 相关文章所有 7 个版本

A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives and …

S González, S García, J Del Ser, L Rokach, F Herrera - Information Fusion, 2020 - Elsevier

Ensembles, especially ensembles of decision trees, are one of the most popular and
successful techniques in machine learning. Recently, the number of ensemble-based …

被引用次数：336 相关文章所有 2 个版本

[HTML] springer.com Full View

[HTML][HTML] Big data preprocessing: methods and prospects

S García, S Ramírez-Gallego, J Luengo, JM Benítez… - Big data analytics, 2016 - Springer

The massive growth in the scale of data has been observed in recent years being a key
factor of the Big Data scenario. Big Data can be defined as high volume, velocity and variety …

被引用次数：684 相关文章所有 11 个版本

Machine learning meets omics: applications and perspectives

R Li, L Li, Y Xu, J Yang - Briefings in Bioinformatics, 2022 - academic.oup.com

The innovation of biotechnologies has allowed the accumulation of omics data at an
alarming rate, thus introducing the era of 'big data'. Extracting inherent valuable knowledge …

被引用次数：90 相关文章所有 5 个版本

[PDF] arxiv.org

Imbalanced deep learning by minority class incremental rectification

Q Dong, S Gong, X Zhu - IEEE transactions on pattern analysis …, 2018 - ieeexplore.ieee.org

Model learning from class imbalanced training data is a long-standing and significant
challenge for machine learning. In particular, existing deep learning methods consider …

被引用次数：386 相关文章所有 12 个版本

A Pearson's correlation coefficient based decision tree and its parallel implementation

Y Mu, X Liu, L Wang - Information Sciences, 2018 - Elsevier

In this paper, a Pearson's correlation coefficient based decision tree (PCC-Tree) is
established and its parallel implementation is developed in the framework of Map-Reduce …

被引用次数：288 相关文章所有 2 个版本

[PDF] core.ac.uk

kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data

J Maillo, S Ramírez, I Triguero, F Herrera - Knowledge-Based Systems, 2017 - Elsevier

Abstract The k-Nearest Neighbors classifier is a simple yet effective widely renowned
method in data mining. The actual application of this model in the big data domain is not …

被引用次数：379 相关文章所有 15 个版本

Learning imbalanced datasets based on SMOTE and Gaussian distribution

T Pan, J Zhao, W Wu, J Yang - Information Sciences, 2020 - Elsevier

The learning of imbalanced datasets is a ubiquitous challenge for researchers in the fields of
data mining and machine learning. Conventional classifiers are often biased towards the …

被引用次数：171 相关文章所有 2 个版本

[HTML] sciencedirect.com

[HTML][HTML] Improving K-means clustering with enhanced Firefly Algorithms

H Xie, L Zhang, CP Lim, Y Yu, C Liu, H Liu… - Applied Soft …, 2019 - Elsevier

In this research, we propose two variants of the Firefly Algorithm (FA), namely inward
intensified exploration FA (IIEFA) and compound intensified exploration FA (CIEFA), for …

被引用次数：177 相关文章所有 11 个版本

高级搜索

QQ 群