A survey on addressing high-class imbalance in big data

JL Leevy, TM Khoshgoftaar, RA Bauder, N Seliya - Journal of Big Data, 2018 - Springer
In a majority–minority classification problem, class imbalance in the dataset (s) can
dramatically skew the performance of classifiers, introducing a prediction bias for the …

Big data preprocessing: methods and prospects

S García, S Ramírez-Gallego, J Luengo, JM Benítez… - Big data analytics, 2016 - Springer
The massive growth in the scale of data has been observed in recent years being a key
factor of the Big Data scenario. Big Data can be defined as high volume, velocity and variety …

A review of deep learning-based recommender system in e-learning environments

T Liu, Q Wu, L Chang, T Gu - Artificial Intelligence Review, 2022 - Springer
While the recent emergence of a large number of online course resources has made life
more convenient for many people, it has also caused information overload. According to a …

An insight into imbalanced big data classification: outcomes and challenges

A Fernández, S del Río, NV Chawla… - Complex & Intelligent …, 2017 - Springer
Big Data applications are emerging during the last years, and researchers from many
disciplines are aware of the high advantages related to the knowledge extraction from this …

Severely imbalanced big data challenges: investigating data sampling approaches

T Hasanin, TM Khoshgoftaar, JL Leevy, RA Bauder - Journal of Big Data, 2019 - Springer
Severe class imbalance between majority and minority classes in Big Data can bias the
predictive performance of Machine Learning algorithms toward the majority (negative) class …

Hybrid classifier ensemble for imbalanced data

K Yang, Z Yu, X Wen, W Cao, CLP Chen… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
The class imbalance problem has become a leading challenge. Although conventional
imbalance learning methods are proposed to tackle this problem, they have some …

360 Degrees rumor detection: When explanations got some explaining to do

B Janssens, L Schetgen, M Bogaert, M Meire… - European Journal of …, 2024 - Elsevier
Unverified rumor detection recently received considerable academic attention due to the
societal impact resulting from this potential misinformation. Previous work in this area mainly …

Data sampling approaches with severely imbalanced big data for medicare fraud detection

RA Bauder, TM Khoshgoftaar… - 2018 IEEE 30th …, 2018 - ieeexplore.ieee.org
Class imbalance is an important problem in machine learning. With increases in available
information and the growing use of Big Data sources to extract meaning from data, the …

Investigating random undersampling and feature selection on bioinformatics big data

T Hasanin, TM Khoshgoftaar, J Leevy… - 2019 IEEE Fifth …, 2019 - ieeexplore.ieee.org
This paper aims to address a key research issue regarding the ECBDL'14 bioinformatics big
data competition. The ECBDL'14 dataset was the big data target in the competition, and it …

A literature survey on various aspect of class imbalance problem in data mining

S Goswami, AK Singh - Multimedia Tools and Applications, 2024 - Springer
Data has become much widely available in recent years. Since the past years, Learning
classifiers from unbalanced data is a crucial issue that comes up frequently in classification …