A survey on addressing high-class imbalance in big data

JL Leevy, TM Khoshgoftaar, RA Bauder, N Seliya - Journal of Big Data, 2018 - Springer
In a majority–minority classification problem, class imbalance in the dataset (s) can
dramatically skew the performance of classifiers, introducing a prediction bias for the …

Machine learning-based clinical decision support system for treatment recommendation and overall survival prediction of hepatocellular carcinoma: a multi-center …

KH Lee, GH Choi, J Yun, J Choi, MJ Goh, DH Sinn… - npj Digital …, 2024 - nature.com
The treatment decisions for patients with hepatocellular carcinoma are determined by a wide
range of factors, and there is a significant difference between the recommendations of …

Examining characteristics of predictive models with imbalanced big data

T Hasanin, TM Khoshgoftaar, JL Leevy, N Seliya - Journal of Big Data, 2019 - Springer
High class imbalance between majority and minority classes in datasets can skew the
performance of Machine Learning algorithms and bias predictions in favor of the majority …

Survey of software defect prediction features

S Qiu, BE, J He, L Liu - Neural Computing and Applications, 2024 - Springer
Software defect prediction (SDP) is a technique that uses known software features and
defect information to predict target software defects. It helps reduce software development …

Cost‐Sensitive Radial Basis Function Neural Network Classifier for Software Defect Prediction

P Kumudha, R Venkatesan - The Scientific World Journal, 2016 - Wiley Online Library
Effective prediction of software modules, those that are prone to defects, will enable software
developers to achieve efficient allocation of resources and to concentrate on quality …

Exploring the Most Effective Information for Satellite-Derived Bathymetry Models in Different Water Qualities

Z Liu, H Liu, Y Ma, X Ma, J Yang, Y Jiang, S Li - Remote Sensing, 2024 - mdpi.com
Satellite-derived bathymetry (SDB) is an effective means of obtaining global shallow water
depths. However, the effect of inherent optical properties (IOPs) on the accuracy of SDB …

A Survey of Methods for Handling Disk Data Imbalance

S Yuan, P Wu, Y Chen, Q Li - arXiv preprint arXiv:2310.08867, 2023 - arxiv.org
Class imbalance exists in many classification problems, and since the data is designed for
accuracy, imbalance in data classes can lead to classification challenges with a few classes …

A novel feature selection method for software fault prediction model

C Cui, B Liu, G Li - 2019 Annual Reliability and Maintainability …, 2019 - ieeexplore.ieee.org
Software fault prediction (SFP) is an active issue in software engineering (SE). At present,
machine learning (ML) has been successfully applied to SFP classification problems …

An empirical investigation of combining filter-based feature subset selection and data sampling for software defect prediction

K Gao, TM Khoshgoftaar… - International Journal of …, 2015 - World Scientific
The main goal of software quality engineering is to produce a high-quality software product
through the use of various techniques and processes. Classification models are effective …

A comparative study of hybrid feature selection methods using correlation coefficient for microarray data

C Arunkumar, S Ramakrishnan - Journal of Network and Innovative …, 2016 - cspub-jnic.org
Feature selection is a key challenge before the process of classification could be performed.
The classification accuracy would increase by using a good feature selection method and …