A survey of predictive modeling on imbalanced domains

P Branco, L Torgo, RP Ribeiro - ACM computing surveys (CSUR), 2016 - dl.acm.org
Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …

A scoping review of machine learning in psychotherapy research

K Aafjes-van Doorn, C Kamsteeg, J Bate… - Psychotherapy …, 2021 - Taylor & Francis
Abstract Machine learning (ML) offers robust statistical and probabilistic techniques that can
help to make sense of large amounts of data. This scoping review paper aims to broadly …

Revisiting the calibration of modern neural networks

M Minderer, J Djolonga, R Romijnders… - Advances in …, 2021 - proceedings.neurips.cc
Accurate estimation of predictive uncertainty (model calibration) is essential for the safe
application of neural networks. Many instances of miscalibration in modern neural networks …

The Matthews correlation coefficient (MCC) is more informative than Cohen's Kappa and Brier score in binary classification assessment

D Chicco, MJ Warrens, G Jurman - Ieee Access, 2021 - ieeexplore.ieee.org
Even if measuring the outcome of binary classifications is a pivotal task in machine learning
and statistics, no consensus has been reached yet about which statistical rate to employ to …

Probabilistic extension of precision, recall, and f1 score for more thorough evaluation of classification models

R Yacouby, D Axman - Proceedings of the first workshop on …, 2020 - aclanthology.org
In pursuit of the perfect supervised NLP classifier, razor thin margins and low-resource test
sets can make modeling decisions difficult. Popular metrics such as Accuracy, Precision …

Machine learning–XGBoost analysis of language networks to classify patients with epilepsy

L Torlay, M Perrone-Bertolotti, E Thomas, M Baciu - Brain informatics, 2017 - Springer
Our goal was to apply a statistical approach to allow the identification of atypical language
patterns and to differentiate patients with epilepsy from healthy subjects, based on their …

An up-to-date comparison of state-of-the-art classification algorithms

C Zhang, C Liu, X Zhang, G Almpanidis - Expert Systems with Applications, 2017 - Elsevier
Current benchmark reports of classification algorithms generally concern common classifiers
and their variants but do not include many algorithms that have been introduced in recent …

Precision-recall-gain curves: PR analysis done right

P Flach, M Kull - Advances in neural information processing …, 2015 - proceedings.neurips.cc
Precision-Recall analysis abounds in applications of binary classification where true
negatives do not add value and hence should not affect assessment of the classifier's …

Researcher bias: The use of machine learning in software defect prediction

M Shepperd, D Bowes, T Hall - IEEE Transactions on Software …, 2014 - ieeexplore.ieee.org
Background. The ability to predict defect-prone software components would be valuable.
Consequently, there have been many empirical studies to evaluate the performance of …

Performance evaluation in machine learning: the good, the bad, the ugly, and the way forward

P Flach - Proceedings of the AAAI conference on artificial …, 2019 - aaai.org
This paper gives an overview of some ways in which our understanding of performance
evaluation measures for machine-learned classifiers has improved over the last twenty …