A unified view of performance metrics: Translating threshold choice into expected classification...

P Branco, L Torgo, RP Ribeiro - ACM computing surveys (CSUR), 2016 - dl.acm.org

Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …

被引用次数：1337 相关文章所有 13 个版本

[PDF] academia.edu

A scoping review of machine learning in psychotherapy research

K Aafjes-van Doorn, C Kamsteeg, J Bate… - Psychotherapy …, 2021 - Taylor & Francis

Abstract Machine learning (ML) offers robust statistical and probabilistic techniques that can
help to make sense of large amounts of data. This scoping review paper aims to broadly …

被引用次数：143 相关文章所有 7 个版本

[PDF] neurips.cc

Revisiting the calibration of modern neural networks

M Minderer, J Djolonga, R Romijnders… - Advances in …, 2021 - proceedings.neurips.cc

Accurate estimation of predictive uncertainty (model calibration) is essential for the safe
application of neural networks. Many instances of miscalibration in modern neural networks …

被引用次数：277 相关文章所有 8 个版本

[PDF] ieee.org

The Matthews correlation coefficient (MCC) is more informative than Cohen's Kappa and Brier score in binary classification assessment

D Chicco, MJ Warrens, G Jurman - Ieee Access, 2021 - ieeexplore.ieee.org

Even if measuring the outcome of binary classifications is a pivotal task in machine learning
and statistics, no consensus has been reached yet about which statistical rate to employ to …

被引用次数：242 相关文章所有 10 个版本

[PDF] aclanthology.org

Probabilistic extension of precision, recall, and f1 score for more thorough evaluation of classification models

R Yacouby, D Axman - Proceedings of the first workshop on …, 2020 - aclanthology.org

In pursuit of the perfect supervised NLP classifier, razor thin margins and low-resource test
sets can make modeling decisions difficult. Popular metrics such as Accuracy, Precision …

被引用次数：274 相关文章所有 4 个版本

[PDF] springer.com

Machine learning–XGBoost analysis of language networks to classify patients with epilepsy

L Torlay, M Perrone-Bertolotti, E Thomas, M Baciu - Brain informatics, 2017 - Springer

Our goal was to apply a statistical approach to allow the identification of atypical language
patterns and to differentiate patients with epilepsy from healthy subjects, based on their …

被引用次数：434 相关文章所有 19 个版本

[PDF] kaust.edu.sa

An up-to-date comparison of state-of-the-art classification algorithms

C Zhang, C Liu, X Zhang, G Almpanidis - Expert Systems with Applications, 2017 - Elsevier

Current benchmark reports of classification algorithms generally concern common classifiers
and their variants but do not include many algorithms that have been introduced in recent …

被引用次数：435 相关文章所有 5 个版本

[PDF] neurips.cc

Precision-recall-gain curves: PR analysis done right

P Flach, M Kull - Advances in neural information processing …, 2015 - proceedings.neurips.cc

Precision-Recall analysis abounds in applications of binary classification where true
negatives do not add value and hence should not affect assessment of the classifier's …

被引用次数：420 相关文章所有 11 个版本

[PDF] brunel.ac.uk

Researcher bias: The use of machine learning in software defect prediction

M Shepperd, D Bowes, T Hall - IEEE Transactions on Software …, 2014 - ieeexplore.ieee.org

Background. The ability to predict defect-prone software components would be valuable.
Consequently, there have been many empirical studies to evaluate the performance of …

被引用次数：439 相关文章所有 13 个版本

[PDF] aaai.org

Performance evaluation in machine learning: the good, the bad, the ugly, and the way forward

P Flach - Proceedings of the AAAI conference on artificial …, 2019 - aaai.org

This paper gives an overview of some ways in which our understanding of performance
evaluation measures for machine-learned classifiers has improved over the last twenty …

被引用次数：164 相关文章所有 8 个版本

高级搜索

QQ 群