Using AUC and accuracy in evaluating learning algorithms

J Huang, CX Ling - IEEE Transactions on knowledge and Data …, 2005 - ieeexplore.ieee.org
The area under the ROC (receiver operating characteristics) curve, or simply AUC, has been
traditionally used in medical diagnosis since the 1970s. It has recently been proposed as an …

ROC confidence bands: An empirical evaluation

SA Macskassy, F Provost, S Rosset - Proceedings of the 22nd …, 2005 - dl.acm.org
This paper is about constructing confidence bands around ROC curves. We first introduce to
the machine learning community three band-generating methods from the medical field, and …

Confidence intervals for the area under the ROC curve

C Cortes, M Mohri - Advances in neural information …, 2004 - proceedings.neurips.cc
In many applications, good ranking is a highly desirable performance for a classifier. The
criterion commonly used to measure the ranking quality of a classification algorithm is the …

AUC: a better measure than accuracy in comparing learning algorithms

CX Ling, J Huang, H Zhang - … in Artificial Intelligence: 16th Conference of …, 2003 - Springer
Predictive accuracy has been widely used as the main criterion for comparing the predictive
ability of classification systems (such as C4. 5, neural networks, and Naive Bayes). Most of …

A critical analysis of variants of the AUC

S Vanderlooy, E Hüllermeier - Machine Learning, 2008 - Springer
The area under the ROC curve, or AUC, has been widely used to assess the ranking
performance of binary scoring classifiers. Given a sample, the metric considers the ordering …

Model selection via the AUC

S Rosset - Proceedings of the twenty-first international conference …, 2004 - dl.acm.org
We present a statistical analysis of the AUC as an evaluation criterion for classification
scoring models. First, we consider significance tests for the difference between AUC scores …

[图书][B] Evaluating learning algorithms: a classification perspective

N Japkowicz, M Shah - 2011 - books.google.com
The field of machine learning has matured to the point where many sophisticated learning
approaches can be applied to practical applications. Thus it is of critical importance that …

A lot of randomness is hiding in accuracy

A Ben-David - Engineering Applications of Artificial Intelligence, 2007 - Elsevier
The proportion of successful hits, usually referred to as “accuracy”, is by far the most
dominant meter for measuring classifiers' accuracy. This is despite of the fact that accuracy …

[PDF][PDF] Machine learning as an experimental science (revisited)

C Drummond - AAAI workshop on evaluation methods for machine …, 2006 - cdn.aaai.org
In 1988, Langley wrote an influential editorial in the journal Machine Learning titled
“Machine Learning as an Experimental Science”, arguing persuasively for a greater focus on …

Accuracy measures for the comparison of classifiers

V Labatut, H Cherifi - arXiv preprint arXiv:1207.3790, 2012 - arxiv.org
The selection of the best classification algorithm for a given dataset is a very widespread
problem. It is also a complex one, in the sense it requires to make several important …