[PDF][PDF] An Extension on" Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons.

S Garcia, F Herrera - Journal of machine learning research, 2008 - jmlr.org
In a recently published paper in JMLR, Demšar (2006) recommends a set of non-parametric
statistical tests and procedures which can be safely used for comparing the performance of …

[PDF][PDF] Statistical comparisons of classifiers over multiple data sets

J Demšar - The Journal of Machine learning research, 2006 - jmlr.org
While methods for comparing two learning algorithms on a single data set have been
scrutinized for quite some time already, the issue of statistical tests for comparisons of more …

Parametric methods for comparing the performance of two classification algorithms evaluated by k-fold cross validation on multiple data sets

TT Wong - Pattern Recognition, 2017 - Elsevier
A popular procedure for identifying which one of two classification algorithms has a better
performance is to test them on multiple data sets, and the accuracies resulting from k-fold …

T-Friedman test: a new statistical test for multiple comparison with an adjustable conservativeness measure

J Liu, Y Xu - International Journal of Computational Intelligence …, 2022 - Springer
To prove that a certain algorithm is superior to the benchmark algorithms, the statistical
hypothesis tests are commonly adopted with experimental results on a number of datasets …

Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power

S García, A Fernández, J Luengo, F Herrera - Information sciences, 2010 - Elsevier
Experimental analysis of the performance of a proposed method is a crucial and necessary
task in an investigation. In this paper, we focus on the use of nonparametric statistical …

Combined 5× 2 cv F test for comparing supervised classification learning algorithms

E Alpaydm - Neural computation, 1999 - ieeexplore.ieee.org
Dietterich (1998) reviews five statistical tests and proposes the 5× 2 cv t test for determining
whether there is a significant difference between the error rates of two classifiers. In our …

Combination of multiple classifiers using local accuracy estimates

K Woods, WP Kegelmeyer… - IEEE transactions on …, 1997 - ieeexplore.ieee.org
This paper presents a method for combining classifiers that uses estimates of each
individual classifier's local accuracy in small regions of feature space surrounding an …

Classification accuracy as a proxy for two-sample testing

I Kim, A Ramdas, A Singh, L Wasserman - 2021 - projecteuclid.org
Classification accuracy as a proxy for two-sample testing Page 1 The Annals of Statistics
2021, Vol. 49, No. 1, 411–434 https://doi.org/10.1214/20-AOS1962 © Institute of Mathematical …

[PDF][PDF] On the Consistency of Multiclass Classification Methods.

A Tewari, PL Bartlett - Journal of Machine Learning Research, 2007 - jmlr.org
Binary classification is a well studied special case of the classification problem. Statistical
properties of binary classifiers, such as consistency, have been investigated in a variety of …

[PDF][PDF] Permutation tests for studying classifier performance.

M Ojala, GC Garriga - Journal of machine learning research, 2010 - jmlr.org
We explore the framework of permutation-based p-values for assessing the performance of
classifiers. In this paper we study two simple permutation tests. The first test assess whether …