Deep ROC analysis and AUC as balanced average accuracy to improve model selection, understanding and interpretation

AM Carrington, DG Manuel, PW Fieguth… - arXiv preprint arXiv …, 2021 - arxiv.org
… , an average of aggregate measures, which is different from … [11] DJ Hand, “Measuring
classifier performance: A coherent … index for evaluating the classification performance at a high …

Empirical analysis of performance assessment for imbalanced classification

JG Gaudreault, P Branco - Machine Learning, 2024 - Springer
… the context or that interpreting the AUC of intersecting ROC … these aggregated results for
all of the performance metrics. … is that the ROC AUC and H-Measure metrics, while not being …

The area under the ROC curve as a measure of clustering quality

PA Jaskowiak, IG Costa, RJGB Campello - Data Mining and Knowledge …, 2022 - Springer
… In this work, we explore AUC as a performance measure in … Even though the ROC graph has
visual appeal, an aggregated … Although classification scores may not be interpreted as strict …

On averaging ROC curves

J Hogan, NM Adams - Transactions on Machine Learning Research, 2023 - openreview.net
… of aggregation and carefully delineates the interpretations of each in … AUCs, as a higher
AUC does not imply an everywhere dominating ROC curve. As a scalar measure of performance

Performance evaluation in machine learning: the good, the bad, the ugly, and the way forward

P Flach - Proceedings of the AAAI conference on artificial …, 2019 - aaai.org
… For example, classification performance can be measured by … (AUC) is itself meaningful as
an aggregated performance … without checking that it is indeed coherent to do so. These issues …

[HTML][HTML] The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

D Chicco, G Jurman - BMC genomics, 2020 - Springer
… role, being a popular performance measure when a singular … by checking both the PR AUC
and the ROC AUC, focusing on … the only aggregate rate highlighting the weak performance of …

A comparison of classification methods across different data complexity scenarios and datasets

M Scholz, T Wimmer - Expert Systems with Applications, 2021 - Elsevier
… if only 3 classifier results are aggregated. Moreover, our results reveal … The H -measure has
been developed as a coherent … are very similar to those generated with the AUC measure. …

Development and validation of an interpretable deep learning framework for Alzheimer's disease classification

S Qiu, PS Joshi, MI Miller, C Xue, X Zhou, C Karjadi… - Brain, 2020 - academic.oup.com
… computed the area under curve (AUC) values. Additionally, we … We also compared performance
of the deep learning models … We then aggregated disease probability maps across the …

A multi-level classification and modified PSO clustering based ensemble approach for credit scoring

I Singh, N Kumar, KG Srinivasa, S Maini, U Ahuja… - Applied Soft …, 2021 - Elsevier
… Multi-level classification aggregates the … of AUC and H measure as well. Similar to what
was observed for the Australian dataset, the classifier showing relatively superior performance

Learning dynamic weights for an ensemble of deep models applied to medical imaging classification

AGC Pacheco, T Trappenberg… - 2020 International Joint …, 2020 - ieeexplore.ieee.org
… images of retinal optical coherence tomography (OCT) with 4 … We also observe that the
ensemble improves the AUC only … the ensemble’s performance by aggregating the models’ …