Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric

S Boughorbel, F Jarray, M El-Anbari - PloS one, 2017 - journals.plos.org
Data imbalance is frequently encountered in biomedical applications. Resampling
techniques can be used in binary classification to tackle this issue. However such solutions …

Designing multi-label classifiers that maximize F measures: State of the art

I Pillai, G Fumera, F Roli - Pattern Recognition, 2017 - Elsevier
Multi-label classification problems usually occur in tasks related to information retrieval, like
text and image annotation, and are receiving increasing attention from the machine learning …

Precision-recall-gain curves: PR analysis done right

P Flach, M Kull - Advances in neural information processing …, 2015 - proceedings.neurips.cc
Precision-Recall analysis abounds in applications of binary classification where true
negatives do not add value and hence should not affect assessment of the classifier's …

Optimal thresholding of classifiers to maximize F1 measure

ZC Lipton, C Elkan, B Naryanaswamy - … 15-19, 2014. Proceedings, Part II …, 2014 - Springer
This paper provides new insight into maximizing F1 measures in the context of binary
classification and also in the context of multilabel classification. The harmonic mean of …

Generalized and scalable optimal sparse decision trees

J Lin, C Zhong, D Hu, C Rudin… - … on Machine Learning, 2020 - proceedings.mlr.press
Decision tree optimization is notoriously difficult from a computational perspective but
essential for the field of interpretable machine learning. Despite efforts over the past 40 …

A unified view of multi-label performance measures

XZ Wu, ZH Zhou - international conference on machine …, 2017 - proceedings.mlr.press
Multi-label classification deals with the problem where each instance is associated with
multiple class labels. Because evaluation in multi-label classification is more complicated …

Consistent binary classification with generalized performance metrics

OO Koyejo, N Natarajan… - Advances in neural …, 2014 - proceedings.neurips.cc
Performance metrics for binary classification are designed to capture tradeoffs between four
fundamental population quantities: true positives, false positives, true negatives and false …

Scalable learning of non-decomposable objectives

E Eban, M Schain, A Mackey… - Artificial intelligence …, 2017 - proceedings.mlr.press
Modern retrieval systems are often driven by an underlying machine learning model. The
goal of such systems is to identify and possibly rank the few most relevant items for a given …

Maximum F1-score discriminative training criterion for automatic mispronunciation detection

H Huang, H Xu, X Wang… - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
We carry out an in-depth investigation on a newly proposed Maximum F1-score Criterion
(MFC) discriminative training objective function for Goodness of Pronunciation (GOP) based …

A no-regret generalization of hierarchical softmax to extreme multi-label classification

M Wydmuch, K Jasinska, M Kuznetsov… - Advances in neural …, 2018 - proceedings.neurips.cc
Extreme multi-label classification (XMLC) is a problem of tagging an instance with a small
subset of relevant labels chosen from an extremely large pool of possible labels. Large label …