A gentle introduction to conformal prediction and distribution-free uncertainty quantification

AN Angelopoulos, S Bates - arXiv preprint arXiv:2107.07511, 2021 - arxiv.org
Black-box machine learning models are now routinely used in high-risk settings, like
medical diagnostics, which demand uncertainty quantification to avoid consequential model …

Estimating means of bounded random variables by betting

I Waudby-Smith, A Ramdas - Journal of the Royal Statistical …, 2024 - academic.oup.com
We derive confidence intervals (CIs) and confidence sequences (CSs) for the classical
problem of estimating a bounded mean. Our approach generalizes and improves on the …

Conformal prediction: A gentle introduction

AN Angelopoulos, S Bates - Foundations and Trends® in …, 2023 - nowpublishers.com
Black-box machine learning models are now routinely used in high-risk settings, like
medical diagnostics, which demand uncertainty quantification to avoid consequential model …

Distribution-free uncertainty quantification for classification under label shift

A Podkopaev, A Ramdas - Uncertainty in artificial …, 2021 - proceedings.mlr.press
Trustworthy deployment of ML models requires a proper measure of uncertainty, especially
in safety-critical applications. We focus on uncertainty quantification (UQ) for classification …

Human-aligned calibration for ai-assisted decision making

N Corvelo Benz, M Rodriguez - Advances in Neural …, 2023 - proceedings.neurips.cc
Whenever a binary classifier is used to provide decision support, it typically provides both a
label prediction and a confidence value. Then, the decision maker is supposed to use the …

Top-label calibration and multiclass-to-binary reductions

C Gupta, A Ramdas - arXiv preprint arXiv:2107.08353, 2021 - arxiv.org
A multiclass classifier is said to be top-label calibrated if the reported probability for the
predicted class--the top-label--is calibrated, conditioned on the top-label. This conditioning …

Improving screening processes via calibrated subset selection

L Wang, T Joachims… - … Conference on Machine …, 2022 - proceedings.mlr.press
Many selection processes such as finding patients qualifying for a medical trial or retrieval
pipelines in search engines consist of multiple stages, where an initial screening stage …

Online Platt scaling with calibeating

C Gupta, A Ramdas - International Conference on Machine …, 2023 - proceedings.mlr.press
We present an online post-hoc calibration method, called Online Platt Scaling (OPS), which
combines the Platt scaling technique with online logistic regression. We demonstrate that …

Causal isotonic calibration for heterogeneous treatment effects

L Van Der Laan, E Ulloa-Pérez… - International …, 2023 - proceedings.mlr.press
We propose causal isotonic calibration, a novel nonparametric method for calibrating
predictors of heterogeneous treatment effects. Furthermore, we introduce cross-calibration, a …

Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs

DD Johnson, D Tarlow, D Duvenaud… - arXiv preprint arXiv …, 2024 - arxiv.org
Identifying how much a model ${\widehat {p}} _ {\theta}(Y| X) $ knows about the stochastic
real-world process $ p (Y| X) $ it was trained on is important to ensure it avoids producing …