查看文章

mlr.press 中的 [PDF]

Trainable calibration measures for neural networks from kernel mean embeddings

作者

Aviral Kumar, Sunita Sarawagi, Ujjwal Jain

发表日期

2018/7/3

研讨会论文

International Conference on Machine Learning

页码范围

2805-2814

出版商

PMLR

简介

Modern neural networks have recently been found to be poorly calibrated, primarily in the direction of over-confidence. Methods like entropy penalty and temperature smoothing improve calibration by clamping confidence, but in doing so compromise the many legitimately confident predictions. We propose a more principled fix that minimizes an explicit calibration error during training. We present MMCE, a RKHS kernel based measure of calibration that is efficiently trainable alongside the negative likelihood loss without careful hyper-parameter tuning. Theoretically too, MMCE is a sound measure of calibration that is minimized at perfect calibration, and whose finite sample estimates are consistent and enjoy fast convergence rates. Extensive experiments on several network architectures demonstrate that MMCE is a fast, stable, and accurate method to minimize calibration error while maximally preserving the number of high confidence predictions.

引用总数

被引用次数：277

20182019202020212022202320242 12 29 48 57 83 45

学术搜索中的文章

Trainable calibration measures for neural networks from kernel mean embeddings

A Kumar, S Sarawagi, U Jain - International Conference on Machine Learning, 2018

被引用次数：277 相关文章所有 6 个版本