Why normalizing flows fail to detect out-of-distribution data

P Kirichenko, P Izmailov… - Advances in neural …, 2020 - proceedings.neurips.cc
Detecting out-of-distribution (OOD) data is crucial for robust machine learning systems.
Normalizing flows are flexible deep generative models that often surprisingly fail to …

Multiple instance learning: A survey of problem characteristics and applications

MA Carbonneau, V Cheplygina, E Granger… - Pattern Recognition, 2018 - Elsevier
Multiple instance learning (MIL) is a form of weakly supervised learning where training
instances are arranged in sets, called bags, and a label is provided for the entire bag. This …

Masked autoregressive flow for density estimation

G Papamakarios, T Pavlakou… - Advances in neural …, 2017 - proceedings.neurips.cc
Autoregressive models are among the best performing neural density estimators. We
describe an approach for increasing the flexibility of an autoregressive model, based on …

Learning to pivot with adversarial networks

G Louppe, M Kagan, K Cranmer - Advances in neural …, 2017 - proceedings.neurips.cc
Several techniques for domain adaptation have been proposed to account for differences in
the distribution of the data used for training and testing. The majority of this work focuses on …

The effects of random undersampling with simulated class imbalance for big data

T Hasanin, T Khoshgoftaar - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
With the recent explosion of big data, real-world data are increasingly being affected by
larger degrees of class imbalance, likely hindering Machine Learning algorithm …

Mining for Dark Matter Substructure: Inferring subhalo population properties from strong lenses with machine learning

J Brehmer, S Mishra-Sharma, J Hermans… - The Astrophysical …, 2019 - iopscience.iop.org
The subtle and unique imprint of dark matter substructure on extended arcs in strong-lensing
systems contains a wealth of information about the properties and distribution of dark matter …

Approximating likelihood ratios with calibrated discriminative classifiers

K Cranmer, J Pavez, G Louppe - arXiv preprint arXiv:1506.02169, 2015 - arxiv.org
In many fields of science, generalized likelihood ratio tests are established tools for
statistical inference. At the same time, it has become increasingly common that a simulator …

Big data machine learning using apache spark MLlib

M Assefi, E Behravesh, G Liu… - 2017 ieee international …, 2017 - ieeexplore.ieee.org
Artificial intelligence, and particularly machine learning, has been used in many ways by the
research community to turn a variety of diverse and even heterogeneous data sources into …

RanBox: anomaly detection in the copula space

T Dorigo, M Fumanelli, C Maccani, M Mojsovska… - Journal of High Energy …, 2023 - Springer
A bstract The unsupervised search for overdense regions in high-dimensional feature
spaces, where locally high population densities may be associated with anomalous …

Cost efficient gradient boosting

S Peter, F Diego, FA Hamprecht… - Advances in neural …, 2017 - proceedings.neurips.cc
Many applications require learning classifiers or regressors that are both accurate and
cheap to evaluate. Prediction cost can be drastically reduced if the learned predictor is …