A survey of contextual optimization methods for decision-making under uncertainty

U Sadana, A Chenreddy, E Delage, A Forel… - European Journal of …, 2024 - Elsevier
Recently there has been a surge of interest in operations research (OR) and the machine
learning (ML) community in combining prediction algorithms and optimization techniques to …

On neural differential equations

P Kidger - arXiv preprint arXiv:2202.02435, 2022 - arxiv.org
The conjoining of dynamical systems and deep learning has become a topic of great
interest. In particular, neural differential equations (NDEs) demonstrate that neural networks …

Theseus: A library for differentiable nonlinear optimization

L Pineda, T Fan, M Monge… - Advances in …, 2022 - proceedings.neurips.cc
We present Theseus, an efficient application-agnostic open source library for differentiable
nonlinear least squares (DNLS) optimization built on PyTorch, providing a common …

Craft: Concept recursive activation factorization for explainability

T Fel, A Picard, L Bethune, T Boissin… - Proceedings of the …, 2023 - openaccess.thecvf.com
Attribution methods are a popular class of explainability methods that use heatmaps to
depict the most important areas of an image that drive a model decision. Nevertheless …

Linear adversarial concept erasure

S Ravfogel, M Twiton, Y Goldberg… - … on Machine Learning, 2022 - proceedings.mlr.press
Modern neural models trained on textual data rely on pre-trained representations that
emerge without direct supervision. As these representations are increasingly being used in …

The elements of differentiable programming

M Blondel, V Roulet - arXiv preprint arXiv:2403.14606, 2024 - arxiv.org
Artificial intelligence has recently experienced remarkable advances, fueled by large
models, vast datasets, accelerated hardware, and, last but not least, the transformative …

A graph-based methodology for constructing computational models that automates adjoint-based sensitivity analysis

V Gandarillas, AJ Joshy, MZ Sperry, AK Ivanov… - Structural and …, 2024 - Springer
The adjoint method provides an efficient way to compute sensitivities for system models with
a large number of inputs. However, implementing the adjoint method requires significant …

On implicit bias in overparameterized bilevel optimization

P Vicol, JP Lorraine, F Pedregosa… - International …, 2022 - proceedings.mlr.press
Many problems in machine learning involve bilevel optimization (BLO), including
hyperparameter optimization, meta-learning, and dataset distillation. Bilevel problems …

Synergies between disentanglement and sparsity: Generalization and identifiability in multi-task learning

S Lachapelle, T Deleu, D Mahajan… - International …, 2023 - proceedings.mlr.press
Although disentangled representations are often said to be beneficial for downstream tasks,
current empirical and theoretical understanding is limited. In this work, we provide evidence …

Making scalable meta learning practical

S Choe, SV Mehta, H Ahn… - Advances in neural …, 2024 - proceedings.neurips.cc
Despite its flexibility to learn diverse inductive biases in machine learning programs, meta
learning (ie,\learning to learn) has long been recognized to suffer from poor scalability due …