Efficient and modular implicit differentiation

U Sadana, A Chenreddy, E Delage, A Forel… - European Journal of …, 2024 - Elsevier

Recently there has been a surge of interest in operations research (OR) and the machine
learning (ML) community in combining prediction algorithms and optimization techniques to …

被引用次数：93 相关文章所有 7 个版本

[PDF] arxiv.org

On neural differential equations

P Kidger - arXiv preprint arXiv:2202.02435, 2022 - arxiv.org

The conjoining of dynamical systems and deep learning has become a topic of great
interest. In particular, neural differential equations (NDEs) demonstrate that neural networks …

被引用次数：364 相关文章所有 4 个版本

[PDF] neurips.cc

Theseus: A library for differentiable nonlinear optimization

L Pineda, T Fan, M Monge… - Advances in …, 2022 - proceedings.neurips.cc

We present Theseus, an efficient application-agnostic open source library for differentiable
nonlinear least squares (DNLS) optimization built on PyTorch, providing a common …

被引用次数：93 相关文章所有 6 个版本

[PDF] thecvf.com

Craft: Concept recursive activation factorization for explainability

T Fel, A Picard, L Bethune, T Boissin… - Proceedings of the …, 2023 - openaccess.thecvf.com

Attribution methods are a popular class of explainability methods that use heatmaps to
depict the most important areas of an image that drive a model decision. Nevertheless …

被引用次数：92 相关文章所有 18 个版本

[PDF] mlr.press

Linear adversarial concept erasure

S Ravfogel, M Twiton, Y Goldberg… - … on Machine Learning, 2022 - proceedings.mlr.press

Modern neural models trained on textual data rely on pre-trained representations that
emerge without direct supervision. As these representations are increasingly being used in …

被引用次数：87 相关文章所有 6 个版本

[PDF] arxiv.org

The elements of differentiable programming

M Blondel, V Roulet - arXiv preprint arXiv:2403.14606, 2024 - arxiv.org

Artificial intelligence has recently experienced remarkable advances, fueled by large
models, vast datasets, accelerated hardware, and, last but not least, the transformative …

被引用次数：30 相关文章所有 2 个版本

A graph-based methodology for constructing computational models that automates adjoint-based sensitivity analysis

V Gandarillas, AJ Joshy, MZ Sperry, AK Ivanov… - Structural and …, 2024 - Springer

The adjoint method provides an efficient way to compute sensitivities for system models with
a large number of inputs. However, implementing the adjoint method requires significant …

被引用次数：39 相关文章所有 2 个版本

[PDF] mlr.press

On implicit bias in overparameterized bilevel optimization

P Vicol, JP Lorraine, F Pedregosa… - International …, 2022 - proceedings.mlr.press

Many problems in machine learning involve bilevel optimization (BLO), including
hyperparameter optimization, meta-learning, and dataset distillation. Bilevel problems …

被引用次数：47 相关文章所有 12 个版本

[PDF] mlr.press

Synergies between disentanglement and sparsity: Generalization and identifiability in multi-task learning

S Lachapelle, T Deleu, D Mahajan… - International …, 2023 - proceedings.mlr.press

Although disentangled representations are often said to be beneficial for downstream tasks,
current empirical and theoretical understanding is limited. In this work, we provide evidence …

被引用次数：32 相关文章所有 5 个版本

[PDF] neurips.cc

Making scalable meta learning practical

S Choe, SV Mehta, H Ahn… - Advances in neural …, 2024 - proceedings.neurips.cc

Despite its flexibility to learn diverse inductive biases in machine learning programs, meta
learning (ie,\learning to learn) has long been recognized to suffer from poor scalability due …

被引用次数：16 相关文章所有 6 个版本

高级搜索

QQ 群