Fast differentiable sorting and ranking

M Blondel, O Teboul, Q Berthet… - … on Machine Learning, 2020 - proceedings.mlr.press
The sorting operation is one of the most commonly used building blocks in computer
programming. In machine learning, it is often used for robust statistics. However, seen as a …

Differential properties of sinkhorn approximation for learning with wasserstein distance

G Luise, A Rudi, M Pontil… - Advances in Neural …, 2018 - proceedings.neurips.cc
Applications of optimal transport have recently gained remarkable attention as a result of the
computational advantages of entropic regularization. However, in most situations the …

On fast leverage score sampling and optimal learning

A Rudi, D Calandriello, L Carratino… - Advances in Neural …, 2018 - proceedings.neurips.cc
Leverage score sampling provides an appealing way to perform approximate com-putations
for large matrices. Indeed, it allows to derive faithful approximations with a complexity …

Geometry-aware adaptation for pretrained models

N Roberts, X Li, D Adila, S Cromp… - Advances in …, 2024 - proceedings.neurips.cc
Abstract Machine learning models---including prominent zero-shot models---are often
trained on datasets whose labels are only a small proportion of a larger label space. Such …

Structured prediction with partial labelling through the infimum loss

V Cabannnes, A Rudi, F Bach - International Conference on …, 2020 - proceedings.mlr.press
Annotating datasets is one of the main costs in nowadays supervised learning. The goal of
weak supervision is to enable models to learn using only forms of labelling which are …

A general framework for consistent structured prediction with implicit loss embeddings

C Ciliberto, L Rosasco, A Rudi - Journal of Machine Learning Research, 2020 - jmlr.org
We propose and analyze a novel theoretical and algorithmic framework for structured
prediction. While so far the term has referred to discrete output spaces, here we consider …

Vector-valued least-squares regression under output regularity assumptions

L Brogat-Motte, A Rudi, C Brouard, J Rousu… - Journal of Machine …, 2022 - jmlr.org
We propose and analyse a reduced-rank method for solving least-squares regression
problems with infinite dimensional output. We derive learning bounds for our method, and …

Disambiguation of weak supervision leading to exponential convergence rates

VA Cabannnes, F Bach, A Rudi - … Conference on Machine …, 2021 - proceedings.mlr.press
Abstract Machine learning approached through supervised learning requires expensive
annotation of data. This motivates weakly supervised learning, where data are annotated …

A prediction model for ranking branch-and-bound procedures for the resource-constrained project scheduling problem

W Guo, M Vanhoucke, J Coelho - European Journal of Operational …, 2023 - Elsevier
The branch-and-bound (B&B) procedure is one of the most widely used techniques to get
optimal solutions for the resource-constrained project scheduling problem (RCPSP) …

Predictive inference with weak supervision

M Cauchois, S Gupta, A Ali, JC Duchi - Journal of Machine Learning …, 2024 - jmlr.org
The expense of acquiring labels in large-scale statistical machine learning makes partially
and weakly-labeled data attractive, though it is not always apparent how to leverage such …