Understanding reinforcement learning-based fine-tuning of diffusion models: A tutorial and review

M Uehara, Y Zhao, T Biancalani, S Levine - arXiv preprint arXiv …, 2024 - arxiv.org
This tutorial provides a comprehensive survey of methods for fine-tuning diffusion models to
optimize downstream reward functions. While diffusion models are widely known to provide …

Diffusion Schrödinger bridge matching

Y Shi, V De Bortoli, A Campbell… - Advances in Neural …, 2024 - proceedings.neurips.cc
Solving transport problems, ie finding a map transporting one given distribution to another,
has numerous applications in machine learning. Novel mass transport methods motivated …

Diffusion schrödinger bridge with applications to score-based generative modeling

V De Bortoli, J Thornton, J Heng… - Advances in Neural …, 2021 - proceedings.neurips.cc
Progressively applying Gaussian noise transforms complex data distributions to
approximately Gaussian. Reversing this dynamic defines a generative model. When the …

An invitation to sequential Monte Carlo samplers

C Dai, J Heng, PE Jacob, N Whiteley - Journal of the American …, 2022 - Taylor & Francis
ABSTRACT Statisticians often use Monte Carlo methods to approximate probability
distributions, primarily with Markov chain Monte Carlo and importance sampling. Sequential …

Solving schrödinger bridges via maximum likelihood

F Vargas, P Thodoroff, A Lamacraft, N Lawrence - Entropy, 2021 - mdpi.com
The Schrödinger bridge problem (SBP) finds the most likely stochastic evolution between
two probability distributions given a prior stochastic evolution. As well as applications in the …

Stochastic control liaisons: Richard sinkhorn meets gaspard monge on a schrodinger bridge

Y Chen, TT Georgiou, M Pavon - Siam Review, 2021 - SIAM
In 1931--1932, Erwin Schrödinger studied a hot gas Gedankenexperiment (an instance of
large deviations of the empirical distribution). Schrödinger's problem represents an early …

Improved sampling via learned diffusions

L Richter, J Berner - arXiv preprint arXiv:2307.01198, 2023 - arxiv.org
Recently, a series of papers proposed deep learning-based approaches to sample from
target distributions using controlled diffusion processes, being trained only on the …

Conditional simulation using diffusion Schrödinger bridges

Y Shi, V De Bortoli, G Deligiannidis… - Uncertainty in Artificial …, 2022 - proceedings.mlr.press
Denoising diffusion models have recently emerged as a powerful class of generative
models. They provide state-of-the-art results, not only for unconditional simulation, but also …

Score-based diffusion meets annealed importance sampling

A Doucet, W Grathwohl, AG Matthews… - Advances in Neural …, 2022 - proceedings.neurips.cc
More than twenty years after its introduction, Annealed Importance Sampling (AIS) remains
one of the most effective methods for marginal likelihood estimation. It relies on a sequence …

The schrödinger bridge between gaussian measures has a closed form

C Bunne, YP Hsieh, M Cuturi… - … Conference on Artificial …, 2023 - proceedings.mlr.press
The static optimal transport $(\mathrm {OT}) $ problem between Gaussians seeks to recover
an optimal map, or more generally a coupling, to morph a Gaussian into another. It has been …