Deciding what to model: Value-equivalent sampling for reinforcement learning

D Arumugam, B Van Roy - Advances in neural information …, 2022 - proceedings.neurips.cc
The quintessential model-based reinforcement-learning agent iteratively refines its
estimates or prior beliefs about the true underlying model of the environment. Recent …

Satisficing exploration for deep reinforcement learning

D Arumugam, S Kumar, R Gummadi… - arXiv preprint arXiv …, 2024 - arxiv.org
A default assumption in the design of reinforcement-learning algorithms is that a decision-
making agent always explores to learn optimal behavior. In sufficiently complex …

Capacity-achieving symbol distributions for directly modulated laser and direct detection systems

D Kim, H Kim - Optics Express, 2023 - opg.optica.org
We investigate the capacity-achieving symbol distributions for directly modulated laser
(DML) and direct-detection (DD) systems utilizing probabilistic constellation shaped pulse …

Deciding what to learn: A rate-distortion approach

D Arumugam, B Van Roy - International Conference on …, 2021 - proceedings.mlr.press
Agents that learn to select optimal actions represent a prominent focus of the sequential
decision-making literature. In the face of a complex environment or constraints on time and …

The value of information when deciding what to learn

D Arumugam, B Van Roy - Advances in neural information …, 2021 - proceedings.neurips.cc
All sequential decision-making agents explore so as to acquire knowledge about a
particular target. It is often the responsibility of the agent designer to construct this target …

Computing quantum channel capacities

N Ramakrishnan, R Iten, VB Scholz… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The capacity of noisy quantum channels characterizes the highest rate at which information
can be reliably transmitted and it is therefore of practical as well as fundamental importance …

A Bregman proximal perspective on classical and quantum Blahut-Arimoto algorithms

K He, J Saunderson, H Fawzi - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The Blahut-Arimoto algorithm is a well-known method to compute classical channel
capacities and rate-distortion functions. Recent works have extended this algorithm to …

Squeezing the Arimoto–Blahut algorithm for faster convergence

Y Yu - IEEE Transactions on Information Theory, 2010 - ieeexplore.ieee.org
The Arimoto-Blahut algorithm for computing the capacity of a discrete memoryless channel
is revisited. A so-called “squeezing” strategy is used to design algorithms that preserve its …

Analysis of the convergence speed of the Arimoto-Blahut algorithm by the second-order recurrence formula

K Nakagawa, Y Takei, S Hara… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this paper, we investigate the convergence speed of the Arimoto-Blahut algorithm. For
many channel matrices, the convergence speed is exponential, but for some channel …

Quantum blahut-arimoto algorithms

N Ramakrishnan, R Iten, V Scholz… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
We generalize alternating optimization algorithms of Blahut-Arimoto type to the quantum
setting. In particular, we give iterative algorithms to compute the mutual information of …