D Tolpin, T Dobkin - arXiv preprint arXiv:2108.03834, 2021 - arxiv.org
It is well known that reinforcement learning can be cast as inference in an appropriate
probabilistic model. However, this commonly involves introducing a distribution over agent …