J Lee,
E Ryu - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Value Iteration (VI) is foundational to the theory and practice of modern reinforcement
learning, and it is known to converge at a $\mathcal {O}(\gamma^ k) $-rate. Surprisingly …