Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

S Ding, K Hu, Z Zhang, K Ren, W Zhang, J Yu… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion models have garnered widespread attention in Reinforcement Learning (RL) for
their powerful expressiveness and multimodality. It has been verified that utilizing diffusion …