Z He, W Qiu, W Zhao, X Shao, Z Liu - Information Sciences, 2025 - Elsevier
In model-based reinforcement learning, the conventional approach to addressing world
model bias is to use gradient optimization methods. However, using a singular policy from …