Behavior regularized offline reinforcement learning

Y Wu, G Tucker, O Nachum - arXiv preprint arXiv:1911.11361, 2019 - arxiv.org
In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …

[HTML][HTML] Behavior Regularized Offline Reinforcement Learning

Y Wu, G Tucker, O Nachum - researchain.net
In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …

Behavior Regularized Offline Reinforcement Learning

Y Wu, G Tucker, O Nachum - arXiv e-prints, 2019 - ui.adsabs.harvard.edu
In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …

[PDF][PDF] Behavior Regularized Offline Reinforcement Learning

Y Wu, G Tucker, O Nachum - arXiv preprint arXiv:1911.11361, 2019 - cs.cmu.edu
In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …

Behavior Regularized Offline Reinforcement Learning

Y Wu, G Tucker, O Nachum - openreview.net
In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …