Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Reinforcement learning in healthcare: A survey

C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

Toward causal representation learning

B Schölkopf, F Locatello, S Bauer, NR Ke… - Proceedings of the …, 2021 - ieeexplore.ieee.org
The two fields of machine learning and graphical causality arose and are developed
separately. However, there is, now, cross-pollination and increasing interest in both fields to …

Morel: Model-based offline reinforcement learning

R Kidambi, A Rajeswaran… - Advances in neural …, 2020 - proceedings.neurips.cc
In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based
solely on a dataset of historical interactions with the environment. This serves as an extreme …

Guidelines for reinforcement learning in healthcare

O Gottesman, F Johansson, M Komorowski, A Faisal… - Nature medicine, 2019 - nature.com
Guidelines for reinforcement learning in healthcare | Nature Medicine Skip to main content
Thank you for visiting nature.com. You are using a browser version with limited support for …

Model selection for offline reinforcement learning: Practical considerations for healthcare settings

S Tang, J Wiens - Machine Learning for Healthcare …, 2021 - proceedings.mlr.press
Reinforcement learning (RL) can be used to learn treatment policies and aid decision
making in healthcare. However, given the need for generalization over complex state/action …

Offline reinforcement learning: Fundamental barriers for value function approximation

DJ Foster, A Krishnamurthy, D Simchi-Levi… - arXiv preprint arXiv …, 2021 - arxiv.org
We consider the offline reinforcement learning problem, where the aim is to learn a decision
making policy from logged data. Offline RL--particularly when coupled with (value) function …

Leveraging factored action spaces for efficient offline reinforcement learning in healthcare

S Tang, M Makar, M Sjoding… - Advances in Neural …, 2022 - proceedings.neurips.cc
Many reinforcement learning (RL) applications have combinatorial action spaces, where
each action is a composition of sub-actions. A standard RL approach ignores this inherent …

A resource-constrained and privacy-preserving edge-computing-enabled clinical decision system: A federated reinforcement learning approach

Z Xue, P Zhou, Z Xu, X Wang, Y Xie… - IEEE Internet of …, 2021 - ieeexplore.ieee.org
Internet-of-Things-enabled E-health system, which could monitor and collect the personal
health information (PHI), has gradually transformed the clinical treatment to a more …