C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision making by using interaction samples of an agent with its environment and the potentially …
S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data. Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
The two fields of machine learning and graphical causality arose and are developed separately. However, there is, now, cross-pollination and increasing interest in both fields to …
R Kidambi, A Rajeswaran… - Advances in neural …, 2020 - proceedings.neurips.cc
In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based solely on a dataset of historical interactions with the environment. This serves as an extreme …
Guidelines for reinforcement learning in healthcare | Nature Medicine Skip to main content Thank you for visiting nature.com. You are using a browser version with limited support for …
S Tang, J Wiens - Machine Learning for Healthcare …, 2021 - proceedings.mlr.press
Reinforcement learning (RL) can be used to learn treatment policies and aid decision making in healthcare. However, given the need for generalization over complex state/action …
We consider the offline reinforcement learning problem, where the aim is to learn a decision making policy from logged data. Offline RL--particularly when coupled with (value) function …
Many reinforcement learning (RL) applications have combinatorial action spaces, where each action is a composition of sub-actions. A standard RL approach ignores this inherent …
Z Xue, P Zhou, Z Xu, X Wang, Y Xie… - IEEE Internet of …, 2021 - ieeexplore.ieee.org
Internet-of-Things-enabled E-health system, which could monitor and collect the personal health information (PHI), has gradually transformed the clinical treatment to a more …