Reinforcement learning with feedback graphs

C Dann, Y Mansour, M Mohri… - Advances in Neural …, 2020 - proceedings.neurips.cc
We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transition samples. Such additional observations can be provided in …

Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari… - arXiv e …, 2020 - ui.adsabs.harvard.edu
We study episodic reinforcement learning in Markov decision processes when the agent
receives additional feedback per step in the form of several transition observations. Such …

Reinforcement learning with feedback graphs

C Dann, Y Mansour, M Mohri, A Sekhari… - Proceedings of the 34th …, 2020 - dl.acm.org
We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transition samples. Such additional observations can be provided in …

Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari… - arXiv preprint arXiv …, 2020 - arxiv.org
We study episodic reinforcement learning in Markov decision processes when the agent
receives additional feedback per step in the form of several transition observations. Such …

Reinforcement learning with feedback graphs

C Dann, Y Mansour, M Mohri… - Advances in Neural …, 2020 - nyuscholars.nyu.edu
We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transition samples. Such additional observations can be provided in …

Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan - research.google
We study reinforcement learning in tabular MDPs where the agent receives additional side
observations per step in the form of several transition samples, e.g., from data augmentation …

[PDF] Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan - academia.edu
We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transition samples. Such additional observations can be provided in …

[PDF] Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan - proceedings.neurips.cc
We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transition samples. Such additional observations can be provided in …
