所有版本 - 学术资源搜索

Reinforcement learning with feedback graphs

C Dann, Y Mansour, M Mohri… - Advances in Neural …, 2020 - proceedings.neurips.cc

We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transitions samples. Such additional observations can be provided in …

被引用次数：12 相关文章

Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari… - arXiv e …, 2020 - ui.adsabs.harvard.edu

We study episodic reinforcement learning in Markov decision processes when the agent
receives additional feedback per step in the form of several transition observations. Such …

Reinforcement learning with feedback graphs

C Dann, Y Mansour, M Mohri, A Sekhari… - Proceedings of the 34th …, 2020 - dl.acm.org

We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transitions samples. Such additional observations can be provided in …

Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari… - arXiv preprint arXiv …, 2020 - arxiv.org

We study episodic reinforcement learning in Markov decision processes when the agent
receives additional feedback per step in the form of several transition observations. Such …

Reinforcement learning with feedback graphs

C Dann, Y Mansour, M Mohri… - Advances in Neural …, 2020 - nyuscholars.nyu.edu

We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transitions samples. Such additional observations can be provided in …

Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan - research.google

We study reinforcement learning in tabular MDPs where the agent receives additional side
observations per step in the form of several transition samples--eg from data augmentation …

[PDF] academia.edu

[PDF][PDF] Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan - academia.edu

We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transitions samples. Such additional observations can be provided in …

[PDF] neurips.cc

[PDF][PDF] Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan - proceedings.neurips.cc

We study RL in the tabular MDP setting where the agent receives additional observations
per step in the form of transitions samples. Such additional observations can be provided in …

Reinforcement Learning with Feedback Graphs

C Dann, Y Mansour, M Mohri, A Sekhari, K Sridharan - research.google

We study reinforcement learning in tabular MDPs where the agent receives additional side
observations per step in the form of several transition samples--eg from data augmentation …

高级搜索

QQ 群

Reinforcement learning with feedback graphs

Reinforcement Learning with Feedback Graphs

Reinforcement learning with feedback graphs

Reinforcement Learning with Feedback Graphs

Reinforcement learning with feedback graphs

Reinforcement Learning with Feedback Graphs

[PDF][PDF] Reinforcement Learning with Feedback Graphs

[PDF][PDF] Reinforcement Learning with Feedback Graphs

Reinforcement Learning with Feedback Graphs

引用