Optimizing for the future in non-stationary mdps

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：264 相关文章所有 9 个版本

[HTML] tandfonline.com

[HTML][HTML] Reinforcement learning for systems pharmacology-oriented and personalized drug design

RK Tan, Y Liu, L Xie - Expert opinion on drug discovery, 2022 - Taylor & Francis

Introduction Many multi-genic systemic diseases such as neurological disorders,
inflammatory diseases, and the majority of cancers do not have effective treatments yet …

被引用次数：20 相关文章所有 4 个版本

[PDF] neurips.cc

Factored adaptation for non-stationary reinforcement learning

F Feng, B Huang, K Zhang… - Advances in Neural …, 2022 - proceedings.neurips.cc

Dealing with non-stationarity in environments (eg, in the transition dynamics) and objectives
(eg, in the reward functions) is a challenging problem that is crucial in real-world …

被引用次数：33 相关文章所有 9 个版本

[PDF] mlr.press

Deep reinforcement learning amidst continual structured non-stationarity

A Xie, J Harrison, C Finn - International Conference on …, 2021 - proceedings.mlr.press

As humans, our goals and our environment are persistently changing throughout our lifetime
based on our experiences, actions, and internal and external drives. In contrast, typical …

被引用次数：37 相关文章所有 3 个版本

[PDF] neurips.cc

Universal off-policy evaluation

Y Chandak, S Niekum, B da Silva… - Advances in …, 2021 - proceedings.neurips.cc

When faced with sequential decision-making problems, it is often useful to be able to predict
what would happen if decisions were made using a new policy. Those predictions must …

被引用次数：51 相关文章所有 11 个版本

[PDF] arxiv.org

Deep reinforcement learning amidst lifelong non-stationarity

A Xie, J Harrison, C Finn - arXiv preprint arXiv:2006.10701, 2020 - arxiv.org

As humans, our goals and our environment are persistently changing throughout our lifetime
based on our experiences, actions, and internal and external drives. In contrast, typical …

被引用次数：66 相关文章所有 3 个版本

[PDF] aaai.org

Provably efficient primal-dual reinforcement learning for cmdps with non-stationary objectives and constraints

Y Ding, J Lavaei - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

We consider primal-dual-based reinforcement learning (RL) in episodic constrained Markov
decision processes (CMDPs) with non-stationary objectives and constraints, which plays a …

被引用次数：24 相关文章所有 5 个版本

[PDF] royalsocietypublishing.org Full View

Bridging adaptive management and reinforcement learning for more robust decisions

M Chapman, L Xu, M Lapeyrolerie… - … Transactions of the …, 2023 - royalsocietypublishing.org

From out-competing grandmasters in chess to informing high-stakes healthcare decisions,
emerging methods from artificial intelligence are increasingly capable of making complex …

被引用次数：8 相关文章所有 7 个版本

[PDF] arxiv.org

Autonomous reinforcement learning: Formalism and benchmarking

A Sharma, K Xu, N Sardana, A Gupta… - arXiv preprint arXiv …, 2021 - arxiv.org

Reinforcement learning (RL) provides a naturalistic framing for learning through trial and
error, which is appealing both because of its simplicity and effectiveness and because of its …

被引用次数：26 相关文章所有 3 个版本

[PDF] neurips.cc

Tempo adaptation in non-stationary reinforcement learning

H Lee, Y Ding, J Lee, M Jin… - Advances in Neural …, 2024 - proceedings.neurips.cc

We first raise and tackle a``time synchronization''issue between the agent and the
environment in non-stationary reinforcement learning (RL), a crucial factor hindering its real …

被引用次数：3 相关文章所有 8 个版本

高级搜索

QQ 群