reinforcement learning empirical analysis- 学术资源搜索

Reinforcement learning for scriptless testing: An empirical investigation of reward functions

O Rodríguez-Valdés, TEJ Vos, B Marín… - … Conference on Research …, 2023 - Springer

… This paper explores using Reinforcement Learning (RL) [20] to implement more sophisticated
ways to select actions. RL is a branch of machine learning that is directly applicable in …

被引用次数：1 相关文章所有 3 个版本

[PDF] frontiersin.org

Software-in-the-Loop combined Reinforcement learning method for Dynamic Response Analysis of FOWTs

P Chen, J Chen, Z Hu - Frontiers in Marine Science, 2021 - frontiersin.org

… AI technology discussed in this paper is a reinforcement learning algorithm named DDPG.
An in-house program named DARwind is utilized to run the dynamics response analysis of …

被引用次数：17 相关文章所有 8 个版本

[PDF] aaai.org

Sample efficient reinforcement learning with REINFORCE

J Zhang, J Kim, B O'Donoghue, S Boyd - Proceedings of the AAAI …, 2021 - ojs.aaai.org

… In this paper, we consider the episodic reinforcement learning setting in which the agent
accesses p and r by interacting with the environment over successive episodes, ie, the agent …

被引用次数：83 相关文章所有 10 个版本

[PDF] arxiv.org

Behavior regularized offline reinforcement learning

Y Wu, G Tucker, O Nachum - arXiv preprint arXiv:1911.11361, 2019 - arxiv.org

In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …

被引用次数：692 相关文章所有 5 个版本

[PDF] mlr.press

Information-theoretic considerations in batch reinforcement learning

J Chen, N Jiang - … Conference on Machine Learning, 2019 - proceedings.mlr.press

… We are concerned with value-function approximation in batch-mode reinforcement
learning, which is related to and sometimes known as Approximate Dynamic Programming (ADP; …

被引用次数：376 相关文章所有 8 个版本

[PDF] ieee.org

Stability-certified reinforcement learning: A control-theoretic perspective

M Jin, J Lavaei - IEEE Access, 2020 - ieeexplore.ieee.org

… problem of certifying stability of reinforcement learning policies when interconnected with …
; furthermore, we analyze and establish its (non)conservatism. Empirical evaluations on two …

被引用次数：111 相关文章所有 10 个版本

[PDF] arxiv.org

Reinforcement learning for strategic recommendations

G Theocharous, Y Chandak, PS Thomas… - arXiv preprint arXiv …, 2020 - arxiv.org

… In Section 4 we present a practical reinforcement learning (RL) algorithm for implementing
an SR system with an application to ad offers. In Section 5 we present an algorithm for safely …

被引用次数：12 相关文章所有 4 个版本

[PDF] mlr.press

Reinforcement learning in configurable continuous environments

AM Metelli, E Ghelfi, M Restelli - … on Machine Learning, 2019 - proceedings.mlr.press

… After introducing our approach and providing a finite-sample analysis, we empirically
evaluate REMPS on both benchmark and realistic environments by comparing our results with …

被引用次数：18 相关文章所有 11 个版本

[PDF] mlr.press

Adaptive reward-poisoning attacks against reinforcement learning

X Zhang, Y Ma, A Singla, X Zhu - … on Machine Learning, 2020 - proceedings.mlr.press

… Next, we demonstrate how to solve for the optimal attack problem in practice, and empirically
show that with the techniques from Deep Reinforcement Learning (DRL), we can find …

被引用次数：132 相关文章所有 10 个版本

[PDF] mlr.press

Revisiting Peng's Q() for Modern Reinforcement Learning

T Kozuno, Y Tang, M Rowland… - … Machine Learning, 2021 - proceedings.mlr.press

… Off-policy multi-step reinforcement learning algorithms consist of conservative and non…
Motivated by the empirical results and the lack of theory, we carry out theoretical analyses of Peng’…

被引用次数：21 相关文章所有 8 个版本

高级搜索

QQ 群

Reinforcement learning for scriptless testing: An empirical investigation of reward functions

Software-in-the-Loop combined Reinforcement learning method for Dynamic Response Analysis of FOWTs

Sample efficient reinforcement learning with REINFORCE

Behavior regularized offline reinforcement learning

Information-theoretic considerations in batch reinforcement learning

Stability-certified reinforcement learning: A control-theoretic perspective

Reinforcement learning for strategic recommendations

Reinforcement learning in configurable continuous environments

Adaptive reward-poisoning attacks against reinforcement learning

Revisiting Peng's Q() for Modern Reinforcement Learning

引用