Reinforcement learning for scriptless testing: An empirical investigation of reward functions

O Rodríguez-Valdés, TEJ Vos, B Marín… - … Conference on Research …, 2023 - Springer
… This paper explores using Reinforcement Learning (RL) [20] to implement more sophisticated
ways to select actions. RL is a branch of machine learning that is directly applicable in …

Software-in-the-Loop combined Reinforcement learning method for Dynamic Response Analysis of FOWTs

P Chen, J Chen, Z Hu - Frontiers in Marine Science, 2021 - frontiersin.org
… AI technology discussed in this paper is a reinforcement learning algorithm named DDPG.
An in-house program named DARwind is utilized to run the dynamics response analysis of …

Sample efficient reinforcement learning with REINFORCE

J Zhang, J Kim, B O'Donoghue, S Boyd - Proceedings of the AAAI …, 2021 - ojs.aaai.org
… In this paper, we consider the episodic reinforcement learning setting in which the agent
accesses p and r by interacting with the environment over successive episodes, ie, the agent …

Behavior regularized offline reinforcement learning

Y Wu, G Tucker, O Nachum - arXiv preprint arXiv:1911.11361, 2019 - arxiv.org
In reinforcement learning (RL) research, it is common to assume access to direct online
interactions with the environment. However in many real-world applications, access to the …

Information-theoretic considerations in batch reinforcement learning

J Chen, N Jiang - … Conference on Machine Learning, 2019 - proceedings.mlr.press
… We are concerned with value-function approximation in batch-mode reinforcement
learning, which is related to and sometimes known as Approximate Dynamic Programming (ADP; …

Stability-certified reinforcement learning: A control-theoretic perspective

M Jin, J Lavaei - IEEE Access, 2020 - ieeexplore.ieee.org
… problem of certifying stability of reinforcement learning policies when interconnected with …
; furthermore, we analyze and establish its (non)conservatism. Empirical evaluations on two …

Reinforcement learning for strategic recommendations

G Theocharous, Y Chandak, PS Thomas… - arXiv preprint arXiv …, 2020 - arxiv.org
… In Section 4 we present a practical reinforcement learning (RL) algorithm for implementing
an SR system with an application to ad offers. In Section 5 we present an algorithm for safely …

Reinforcement learning in configurable continuous environments

AM Metelli, E Ghelfi, M Restelli - … on Machine Learning, 2019 - proceedings.mlr.press
… After introducing our approach and providing a finite-sample analysis, we empirically
evaluate REMPS on both benchmark and realistic environments by comparing our results with …

Adaptive reward-poisoning attacks against reinforcement learning

X Zhang, Y Ma, A Singla, X Zhu - … on Machine Learning, 2020 - proceedings.mlr.press
… Next, we demonstrate how to solve for the optimal attack problem in practice, and empirically
show that with the techniques from Deep Reinforcement Learning (DRL), we can find …

Revisiting Peng's Q() for Modern Reinforcement Learning

T Kozuno, Y Tang, M Rowland… - … Machine Learning, 2021 - proceedings.mlr.press
… Off-policy multi-step reinforcement learning algorithms consist of conservative and non…
Motivated by the empirical results and the lack of theory, we carry out theoretical analyses of Peng’…