Revisiting Peng's Q() for Modern Reinforcement Learning

T Kozuno, Y Tang, M Rowland… - … Machine Learning, 2021 - proceedings.mlr.press
… Off-policy multi-step reinforcement learning algorithms consist of conservative and non…
Motivated by the empirical results and the lack of theory, we carry out theoretical analyses of Peng’…

Multi-market bidding behavior analysis of energy storage system based on inverse reinforcement learning

Q Tang, H Guo, Q Chen - IEEE Transactions on Power Systems, 2022 - ieeexplore.ieee.org
EMPIRICAL ANALYSIS In this section, we show the empirical analysis results obtained
based on real market data to test the feasibility of the proposed framework and algorithms. This …

How to leverage unlabeled data in offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - … Machine Learning, 2022 - proceedings.mlr.press
… We perform extensive theoretical and empirical analysis to study conditions under which
this simple approach, UDS, would either excel or fail, and we analyze how reweighting the …

Reinforcement learning in practice: Opportunities and challenges

Y Li - arXiv preprint arXiv:2202.11296, 2022 - arxiv.org
… This article is a gentle discussion about the field of reinforcement learning in practice, about
… to reinforcement learning (RL), and its relationship with deep learning, machine learning

A dataset perspective on offline reinforcement learning

K Schweighofer, M Dinu, A Radler… - … Lifelong Learning …, 2022 - proceedings.mlr.press
… The application of Reinforcement Learning (RL) in real world … empirical measures for the
datasets sampled by the behavioral policy in deterministic MDPs. The first empirical measure …

A distributional code for value in dopamine-based reinforcement learning

W Dabney, Z Kurth-Nelson, N Uchida, CK Starkweather… - Nature, 2020 - nature.com
… single scalar quantity, which supports learning about the expectation, or … reinforcement
learning inspired by recent artificial intelligence research on distributional reinforcement learning

Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees

Y Luo, H Xu, Y Li, Y Tian, T Darrell, T Ma - arXiv preprint arXiv:1807.03858, 2018 - arxiv.org
… Model-based reinforcement learning (RL) is considered to be a promising approach to
reduce the sample complexity that hinders model-free RL. However, the theoretical …

Recurrent experience replay in distributed reinforcement learning

S Kapturowski, G Ostrovski, J Quan… - … on learning …, 2018 - openreview.net
Reinforcement Learning (RL) has seen a rejuvenation of research interest recently due to …
Second, we perform an empirical study into the effects of several approaches to RNN training …

Traffic signal optimization through discrete and continuous reinforcement learning with robustness analysis in downtown Tehran

M Aslani, S Seipel, MS Mesgari, M Wiering - Advanced Engineering …, 2018 - Elsevier
… In recent years, reinforcement learning (RL) has shown great potential for traffic signal
control because of its high adaptability, flexibility, and scalability. However, designing RL-…

A Dataset Perspective on Offline Reinforcement Learning

K Schweighofer, A Radler, MC Dinu… - arXiv preprint arXiv …, 2021 - arxiv.org
… The application of Reinforcement Learning (RL) in real world … empirical measures for the
datasets sampled by the behavioral policy in deterministic MDPs. The first empirical measure …