Deep reinforcement learning: An overview

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org
We give an overview of recent exciting achievements of deep reinforcement learning (RL).
We discuss six core elements, six important mechanisms, and twelve applications. We start …

Evaluation of deep learning models for multi-step ahead time series prediction

R Chandra, S Goyal, R Gupta - Ieee Access, 2021 - ieeexplore.ieee.org
Time series prediction with neural networks has been the focus of much research in the past
few decades. Given the recent deep learning revolution, there has been much attention in …

Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

Kernel instrumental variable regression

R Singh, M Sahani, A Gretton - Advances in Neural …, 2019 - proceedings.neurips.cc
Instrumental variable (IV) regression is a strategy for learning causal relationships in
observational data. If measurements of input X and output Y are confounded, the causal …

Query-based workload forecasting for self-driving database management systems

L Ma, D Van Aken, A Hefny, G Mezerhane… - Proceedings of the …, 2018 - dl.acm.org
The first step towards an autonomous database management system (DBMS) is the ability to
model the target application's workload. This is necessary to allow the system to anticipate …

Provably efficient reinforcement learning in partially observable dynamical systems

M Uehara, A Sekhari, JD Lee… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract We study Reinforcement Learning for partially observable systems using function
approximation. We propose a new PO-bilinear framework, that is general enough to include …

Pac reinforcement learning for predictive state representations

W Zhan, M Uehara, W Sun, JD Lee - arXiv preprint arXiv:2207.05738, 2022 - arxiv.org
In this paper we study online Reinforcement Learning (RL) in partially observable dynamical
systems. We focus on the Predictive State Representations (PSRs) model, which is an …

Future-dependent value-based off-policy evaluation in pomdps

M Uehara, H Kiyohara, A Bennett… - Advances in …, 2024 - proceedings.neurips.cc
We study off-policy evaluation (OPE) for partially observable MDPs (POMDPs) with general
function approximation. Existing methods such as sequential importance sampling …

Recurrent neural filters: Learning independent bayesian filtering steps for time series prediction

B Lim, S Zohren, S Roberts - 2020 International Joint …, 2020 - ieeexplore.ieee.org
Despite the recent popularity of deep generative state space models, few comparisons have
been made between network architectures and the inference steps of the Bayesian filtering …

Energy-based predictive representations for partially observed reinforcement learning

T Zhang, T Ren, C Xiao, W Xiao… - Uncertainty in …, 2023 - proceedings.mlr.press
In real-world applications, handling partial observability is a common requirement for
reinforcement learning algorithms, which is not captured by a Markov decision process …