Predictive state recurrent neural networks

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org

We give an overview of recent exciting achievements of deep reinforcement learning (RL).
We discuss six core elements, six important mechanisms, and twelve applications. We start …

被引用次数：1941 相关文章所有 6 个版本

[PDF] ieee.org

Evaluation of deep learning models for multi-step ahead time series prediction

R Chandra, S Goyal, R Gupta - Ieee Access, 2021 - ieeexplore.ieee.org

Time series prediction with neural networks has been the focus of much research in the past
few decades. Given the recent deep learning revolution, there has been much attention in …

被引用次数：176 相关文章所有 7 个版本

[PDF] arxiv.org

Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer

Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

被引用次数：366 相关文章所有 9 个版本

[PDF] neurips.cc

Kernel instrumental variable regression

R Singh, M Sahani, A Gretton - Advances in Neural …, 2019 - proceedings.neurips.cc

Instrumental variable (IV) regression is a strategy for learning causal relationships in
observational data. If measurements of input X and output Y are confounded, the causal …

被引用次数：191 相关文章所有 11 个版本

[PDF] cmu.edu

Query-based workload forecasting for self-driving database management systems

L Ma, D Van Aken, A Hefny, G Mezerhane… - Proceedings of the …, 2018 - dl.acm.org

The first step towards an autonomous database management system (DBMS) is the ability to
model the target application's workload. This is necessary to allow the system to anticipate …

被引用次数：252 相关文章所有 16 个版本

[PDF] neurips.cc

Provably efficient reinforcement learning in partially observable dynamical systems

M Uehara, A Sekhari, JD Lee… - Advances in Neural …, 2022 - proceedings.neurips.cc

Abstract We study Reinforcement Learning for partially observable systems using function
approximation. We propose a new PO-bilinear framework, that is general enough to include …

被引用次数：35 相关文章所有 8 个版本

[PDF] arxiv.org

Pac reinforcement learning for predictive state representations

W Zhan, M Uehara, W Sun, JD Lee - arXiv preprint arXiv:2207.05738, 2022 - arxiv.org

In this paper we study online Reinforcement Learning (RL) in partially observable dynamical
systems. We focus on the Predictive State Representations (PSRs) model, which is an …

被引用次数：43 相关文章所有 3 个版本

[PDF] neurips.cc

Future-dependent value-based off-policy evaluation in pomdps

M Uehara, H Kiyohara, A Bennett… - Advances in …, 2024 - proceedings.neurips.cc

We study off-policy evaluation (OPE) for partially observable MDPs (POMDPs) with general
function approximation. Existing methods such as sequential importance sampling …

被引用次数：18 相关文章所有 8 个版本

[PDF] arxiv.org

Recurrent neural filters: Learning independent bayesian filtering steps for time series prediction

B Lim, S Zohren, S Roberts - 2020 International Joint …, 2020 - ieeexplore.ieee.org

Despite the recent popularity of deep generative state space models, few comparisons have
been made between network architectures and the inference steps of the Bayesian filtering …

被引用次数：60 相关文章所有 8 个版本

[PDF] mlr.press

Energy-based predictive representations for partially observed reinforcement learning

T Zhang, T Ren, C Xiao, W Xiao… - Uncertainty in …, 2023 - proceedings.mlr.press

In real-world applications, handling partial observability is a common requirement for
reinforcement learning algorithms, which is not captured by a Markov decision process …

被引用次数：3 相关文章所有 5 个版本

高级搜索

QQ 群