Practical learning of predictive state representations

R Yu, S Zheng, A Anandkumar, Y Yue - 2018 - openreview.net

We present Tensor-Train RNN (TT-RNN), a novel family of neural sequence architectures for
multivariate forecasting in environments with nonlinear dynamics. Long-term forecasting in …

被引用次数：162 相关文章所有 6 个版本

[PDF] arxiv.org

Decoupling dynamics and reward for transfer learning

A Zhang, H Satija, J Pineau - arXiv preprint arXiv:1804.10689, 2018 - arxiv.org

Current reinforcement learning (RL) methods can successfully learn single tasks but often
generalize poorly to modest perturbations in task domain or training procedure. In this work …

被引用次数：90 相关文章所有 5 个版本

[PDF] arxiv.org

Long-term forecasting using higher order tensor RNNs

R Yu, S Zheng, A Anandkumar, Y Yue - arXiv preprint arXiv:1711.00073, 2017 - arxiv.org

We present Higher-Order Tensor RNN (HOT-RNN), a novel family of neural sequence
architectures for multivariate forecasting in environments with nonlinear dynamics. Long …

被引用次数：72 相关文章所有 2 个版本

[PDF] neurips.cc

Predictive state recurrent neural networks

C Downey, A Hefny, B Boots… - Advances in Neural …, 2017 - proceedings.neurips.cc

We present a new model, Predictive State Recurrent Neural Networks (PSRNNs), for filtering
and prediction in dynamical systems. PSRNNs draw on insights from both Recurrent Neural …

被引用次数：59 相关文章所有 9 个版本

[PDF] wisc.edu

[图书][B] Display ad measurement using observational data: A reinforcement learning approach

S Tunuguntla - 2022 - search.proquest.com

This paper introduces an observational framework for measuring the effects of display
advertising. Causal measurement of display ad effects using observational data presents a …

被引用次数：3 相关文章

Apprenticeship learning for a predictive state representation of anesthesia

P Humbert, C Dubost, J Audiffren… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

Objective: In this paper, we present an original decision support algorithm to assist the
anesthesiologists delivery of drugs to maintain the optimal Depth of Anesthesia (DoA) …

被引用次数：3 相关文章所有 5 个版本

[PDF] mcgill.ca

Towards alignment of Reinforcement Learning agents; for consideration of safety, robustness and fairness.

H Satija - 2024 - escholarship.mcgill.ca

Reinforcement Learning (RL) has emerged as the standard paradigm for sequential
decision-making and a framework for general intelligence. At its core, the RL problem is one …

[PDF] ualberta.ca

An Exploration of Predictive Representations of State

C Ma - 2020 - era.library.ualberta.ca

The predictive representations hypothesis is that representing the state of the world in terms
of predictions about the future will result in good generalization. In this thesis, good …

被引用次数：1 相关文章

[PDF] laurentoudre.fr

[PDF][PDF] Apprenticeship Learning for a Predictive State Representation of Anesthesia

J Audiffren, L Oudre - laurentoudre.fr

Objective: In this paper, we present an original decision support algorithm to assist the
anesthesiologists delivery of drugs to maintain the optimal Depth of Anesthesia (DoA) …

A Consistent Method for Learning OOMs from Asymptotically Stationary Time Series Data Containing Missing Values

T Liu - arXiv preprint arXiv:1808.03873, 2018 - arxiv.org

In the traditional framework of spectral learning of stochastic time series models, model
parameters are estimated based on trajectories of fully recorded observations. However …

高级搜索

QQ 群