Long-term forecasting using tensor-train rnns

R Yu, S Zheng, A Anandkumar, Y Yue - 2018 - openreview.net
We present Tensor-Train RNN (TT-RNN), a novel family of neural sequence architectures for
multivariate forecasting in environments with nonlinear dynamics. Long-term forecasting in …

Decoupling dynamics and reward for transfer learning

A Zhang, H Satija, J Pineau - arXiv preprint arXiv:1804.10689, 2018 - arxiv.org
Current reinforcement learning (RL) methods can successfully learn single tasks but often
generalize poorly to modest perturbations in task domain or training procedure. In this work …

Long-term forecasting using higher order tensor RNNs

R Yu, S Zheng, A Anandkumar, Y Yue - arXiv preprint arXiv:1711.00073, 2017 - arxiv.org
We present Higher-Order Tensor RNN (HOT-RNN), a novel family of neural sequence
architectures for multivariate forecasting in environments with nonlinear dynamics. Long …

Predictive state recurrent neural networks

C Downey, A Hefny, B Boots… - Advances in Neural …, 2017 - proceedings.neurips.cc
We present a new model, Predictive State Recurrent Neural Networks (PSRNNs), for filtering
and prediction in dynamical systems. PSRNNs draw on insights from both Recurrent Neural …

[图书][B] Display ad measurement using observational data: A reinforcement learning approach

S Tunuguntla - 2022 - search.proquest.com
This paper introduces an observational framework for measuring the effects of display
advertising. Causal measurement of display ad effects using observational data presents a …

Apprenticeship learning for a predictive state representation of anesthesia

P Humbert, C Dubost, J Audiffren… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Objective: In this paper, we present an original decision support algorithm to assist the
anesthesiologists delivery of drugs to maintain the optimal Depth of Anesthesia (DoA) …

Towards alignment of Reinforcement Learning agents; for consideration of safety, robustness and fairness.

H Satija - 2024 - escholarship.mcgill.ca
Reinforcement Learning (RL) has emerged as the standard paradigm for sequential
decision-making and a framework for general intelligence. At its core, the RL problem is one …

An Exploration of Predictive Representations of State

C Ma - 2020 - era.library.ualberta.ca
The predictive representations hypothesis is that representing the state of the world in terms
of predictions about the future will result in good generalization. In this thesis, good …

[PDF][PDF] Apprenticeship Learning for a Predictive State Representation of Anesthesia

J Audiffren, L Oudre - laurentoudre.fr
Objective: In this paper, we present an original decision support algorithm to assist the
anesthesiologists delivery of drugs to maintain the optimal Depth of Anesthesia (DoA) …

A Consistent Method for Learning OOMs from Asymptotically Stationary Time Series Data Containing Missing Values

T Liu - arXiv preprint arXiv:1808.03873, 2018 - arxiv.org
In the traditional framework of spectral learning of stochastic time series models, model
parameters are estimated based on trajectories of fully recorded observations. However …