Recurrent predictive state policy networks

A Hefny, Z Marinho, W Sun… - International …, 2018 - proceedings.mlr.press
Abstract We introduce Recurrent Predictive State Policy (RPSP) networks, a recurrent
architecture that brings insights from predictive state representations to reinforcement …

[PDF][PDF] Reinforcement learning under a multi-agent predictive state representation model: Method and theory

Z Zhang, Z Yang, H Liu, P Tokekar… - The Tenth International …, 2022 - par.nsf.gov
We study reinforcement learning for partially observable multi-agent systems where each
agent only has access to its own observation and reward and aims to maximize its …

Mobile robot navigation using learning-based method based on predictive state representation in a dynamic environment

K Matsumoto, A Kawamura, Q An… - 2022 IEEE/SICE …, 2022 - ieeexplore.ieee.org
Mobile robot navigation in a dynamic environment with pedestrians is essential for service
robots operating in a living environment. Accordingly, the robot needs to understand and …

Traffic Signal Control for Large-Scale Road Networks Based on Deep Reinforcement with PSR

Z Zhou, H Zhang, Y Zhang - 2024 3rd Conference on Fully …, 2024 - ieeexplore.ieee.org
This paper investigates the problem of Traffic Signal Control (TSC) in large-scale road
networks. In extensive road networks, it is customary to define each intersection as an agent …

[PDF][PDF] Efficient Methods for Prediction and Control in Partially Observable Environments.

A Hefny - 2018 - reports-archive.adm.cs.cmu.edu
State estimation and tracking (also known as filtering) is an integral part of any system
performing inference in a partially observable environment, whether it is a robot that is …

[PDF][PDF] Unified Models for Dynamical Systems

C Downey - 2019 - reports-archive.adm.cs.cmu.edu
Intuitively a dynamical system is any observable quantity which changes over time
according to some fixed rule. Building models to understand, predict, and control dynamical …

A Consistent Method for Learning OOMs from Asymptotically Stationary Time Series Data Containing Missing Values

T Liu - arXiv preprint arXiv:1808.03873, 2018 - arxiv.org
In the traditional framework of spectral learning of stochastic time series models, model
parameters are estimated based on trajectories of fully recorded observations. However …

[引用][C] Department of Computer Science and Electrical En-gineering

T Liu - 2018