- 学术资源搜索

Recurrent predictive state policy networks

A Hefny, Z Marinho, W Sun… - International …, 2018 - proceedings.mlr.press

Abstract We introduce Recurrent Predictive State Policy (RPSP) networks, a recurrent
architecture that brings insights from predictive state representations to reinforcement …

被引用次数：33 相关文章所有 8 个版本

[PDF] nsf.gov

[PDF][PDF] Reinforcement learning under a multi-agent predictive state representation model: Method and theory

Z Zhang, Z Yang, H Liu, P Tokekar… - The Tenth International …, 2022 - par.nsf.gov

We study reinforcement learning for partially observable multi-agent systems where each
agent only has access to its own observation and reward and aims to maximize its …

被引用次数：10 相关文章所有 3 个版本

[PDF] kyushu-u.ac.jp

Mobile robot navigation using learning-based method based on predictive state representation in a dynamic environment

K Matsumoto, A Kawamura, Q An… - 2022 IEEE/SICE …, 2022 - ieeexplore.ieee.org

Mobile robot navigation in a dynamic environment with pedestrians is essential for service
robots operating in a living environment. Accordingly, the robot needs to understand and …

被引用次数：4 相关文章所有 3 个版本

Traffic Signal Control for Large-Scale Road Networks Based on Deep Reinforcement with PSR

Z Zhou, H Zhang, Y Zhang - 2024 3rd Conference on Fully …, 2024 - ieeexplore.ieee.org

This paper investigates the problem of Traffic Signal Control (TSC) in large-scale road
networks. In extensive road networks, it is customary to define each intersection as an agent …

[PDF] cmu.edu

[PDF][PDF] Efficient Methods for Prediction and Control in Partially Observable Environments.

A Hefny - 2018 - reports-archive.adm.cs.cmu.edu

State estimation and tracking (also known as filtering) is an integral part of any system
performing inference in a partially observable environment, whether it is a robot that is …

被引用次数：2 相关文章所有 8 个版本

[PDF] cmu.edu

[PDF][PDF] Unified Models for Dynamical Systems

C Downey - 2019 - reports-archive.adm.cs.cmu.edu

Intuitively a dynamical system is any observable quantity which changes over time
according to some fixed rule. Building models to understand, predict, and control dynamical …

A Consistent Method for Learning OOMs from Asymptotically Stationary Time Series Data Containing Missing Values

T Liu - arXiv preprint arXiv:1808.03873, 2018 - arxiv.org

In the traditional framework of spectral learning of stochastic time series models, model
parameters are estimated based on trajectories of fully recorded observations. However …

[PDF][PDF] Kernel and Moment Based Prediction and Planning: Applications to Robotics and Natural Language Processing

Z Marinho - 2018 - kilthub.cmu.edu

This thesis focuses on moment and kernel-based methods for applications in Robotics and
Natural Language Processing. Kernel and moment-based learning leverage information …

[引用][C] Department of Computer Science and Electrical En-gineering

T Liu - 2018

[引用][C] for System Identification through Supervised Learning

A Hefny - 2017 - Georgia Institute of Technology

高级搜索

QQ 群