Learning to run with actor-critic ensemble

Modeling pedestrian behavior in pedestrian-vehicle near misses: A continuous Gaussian Process Inverse Reinforcement Learning (GP-IRL) approach

P Nasernejad, T Sayed, R Alsaleh - Accident Analysis & Prevention, 2021 - Elsevier

Using simulation models to conduct safety assessments can have several advantages as it
enables the evaluation of the safety of various design and traffic management options before …

被引用次数：43 相关文章所有 6 个版本

Ensemble-based deep reinforcement learning for robust cooperative wind farm control

B He, H Zhao, G Liang, J Zhao, J Qiu… - International Journal of …, 2022 - Elsevier

The wake effect is the major obstacle to reaching the maximum power generation for wind
farms, since choosing the suitable wake model that satisfies both computational cost and …

被引用次数：14 相关文章所有 3 个版本

A cerebellum-inspired prediction and correction model for motion control of a musculoskeletal robot

J Zhang, J Chen, W Wu, H Qiao - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

It is an important issue that how to regulate the existing control models of musculoskeletal
robots to improve the ability of motion learning and generalization. In this article, based on …

被引用次数：16 相关文章

TASAC: A twin-actor reinforcement learning framework with a stochastic policy with an application to batch process control

T Joshi, H Kodamana, H Kandath, N Kaisare - Control Engineering Practice, 2023 - Elsevier

Due to their complex nonlinear dynamics and batch-to-batch variability, batch processes
pose a challenge for process control. Due to the absence of accurate models and resulting …

被引用次数：7 相关文章

[PDF] arxiv.org

Artificial intelligence for prosthetics: Challenge solutions

Ł Kidziński, C Ong, SP Mohanty, J Hicks… - The NeurIPS'18 …, 2020 - Springer

Abstract In the NeurIPS 2018 Artificial Intelligence for Prosthetics challenge, participants
were tasked with building a controller for a musculoskeletal model with a goal of matching a …

被引用次数：47 相关文章所有 5 个版本

[PDF] aaai.org

Ace: An actor ensemble algorithm for continuous control with tree search

S Zhang, H Yao - Proceedings of the AAAI Conference on Artificial …, 2019 - aaai.org

In this paper, we propose an actor ensemble algorithm, named ACE, for continuous control
with a deterministic policy in reinforcement learning. In ACE, we use actor ensemble (ie …

被引用次数：31 相关文章所有 9 个版本

[PDF] arxiv.org

MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation

P Lancaster, N Hansen, A Rajeswaran… - arXiv preprint arXiv …, 2023 - arxiv.org

Robotic systems that aspire to operate in uninstrumented real-world environments must
perceive the world directly via onboard sensing. Vision-based learning systems aim to …

被引用次数：2 相关文章所有 2 个版本

[PDF] openreview.net

Multi-critic actor learning: Teaching RL policies to act with style

S Mysore, G Cheng, Y Zhao, K Saenko… - … Conference on Learning …, 2022 - openreview.net

Using a single value function (critic) shared over multiple tasks in Actor-Critic multi-task
reinforcement learning (MTRL) can result in negative interference between tasks, which can …

被引用次数：12 相关文章所有 2 个版本

[PDF] arxiv.org

Efficient and robust reinforcement learning with uncertainty-based value expansion

B Zhou, H Zeng, F Wang, Y Li, H Tian - arXiv preprint arXiv:1912.05328, 2019 - arxiv.org

By integrating dynamics models into model-free reinforcement learning (RL) methods,
model-based value expansion (MVE) algorithms have shown a significant advantage in …

被引用次数：20 相关文章所有 3 个版本

Empowering the diversity and individuality of option: Residual soft option critic framework

A Zhu, F Chen, H Xu, D Ouyang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Extracting temporal abstraction (option), which empowers the action space, is a crucial
challenge in hierarchical reinforcement learning. Under a well-structured action space …

被引用次数：5 相关文章所有 3 个版本

高级搜索

QQ 群