Residual policy learning facilitates efficient model-free autonomous racing

R Zhang, J Hou, G Chen, Z Li, J Chen… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
Motion planning for autonomous racing is a challenging task due to the safety requirement
while driving aggressively. Most previous solutions utilize the prior information or depend on …

Distributional reinforcement learning-based energy arbitrage strategies in imbalance settlement mechanism

SSK Madahi, B Claessens, C Develder - Journal of Energy Storage, 2024 - Elsevier
Growth in the penetration of renewable energy sources makes supply more uncertain and
leads to an increase in the system imbalance. This trend, together with the single imbalance …

Action noise in off-policy deep reinforcement learning: Impact on exploration and performance

J Hollenstein, S Auddy, M Saveriano… - arXiv preprint arXiv …, 2022 - arxiv.org
Many Deep Reinforcement Learning (D-RL) algorithms rely on simple forms of exploration
such as the additive action noise often used in continuous control domains. Typically, the …

Continuous control with coarse-to-fine reinforcement learning

Y Seo, J Uruç, S James - arXiv preprint arXiv:2407.07787, 2024 - arxiv.org
Despite recent advances in improving the sample-efficiency of reinforcement learning (RL)
algorithms, designing an RL algorithm that can be practically deployed in real-world …

Latent imagination facilitates zero-shot transfer in autonomous racing

A Brunnbauer, L Berducci… - … on robotics and …, 2022 - ieeexplore.ieee.org
World models learn behaviors in a latent imagination space to enhance the sample-
efficiency of deep reinforcement learning (RL) algorithms. While learning world models for …

Continuous control with action quantization from demonstrations

R Dadashi, L Hussenot, D Vincent, S Girgin… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we propose a novel Reinforcement Learning (RL) framework for problems with
continuous action spaces: Action Quantization from Demonstrations (AQuaDem). The …

Solving continuous control via q-learning

T Seyde, P Werner, W Schwarting… - arXiv preprint arXiv …, 2022 - arxiv.org
While there has been substantial success for solving continuous control with actor-critic
methods, simpler critic-only methods such as Q-learning find limited application in the …

Reinforcement learning with simple sequence priors

T Saanum, N Éltető, P Dayan… - Advances in Neural …, 2024 - proceedings.neurips.cc
In reinforcement learning (RL), simplicity is typically quantified on an action-by-action basis--
but this timescale ignores temporal regularities, like repetitions, often present in sequential …

Bingham policy parameterization for 3d rotations in reinforcement learning

S James, P Abbeel - arXiv preprint arXiv:2202.03957, 2022 - arxiv.org
We propose a new policy parameterization for representing 3D rotations during
reinforcement learning. Today in the continuous control reinforcement learning literature …

Dynamic multi-team racing: Competitive driving on 1/10-th scale vehicles via learning in simulation

P Werner, T Seyde, P Drews, TM Balch… - … Conference on Robot …, 2023 - openreview.net
Autonomous racing is a challenging task that requires vehicle handling at the dynamic limits
of friction. While single-agent scenarios like Time Trials are solved competitively with …