Is bang-bang control all you need? solving continuous control with bernoulli policies

R Zhang, J Hou, G Chen, Z Li, J Chen… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org

Motion planning for autonomous racing is a challenging task due to the safety requirement
while driving aggressively. Most previous solutions utilize the prior information or depend on …

被引用次数：42 相关文章所有 3 个版本

[PDF] arxiv.org

Distributional reinforcement learning-based energy arbitrage strategies in imbalance settlement mechanism

SSK Madahi, B Claessens, C Develder - Journal of Energy Storage, 2024 - Elsevier

Growth in the penetration of renewable energy sources makes supply more uncertain and
leads to an increase in the system imbalance. This trend, together with the single imbalance …

被引用次数：5 相关文章

[PDF] arxiv.org

Action noise in off-policy deep reinforcement learning: Impact on exploration and performance

J Hollenstein, S Auddy, M Saveriano… - arXiv preprint arXiv …, 2022 - arxiv.org

Many Deep Reinforcement Learning (D-RL) algorithms rely on simple forms of exploration
such as the additive action noise often used in continuous control domains. Typically, the …

被引用次数：28 相关文章所有 7 个版本

[PDF] arxiv.org

Continuous control with coarse-to-fine reinforcement learning

Y Seo, J Uruç, S James - arXiv preprint arXiv:2407.07787, 2024 - arxiv.org

Despite recent advances in improving the sample-efficiency of reinforcement learning (RL)
algorithms, designing an RL algorithm that can be practically deployed in real-world …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Latent imagination facilitates zero-shot transfer in autonomous racing

A Brunnbauer, L Berducci… - … on robotics and …, 2022 - ieeexplore.ieee.org

World models learn behaviors in a latent imagination space to enhance the sample-
efficiency of deep reinforcement learning (RL) algorithms. While learning world models for …

被引用次数：41 相关文章所有 8 个版本

[PDF] arxiv.org

Continuous control with action quantization from demonstrations

R Dadashi, L Hussenot, D Vincent, S Girgin… - arXiv preprint arXiv …, 2021 - arxiv.org

In this paper, we propose a novel Reinforcement Learning (RL) framework for problems with
continuous action spaces: Action Quantization from Demonstrations (AQuaDem). The …

被引用次数：35 相关文章所有 7 个版本

[PDF] arxiv.org

Solving continuous control via q-learning

T Seyde, P Werner, W Schwarting… - arXiv preprint arXiv …, 2022 - arxiv.org

While there has been substantial success for solving continuous control with actor-critic
methods, simpler critic-only methods such as Q-learning find limited application in the …

被引用次数：19 相关文章所有 3 个版本

[PDF] neurips.cc

Reinforcement learning with simple sequence priors

T Saanum, N Éltető, P Dayan… - Advances in Neural …, 2024 - proceedings.neurips.cc

In reinforcement learning (RL), simplicity is typically quantified on an action-by-action basis--
but this timescale ignores temporal regularities, like repetitions, often present in sequential …

被引用次数：9 相关文章所有 6 个版本

[PDF] arxiv.org

Bingham policy parameterization for 3d rotations in reinforcement learning

S James, P Abbeel - arXiv preprint arXiv:2202.03957, 2022 - arxiv.org

We propose a new policy parameterization for representing 3D rotations during
reinforcement learning. Today in the continuous control reinforcement learning literature …

被引用次数：14 相关文章所有 2 个版本

[PDF] openreview.net

Dynamic multi-team racing: Competitive driving on 1/10-th scale vehicles via learning in simulation

P Werner, T Seyde, P Drews, TM Balch… - … Conference on Robot …, 2023 - openreview.net

Autonomous racing is a challenging task that requires vehicle handling at the dynamic limits
of friction. While single-agent scenarios like Time Trials are solved competitively with …

被引用次数：3 相关文章所有 2 个版本

高级搜索

QQ 群