Alternating optimisation and quadrature for robust control

K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org

Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …

被引用次数：205 相关文章所有 17 个版本

[PDF] arxiv.org

Imitation is not enough: Robustifying imitation with reinforcement learning for challenging driving scenarios

Y Lu, J Fu, G Tucker, X Pan, E Bronstein… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org

Imitation learning (IL) is a simple and powerful way to use high-quality human driving data,
which can be collected at scale, to produce human-like behavior. However, policies based …

被引用次数：82 相关文章所有 4 个版本

[PDF] arxiv.org

Risk averse robust adversarial reinforcement learning

X Pan, D Seita, Y Gao, J Canny - … International Conference on …, 2019 - ieeexplore.ieee.org

Deep reinforcement learning has recently made significant progress in solving computer
games and robotic control tasks. A known problem, though, is that policies overfit to the …

被引用次数：120 相关文章所有 11 个版本

[PDF] springer.com

A modern retrospective on probabilistic numerics

CJ Oates, TJ Sullivan - Statistics and computing, 2019 - Springer

This article attempts to place the emergence of probabilistic numerics as a mathematical–
statistical research field within its historical context and to explore how its gradual …

被引用次数：84 相关文章所有 11 个版本

[PDF] springer.com

Probabilistic solutions to ordinary differential equations as nonlinear Bayesian filtering: a new perspective

F Tronarp, H Kersting, S Särkkä, P Hennig - Statistics and Computing, 2019 - Springer

We formulate probabilistic numerical approximations to solutions of ordinary differential
equations (ODEs) as problems in Gaussian process (GP) regression with nonlinear …

被引用次数：72 相关文章所有 12 个版本

[PDF] mlr.press

Embedding synthetic off-policy experience for autonomous driving via zero-shot curricula

E Bronstein, S Srinivasan, S Paul… - … on Robot Learning, 2023 - proceedings.mlr.press

ML-based motion planning is a promising approach to produce agents that exhibit complex
behaviors, and automatically adapt to novel environments. In the context of autonomous …

被引用次数：16 相关文章所有 5 个版本

[PDF] plos.org

Personalized brain stimulation for effective neurointervention across participants

NER van Bueren, TL Reed, V Nguyen… - PLoS computational …, 2021 - journals.plos.org

Accumulating evidence from human-based research has highlighted that the prevalent one-
size-fits-all approach for neural and behavioral interventions is inefficient. This approach can …

被引用次数：24 相关文章所有 18 个版本

[PDF] neurips.cc

Kernel quadrature with randomly pivoted cholesky

E Epperly, E Moreno - Advances in Neural Information …, 2023 - proceedings.neurips.cc

This paper presents new quadrature rules for functions in a reproducing kernel Hilbert space
using nodes drawn by a sampling algorithm known as randomly pivoted Cholesky. The …

被引用次数：6 相关文章所有 5 个版本

[PDF] mdpi.com

Drone elevation control based on python-unity integrated framework for reinforcement learning applications

MAB Abbass, HS Kang - Drones, 2023 - mdpi.com

Reinforcement learning (RL) applications require a huge effort to become established in real-
world environments, due to the injury and break down risks during interactions between the …

被引用次数：13 相关文章所有 4 个版本

[PDF] vu.nl

Directed locomotion for modular robots with evolvable morphologies

G Lan, M Jelisavcic, DM Roijers, E Haasdijk… - Parallel Problem Solving …, 2018 - Springer

Morphologically evolving robot systems need to include a learning period right after 'birth'to
acquire a controller that fits the newly created body. In this paper, we investigate learning …

被引用次数：39 相关文章所有 7 个版本

高级搜索

QQ 群