A survey on policy search algorithms for learning robot controllers in a handful of trials

K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …

Imitation is not enough: Robustifying imitation with reinforcement learning for challenging driving scenarios

Y Lu, J Fu, G Tucker, X Pan, E Bronstein… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org
Imitation learning (IL) is a simple and powerful way to use high-quality human driving data,
which can be collected at scale, to produce human-like behavior. However, policies based …

Risk averse robust adversarial reinforcement learning

X Pan, D Seita, Y Gao, J Canny - … International Conference on …, 2019 - ieeexplore.ieee.org
Deep reinforcement learning has recently made significant progress in solving computer
games and robotic control tasks. A known problem, though, is that policies overfit to the …

A modern retrospective on probabilistic numerics

CJ Oates, TJ Sullivan - Statistics and computing, 2019 - Springer
This article attempts to place the emergence of probabilistic numerics as a mathematical–
statistical research field within its historical context and to explore how its gradual …

Probabilistic solutions to ordinary differential equations as nonlinear Bayesian filtering: a new perspective

F Tronarp, H Kersting, S Särkkä, P Hennig - Statistics and Computing, 2019 - Springer
We formulate probabilistic numerical approximations to solutions of ordinary differential
equations (ODEs) as problems in Gaussian process (GP) regression with nonlinear …

Embedding synthetic off-policy experience for autonomous driving via zero-shot curricula

E Bronstein, S Srinivasan, S Paul… - … on Robot Learning, 2023 - proceedings.mlr.press
ML-based motion planning is a promising approach to produce agents that exhibit complex
behaviors, and automatically adapt to novel environments. In the context of autonomous …

Personalized brain stimulation for effective neurointervention across participants

NER van Bueren, TL Reed, V Nguyen… - PLoS computational …, 2021 - journals.plos.org
Accumulating evidence from human-based research has highlighted that the prevalent one-
size-fits-all approach for neural and behavioral interventions is inefficient. This approach can …

Kernel quadrature with randomly pivoted cholesky

E Epperly, E Moreno - Advances in Neural Information …, 2023 - proceedings.neurips.cc
This paper presents new quadrature rules for functions in a reproducing kernel Hilbert space
using nodes drawn by a sampling algorithm known as randomly pivoted Cholesky. The …

Drone elevation control based on python-unity integrated framework for reinforcement learning applications

MAB Abbass, HS Kang - Drones, 2023 - mdpi.com
Reinforcement learning (RL) applications require a huge effort to become established in real-
world environments, due to the injury and break down risks during interactions between the …

Directed locomotion for modular robots with evolvable morphologies

G Lan, M Jelisavcic, DM Roijers, E Haasdijk… - Parallel Problem Solving …, 2018 - Springer
Morphologically evolving robot systems need to include a learning period right after 'birth'to
acquire a controller that fits the newly created body. In this paper, we investigate learning …