Brax--a differentiable physics engine for large scale rigid body simulation

CD Freeman, E Frey, A Raichuk, S Girgin… - arXiv preprint arXiv …, 2021 - arxiv.org
We present Brax, an open source library for rigid body simulation with a focus on
performance and parallelism on accelerators, written in JAX. We present results on a suite of …

Gradients are not all you need

L Metz, CD Freeman, SS Schoenholz… - arXiv preprint arXiv …, 2021 - arxiv.org
Differentiable programming techniques are widely used in the community and are
responsible for the machine learning renaissance of the past several decades. While these …

Physics-informed machine learning for modeling and control of dynamical systems

TX Nghiem, J Drgoňa, C Jones, Z Nagy… - 2023 American …, 2023 - ieeexplore.ieee.org
Physics-informed machine learning (PIML) is a set of methods and tools that systematically
integrate machine learning (ML) algorithms with physical constraints and abstract …

Optimal rates for bandit nonstochastic control

YJ Sun, S Newman, E Hazan - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract Linear Quadratic Regulator (LQR) and Linear Quadratic Gaussian (LQG) control
are foundational and extensively researched problems in optimal control. We investigate …

Online nonstochastic model-free reinforcement learning

U Ghai, A Gupta, W Xia, K Singh… - Advances in Neural …, 2024 - proceedings.neurips.cc
We investigate robust model-free reinforcement learning algorithms designed for
environments that may be dynamic or even adversarial. Traditional state-based policies …

Training efficient controllers via analytic policy gradient

N Wiedemann, V Wüest, A Loquercio… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Control design for robotic systems is complex and often requires solving an optimization to
follow a trajectory accurately. Online optimization approaches like Model Predictive Control …

Controlgym: Large-scale safety-critical control environments for benchmarking reinforcement learning algorithms

X Zhang, W Mao, S Mowlavi, M Benosman… - arXiv preprint arXiv …, 2023 - arxiv.org
We introduce controlgym, a library of thirty-six safety-critical industrial control settings, and
ten infinite-dimensional partial differential equation (PDE)-based control problems …

Controlgym: Large-scale control environments for benchmarking reinforcement learning algorithms

X Zhang, W Mao, S Mowlavi… - 6th Annual Learning …, 2024 - proceedings.mlr.press
We introduce controlgym, a library of thirty-six industrial control settings, and ten infinite-
dimensional partial differential equation (PDE)-based control problems. Integrated within the …

Online Learning for Obstacle Avoidance

D Snyder, M Booker, N Simon, W Xia… - … on Robot Learning, 2023 - proceedings.mlr.press
We approach the fundamental problem of obstacle avoidance for robotic systems via the
lens of online learning. In contrast to prior work that either assumes worst-case realizations …

A regret minimization approach to multi-agent control

U Ghai, U Madhushani, N Leonard… - … on Machine Learning, 2022 - proceedings.mlr.press
We study the problem of multi-agent control of a dynamical system with known dynamics
and adversarial disturbances. Our study focuses on optimal control without centralized …