- Academic resource search

Brax--a differentiable physics engine for large scale rigid body simulation

CD Freeman, E Frey, A Raichuk, S Girgin… - arXiv preprint arXiv …, 2021 - arxiv.org

We present Brax, an open source library for rigid body simulation with a focus on
performance and parallelism on accelerators, written in JAX. We present results on a suite of …

Cited by 283 · Related articles · All 5 versions

Gradients are not all you need

L Metz, CD Freeman, SS Schoenholz… - arXiv preprint arXiv …, 2021 - arxiv.org

Differentiable programming techniques are widely used in the community and are
responsible for the machine learning renaissance of the past several decades. While these …

Cited by 88 · Related articles · All 2 versions

Physics-informed machine learning for modeling and control of dynamical systems

TX Nghiem, J Drgoňa, C Jones, Z Nagy… - 2023 American …, 2023 - ieeexplore.ieee.org

Physics-informed machine learning (PIML) is a set of methods and tools that systematically
integrate machine learning (ML) algorithms with physical constraints and abstract …

Cited by 43 · Related articles · All 11 versions

Optimal rates for bandit nonstochastic control

YJ Sun, S Newman, E Hazan - Advances in Neural …, 2024 - proceedings.neurips.cc

Linear Quadratic Regulator (LQR) and Linear Quadratic Gaussian (LQG) control
are foundational and extensively researched problems in optimal control. We investigate …

Cited by 8 · Related articles · All 6 versions

Online nonstochastic model-free reinforcement learning

U Ghai, A Gupta, W Xia, K Singh… - Advances in Neural …, 2024 - proceedings.neurips.cc

We investigate robust model-free reinforcement learning algorithms designed for
environments that may be dynamic or even adversarial. Traditional state-based policies …

Cited by 10 · Related articles · All 6 versions

Training efficient controllers via analytic policy gradient

N Wiedemann, V Wüest, A Loquercio… - … on Robotics and …, 2023 - ieeexplore.ieee.org

Control design for robotic systems is complex and often requires solving an optimization to
follow a trajectory accurately. Online optimization approaches like Model Predictive Control …

Cited by 19 · Related articles · All 10 versions

Controlgym: Large-scale safety-critical control environments for benchmarking reinforcement learning algorithms

X Zhang, W Mao, S Mowlavi, M Benosman… - arXiv preprint arXiv …, 2023 - arxiv.org

We introduce controlgym, a library of thirty-six safety-critical industrial control settings, and
ten infinite-dimensional partial differential equation (PDE)-based control problems …

Cited by 4 · Related articles · All 2 versions

Controlgym: Large-scale control environments for benchmarking reinforcement learning algorithms

X Zhang, W Mao, S Mowlavi… - 6th Annual Learning …, 2024 - proceedings.mlr.press

We introduce controlgym, a library of thirty-six industrial control settings, and ten infinite-
dimensional partial differential equation (PDE)-based control problems. Integrated within the …

Cited by 3 · Related articles · All 2 versions

Online Learning for Obstacle Avoidance

D Snyder, M Booker, N Simon, W Xia… - … on Robot Learning, 2023 - proceedings.mlr.press

We approach the fundamental problem of obstacle avoidance for robotic systems via the
lens of online learning. In contrast to prior work that either assumes worst-case realizations …

Cited by 2 · Related articles · All 7 versions

A regret minimization approach to multi-agent control

U Ghai, U Madhushani, N Leonard… - … on Machine Learning, 2022 - proceedings.mlr.press

We study the problem of multi-agent control of a dynamical system with known dynamics
and adversarial disturbances. Our study focuses on optimal control without centralized …

Cited by 6 · Related articles · All 6 versions
