Improved communication efficiency in federated natural policy gradient via admm-based gradient updates

G Lan, H Wang, J Anderson, C Brinton… - arXiv preprint arXiv …, 2023 - arxiv.org
Federated reinforcement learning (FedRL) enables agents to collaboratively train a global
policy without sharing their individual data. However, high communication overhead …

Model-free learning with heterogeneous dynamical systems: A federated LQR approach

H Wang, LF Toso, A Mitra, J Anderson - arXiv preprint arXiv:2308.11743, 2023 - arxiv.org
We study a model-free federated linear quadratic regulator (LQR) problem where M agents
with unknown, distinct yet similar dynamics collaboratively learn an optimal policy to …

Finite-time analysis of on-policy heterogeneous federated reinforcement learning

C Zhang, H Wang, A Mitra, J Anderson - arXiv preprint arXiv:2401.15273, 2024 - arxiv.org
Federated reinforcement learning (FRL) has emerged as a promising paradigm for reducing
the sample complexity of reinforcement learning tasks by exploiting information from …

[HTML][HTML] Wireless control: Retrospective and open vistas

M Pezzutto, S Dey, E Garone, K Gatsis… - Annual Reviews in …, 2024 - Elsevier
The convergence of wireless networks and control engineering has been a technological
driver since the beginning of this century. It has significantly contributed to a wide set of …

Imitation and transfer learning for LQG control

T Guo, AAR Al Makdah, V Krishnan… - IEEE Control Systems …, 2023 - ieeexplore.ieee.org
In this letter we study an imitation and transfer learning setting for Linear Quadratic Gaussian
(LQG) control, where (i) the system dynamics, noise statistics and cost function are unknown …

Meta-learning operators to optimality from multi-task non-iid data

TTCK Zhang, LF Toso, J Anderson, N Matni - arXiv preprint arXiv …, 2023 - arxiv.org
A powerful concept behind much of the recent progress in machine learning is the extraction
of common features across data from heterogeneous sources or tasks. Intuitively, using all of …

Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

L Wang, K Zhang, A Zhou, M Simchowitz… - arXiv preprint arXiv …, 2023 - arxiv.org
Fleets of robots ingest massive amounts of streaming data generated by interacting with
their environments, far more than those that can be stored or transmitted with ease. At the …

Multi-task system identification of similar linear time-invariant dynamical systems

Y Chen, AM Ospina, F Pasqualetti… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org
This paper presents a system identification framework-inspired by multi-task learning-to
estimate the dynamics of a given number of linear time-invariant (LTI) systems jointly by …

Sample-Efficient Linear Representation Learning from Non-IID Non-Isotropic Data

TTCK Zhang, LF Toso, J Anderson… - The Twelfth International …, 2024 - openreview.net
A powerful concept behind much of the recent progress in machine learning is the extraction
of common features across data from heterogeneous sources or tasks. Intuitively, using all of …

Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for the Model-free LQR

LF Toso, D Zhan, J Anderson, H Wang - arXiv preprint arXiv:2401.14534, 2024 - arxiv.org
We investigate the problem of learning Linear Quadratic Regulators (LQR) in a multi-task,
heterogeneous, and model-free setting. We characterize the stability and personalization …