Finite-time analysis of on-policy heterogeneous federated reinforcement learning

A Adibi, N Dal Fabbro, L Schenato… - International …, 2024 - proceedings.mlr.press

Motivated by applications in large-scale and multi-agent reinforcement learning, we study
the non-asymptotic performance of stochastic approximation (SA) schemes with delayed …

被引用次数：3 相关文章所有 4 个版本

[PDF] arxiv.org

Federated offline reinforcement learning: Collaborative single-policy coverage suffices

J Woo, L Shi, G Joshi, Y Chi - arXiv preprint arXiv:2402.05876, 2024 - arxiv.org

Offline reinforcement learning (RL), which seeks to learn an optimal policy using offline data,
has garnered significant interest due to its potential in critical applications where online data …

被引用次数：2 相关文章所有 5 个版本

[PDF] arxiv.org

Compressed Federated Reinforcement Learning with a Generative Model

A Beikmohammadi, S Khirirat, S Magnússon - arXiv preprint arXiv …, 2024 - arxiv.org

Reinforcement learning has recently gained unprecedented popularity, yet it still grapples
with sample inefficiency. Addressing this challenge, federated reinforcement learning …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

DASA: Delay-Adaptive Multi-Agent Stochastic Approximation

ND Fabbro, A Adibi, HV Poor, SR Kulkarni… - arXiv preprint arXiv …, 2024 - arxiv.org

We consider a setting in which $ N $ agents aim to speedup a common Stochastic
Approximation (SA) problem by acting in parallel and communicating with a central server …

被引用次数：1 相关文章

[PDF] arxiv.org

高级搜索

QQ 群