Learning for decentralized control of multiagent systems in large, partially-observable stochasti...

S Omidshafiei, J Pazis, C Amato… - … on Machine Learning, 2017 - proceedings.mlr.press

Many real-world tasks involve multiple agents with partial observability and limited
communication. Learning is challenging in these settings due to local viewpoints of agents …

被引用次数：672 相关文章所有 7 个版本

[PDF] arxiv.org

Multi-robot coordination and planning in uncertain and adversarial environments

L Zhou, P Tokekar - Current Robotics Reports, 2021 - Springer

Abstract Purpose of Review Deploying a team of robots that can carefully coordinate their
actions can make the entire system robust to individual failures. In this report, we review …

被引用次数：44 相关文章所有 7 个版本

[PDF] jmlr.org

Approximate information state for approximate planning and reinforcement learning in partially observed systems

J Subramanian, A Sinha, R Seraj, A Mahajan - Journal of Machine …, 2022 - jmlr.org

We propose a theoretical framework for approximate planning and learning in partially
observed systems. Our framework is based on the fundamental notion of information state …

被引用次数：100 相关文章所有 14 个版本

[PDF] jair.org

Modeling and planning with macro-actions in decentralized POMDPs

C Amato, G Konidaris, LP Kaelbling, JP How - Journal of Artificial …, 2019 - jair.org

Decentralized partially observable Markov decision processes (Dec-POMDPs) are general
models for decentralized multi-agent decision making under uncertainty. However, they …

被引用次数：161 相关文章所有 32 个版本

[PDF] mit.edu

Policy search for multi-robot coordination under uncertainty

C Amato, G Konidaris, A Anders… - … Journal of Robotics …, 2016 - journals.sagepub.com

We introduce a principled method for multi-robot coordination based on a general model
(termed a MacDec-POMDP) of multi-robot cooperative planning in the presence of …

被引用次数：98 相关文章所有 8 个版本

[PDF] arxiv.org

Deep reinforcement learning for event-driven multi-agent decision processes

K Menda, YC Chen, J Grana, JW Bono… - IEEE Transactions …, 2018 - ieeexplore.ieee.org

The incorporation of macro-actions (temporally extended actions) into multi-agent decision
problems has the potential to address the curse of dimensionality associated with such …

被引用次数：74 相关文章所有 7 个版本

[PDF] mlr.press

Macro-action-based deep multi-agent reinforcement learning

Y Xiao, J Hoffman, C Amato - Conference on Robot Learning, 2020 - proceedings.mlr.press

In real-world multi-robot systems, performing high-quality, collaborative behaviors requires
robots to asynchronously reason about high-level action selection at varying time durations …

被引用次数：37 相关文章所有 7 个版本

[PDF] mlr.press

Learning to act in decentralized partially observable MDPs

J Dibangoye, O Buffet - International Conference on …, 2018 - proceedings.mlr.press

We address a long-standing open problem of reinforcement learning in decentralized
partially observable Markov decision processes. Previous attempts focussed on different …

被引用次数：44 相关文章所有 23 个版本

Cooperative multi-agent policy gradient

G Bono, JS Dibangoye, L Matignon, F Pereyron… - Machine Learning and …, 2019 - Springer

Reinforcement Learning (RL) for decentralized partially observable Markov decision
processes (Dec-POMDPs) is lagging behind the spectacular breakthroughs of single-agent …

被引用次数：43 相关文章所有 4 个版本

[PDF] arxiv.org

Learning for multi-robot cooperation in partially observable stochastic environments with macro-actions

M Liu, K Sivakumar, S Omidshafiei… - 2017 IEEE/RSJ …, 2017 - ieeexplore.ieee.org

This paper presents a data-driven approach for multi-robot coordination in partially-
observable domains based on Decentralized Partially Observable Markov Decision …

被引用次数：43 相关文章所有 8 个版本

高级搜索

QQ 群