A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arXiv preprint arXiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

[HTML][HTML] A survey on multi-agent reinforcement learning and its application

Z Ning, L Xie - Journal of Automation and Intelligence, 2024 - Elsevier
Multi-agent reinforcement learning (MARL) has been a rapidly evolving field. This paper
presents a comprehensive survey of MARL and its applications. We trace the historical …

Scalable multi-agent reinforcement learning for warehouse logistics with robotic and human co-workers

A Krnjaic, RD Steleac, JD Thomas… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
We consider a warehouse in which dozens of mobile robots and human pickers work
together to collect and deliver items within the warehouse. The fundamental problem we …

Pangu-agent: A fine-tunable generalist agent with structured reasoning

F Christianos, G Papoudakis, M Zimmer… - arXiv preprint arXiv …, 2023 - arxiv.org
A key method for creating Artificial Intelligence (AI) agents is Reinforcement Learning (RL).
However, constructing a standalone RL policy that maps perception to action directly …

An introduction to centralized training for decentralized execution in cooperative multi-agent reinforcement learning

C Amato - arXiv preprint arXiv:2409.03052, 2024 - arxiv.org
Multi-agent reinforcement learning (MARL) has exploded in popularity in recent years. Many
approaches have been developed but they can be divided into three main types: centralized …

Multi-agent Reinforcement Learning for the Control of Three-Dimensional Rayleigh–Bénard Convection

J Vasanth, J Rabault, F Alcántara-Ávila… - Flow, Turbulence and …, 2024 - Springer
Deep reinforcement learning (DRL) has found application in numerous use-cases pertaining
to flow control. Multi-agent RL (MARL), a variant of DRL, has shown to be more effective …

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

X Yan, Y Song, X Cui, F Christianos, H Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) demonstrate their promise in tackling complicated practical
challenges by combining action-based policies with chain of thought (CoT) reasoning …

A general learning framework for open ad hoc teamwork using graph-based policy learning

A Rahman, I Carlucho, N Höpner… - Journal of Machine …, 2023 - jmlr.org
Open ad hoc teamwork is the problem of training a single agent to efficiently collaborate with
an unknown group of teammates whose composition may change over time. A variable team …

Multi-agent verification and control with probabilistic model checking

D Parker - International Conference on Quantitative Evaluation of …, 2023 - Springer
Probabilistic model checking is a technique for formal automated reasoning about software
or hardware systems that operate in the context of uncertainty or stochasticity. It builds upon …

Multi-Agent Environments for Vehicle Routing Problems

R Gama, D Fuertes, CR del-Blanco… - arXiv preprint arXiv …, 2024 - arxiv.org
Research on Reinforcement Learning (RL) approaches for discrete optimization problems
has increased considerably, extending RL to an area classically dominated by Operations …