Lightzero: A unified benchmark for monte carlo tree search in general sequential decision scenarios

Y Niu, Y Pu, Z Yang, X Li, T Zhou… - Advances in …, 2024 - proceedings.neurips.cc
Building agents based on tree-search planning capabilities with learned models has
achieved remarkable success in classic decision-making problems, such as Go and Atari …

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arXiv preprint arXiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

A versatile multi-agent reinforcement learning benchmark for inventory management

X Yang, Z Liu, W Jiang, C Zhang, L Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Multi-agent reinforcement learning (MARL) models multiple agents that interact and learn
within a shared environment. This paradigm is applicable to various industrial scenarios …

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation

A Skrynnik, A Andreychuk, A Borzilov… - arXiv preprint arXiv …, 2024 - arxiv.org
Multi-agent reinforcement learning (MARL) has recently excelled in solving challenging
cooperative and competitive multi-agent problems in various environments with, mostly, few …

[PDF][PDF] Marllib: A scalable multi-agent reinforcement learning library

S Hu, Y Zhong, M Gao, W Wang, H Dong… - arXiv preprint arXiv …, 2022 - researchgate.net
Despite the fast development of multi-agent systems (MAS) and multi-agent reinforcement
learning (MARL) algorithms, there is a lack of unified evaluation platforms and commonly …

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning

C Li, R Jia, J Liu, Y Zhang, Y Niu, Y Yang, Y Liu… - ECAI 2023, 2023 - ebooks.iospress.nl
Abstract Model-based reinforcement learning (RL) has demonstrated remarkable successes
on a range of continuous control tasks due to its high sample efficiency. To save the …

A Collaborative Perspective on Exploration in Reinforcement Learning

Y Fu, H Zhang, D Wu, W Xu, B Boulet - openreview.net
Exploration is one of the central topic in reinforcement learning (RL). Many existing
approaches take a single agent perspective when tackling this problem. In this work, we …