A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arXiv preprint arXiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Mutual-Information Regularized Multi-Agent Policy Iteration

D Ye, Z Lu - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Despite the success of cooperative multi-agent reinforcement learning algorithms, most of
them focus on a single team composition, which prevents them from being used in more …

Mde-EvoNAS: Automatic network architecture design for monocular depth estimation via evolutionary neural architecture search

Z Yu, H Zhang, R Liu, S Dai, X Chen, W Sheng… - Swarm and Evolutionary …, 2025 - Elsevier
The advanced performance of the monocular depth estimation model highly relies on
features extracted by encoder networks. The encoder architecture in most previous methods …

Mutual-information regularized multi-agent policy iteration

J Wang, D Ye, Z Lu - Thirty-seventh Conference on Neural …, 2023 - openreview.net
Despite the success of cooperative multi-agent reinforcement learning algorithms, most of
them focus on a single team composition, which prevents them from being used in more …

Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration

H Dou, L Dang, Z Luan, B Chen - The Thirty-eighth Annual Conference on … - openreview.net
Despite the success of Multi-Agent Reinforcement Learning (MARL) algorithms in
cooperative tasks, previous works, unfortunately, face challenges in heterogeneous …

Exploring Complicated Search Spaces With Interleaving-Free Sampling

Y Tian, L Xie, J Fang, J Jiao, Q Ye… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Conventional neural architecture search (NAS) algorithms typically work on search spaces
with short-distance node connections. We argue that such designs, though safe and stable …