Large language models play StarCraft II: Benchmarks and a chain of summarization approach W Ma, Q Mi, X Yan, Y Wu, R Lin, H Zhang, J Wang arXiv preprint arXiv:2312.11865, 2023 | 16 | 2023 |
TaxAI: A Dynamic Economic Simulator and Benchmark for Multi-Agent Reinforcement Learning Q Mi, S Xia, Y Song, H Zhang, S Zhu, J Wang AAMAS '24: Proceedings of the 23rd International Conference on Autonomous …, 2023 | 6 | 2023 |
Joint caching and transmission in the mobile edge network: An multi-agent learning approach Q Mi, N Yang, H Zhang, H Zhang, J Wang 2021 IEEE Global Communications Conference (GLOBECOM), 1-6, 2021 | 4 | 2021 |
Learning Macroeconomic Policies based on Microfoundations: A Dynamic Stackelberg Mean Field Game Approach Q Mi, Z Zhao, S Xia, Y Song, J Wang, H Zhang arXiv preprint arXiv:2403.12093, 2024 | | 2024 |