Metavim: Meta variationally intrinsic motivated reinforcement learning for decentralized traffic signal control L Zhu, P Peng, Z Lu, Y Tian IEEE Transactions on Knowledge and Data Engineering 35 (11), 11570-11584, 2023 | 21 | 2023 |
Enhance reasoning for large language models in the game werewolf S Wu, L Zhu, T Yang, S Xu, Q Fu, Y Wei, H Fu arXiv preprint arXiv:2402.02330, 2024 | 12 | 2024 |
Variationally and intrinsically motivated reinforcement learning for decentralized traffic signal control L Zhu, P Peng, Z Lu, X Wang, Y Tian arXiv preprint arXiv:2101.00746, 2021 | 10 | 2021 |
MTLight: Efficient multi-task reinforcement learning for traffic signal control L Zhu, P Peng, Z Lu, Y Tian arXiv preprint arXiv:2404.00886, 2024 | 4 | 2024 |
Sequential communication in multi-agent reinforcement learning Z Ding, W Hong, L Zhu, T Huang, Z Lu | 3 | 2021 |
Multi-Agent Sequential Decision-Making via Communication Z Ding, K Su, W Hong, L Zhu, T Huang, Z Lu arXiv preprint arXiv:2209.12713, 2022 | 1 | 2022 |
An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing H Dong, L Zhu, Z Shan, B Qiao, F Yang, S Qin, C Luo, Q Lin, Y Yang, ... arXiv preprint arXiv:2406.01047, 2024 | | 2024 |