Trust region policy optimisation in multi-agent reinforcement learning JG Kuba, R Chen, M Wen, Y Wen, F Sun, J Wang, Y Yang arXiv preprint arXiv:2109.11251, 2021 | 188 | 2021 |
Multi-agent constrained policy optimisation S Gu, JG Kuba, M Wen, R Chen, Z Wang, Z Tian, J Wang, A Knoll, Y Yang arXiv preprint arXiv:2110.02793, 2021 | 50 | 2021 |
Deep reinforcement learning for resource allocation in massive MIMO L Chen, F Sun, K Li, R Chen, Y Yang, J Wang 2021 29th European Signal Processing Conference (EUSIPCO), 1611-1615, 2021 | 9 | 2021 |
Adaptive multi-objective reinforcement learning for pareto frontier approximation: A case study of resource allocation network in massive mimo R Chen, F Sun, L Chen, K Li, L Wu, J Wang, Y Yang 2021 29th European Signal Processing Conference (EUSIPCO), 1631-1635, 2021 | 4 | 2021 |
Trust region policy optimisation in multi-agent reinforcement learning J Grudzien Kuba, R Chen, M Wen, Y Wen, F Sun, J Wang, Y Yang arXiv e-prints, arXiv: 2109.11251, 2021 | 2 | 2021 |
Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework W Zu, W Song, R Chen, Z Guo, F Sun, Z Tian, W Pan, J Wang arXiv preprint arXiv:2311.08244, 2023 | 1 | 2023 |