youpeng Zhao
Unknown affiliation
Verified email at mail.ustc.edu.cn
Title
Cited by
Year
Douzero+: Improving doudizhu ai by opponent modeling and coach-guided learning
Y Zhao, J Zhao, X Hu, W Zhou, H Li
2022 IEEE conference on games (CoG), 127-134, 2022
Cited by: 19 · Year: 2022
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents
J Zhao, Y Zhao, W Wang, M Yang, X Hu, W Zhou, J Hao, H Li
Frontiers of Information Technology & Electronic Engineering 23 (7), 1032-1042, 2022
Cited by: 10 · Year: 2022
Danzero: Mastering guandan game with reinforcement learning
Y Lu, Y Zhao, W Zhou, H Li
2023 IEEE Conference on Games (CoG), 1-8, 2023
Cited by: 6 · Year: 2023
Full douzero+: Improving doudizhu ai by opponent modeling, coach-guided training and bidding learning
Y Zhao, J Zhao, X Hu, W Zhou, H Li
IEEE Transactions on Games, 2023
Cited by: 6 · Year: 2023
Mcmarl: Parameterizing value function via mixture of categorical distributions for multi-agent reinforcement learning
J Zhao, M Yang, Y Zhao, X Hu, W Zhou, H Li
IEEE Transactions on Games, 2023
Cited by: 4 · Year: 2023
Improving deep reinforcement learning with mirror loss
J Zhao, W Shu, Y Zhao, W Zhou, H Li
IEEE Transactions on Games 15 (3), 337-347, 2022
Cited by: 4 · Year: 2022
Danzero+: Dominating the guandan game through reinforcement learning
Y Zhao, Y Lu, J Zhao, W Zhou, H Li
IEEE Transactions on Games, 2024
Cited by: 2 · Year: 2024
Multi-agent first order constrained optimization in policy space
Y Zhao, Y Yang, Z Lu, W Zhou, H Li
Advances in Neural Information Processing Systems 36, 2024
Cited by: 2 · Year: 2024
Q-sat: Value factorization with self-attention for deep multi-agent reinforcement learning
X Hu, J Zhao, Y Zhao, W Zhou, H Li
2023 International Joint Conference on Neural Networks (IJCNN), 01-08, 2023
Cited by: 2 · Year: 2023
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
J Lin, J Zhao, Y Deng, Y Zhao, W Zhou, H Li
arXiv preprint arXiv:2412.11417, 2024
Year: 2024
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning
L Liu, J Zhao, C Hu, Z Cao, Y Zhao, Z Ye, M Meng, W Wang, Z He, H Li, ...
arXiv preprint arXiv:2406.03978, 2024
Year: 2024
Articles 1–11