youpeng Zhao 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	55	55
h 指数	4	4
i10 指数	2	2

0

40

20

2022202320244 11 40

开放获取的出版物数量

3 篇文章

2 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

youpeng Zhao

youpeng Zhao

未知所在单位机构

在 mail.ustc.edu.cn 的电子邮件经过验证

reinforcement learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Douzero+: Improving doudizhu ai by opponent modeling and coach-guided learning Y Zhao, J Zhao, X Hu, W Zhou, H Li 2022 IEEE conference on games (CoG), 127-134, 2022	19	2022
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents J Zhao, Y Zhao, W Wang, M Yang, X Hu, W Zhou, J Hao, H Li Frontiers of Information Technology & Electronic Engineering 23 (7), 1032-1042, 2022	10	2022
Danzero: Mastering guandan game with reinforcement learning Y Lu, Y Zhao, W Zhou, H Li 2023 IEEE Conference on Games (CoG), 1-8, 2023	6	2023
Full douzero+: Improving doudizhu ai by opponent modeling, coach-guided training and bidding learning Y Zhao, J Zhao, X Hu, W Zhou, H Li IEEE Transactions on Games, 2023	6	2023
Mcmarl: Parameterizing value function via mixture of categorical distributions for multi-agent reinforcement learning J Zhao, M Yang, Y Zhao, X Hu, W Zhou, H Li IEEE Transactions on Games, 2023	4	2023
Improving deep reinforcement learning with mirror loss J Zhao, W Shu, Y Zhao, W Zhou, H Li IEEE Transactions on Games 15 (3), 337-347, 2022	4	2022
Danzero+: Dominating the guandan game through reinforcement learning Y Zhao, Y Lu, J Zhao, W Zhou, H Li IEEE Transactions on Games, 2024	2	2024
Multi-agent first order constrained optimization in policy space Y Zhao, Y Yang, Z Lu, W Zhou, H Li Advances in Neural Information Processing Systems 36, 2024	2	2024
Q-sat: Value factorization with self-attention for deep multi-agent reinforcement learning X Hu, J Zhao, Y Zhao, W Zhou, H Li 2023 International Joint Conference on Neural Networks (IJCNN), 01-08, 2023	2	2023
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement J Lin, J Zhao, Y Deng, Y Zhao, W Zhou, H Li arXiv preprint arXiv:2412.11417, 2024		2024
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning L Liu, J Zhao, C Hu, Z Cao, Y Zhao, Z Ye, M Meng, W Wang, Z He, H Li, ... arXiv preprint arXiv:2406.03978, 2024		2024

系统目前无法执行此操作，请稍后再试。

文章 1–11