Combinatorial q-learning for dou di zhu- 学术资源搜索

Combinatorial q-learning for dou di zhu

Y You, L Li, B Guo, W Wang, C Lu - … of the AAAI Conference on Artificial …, 2020 - aaai.org

Proceedings of the AAAI Conference on Artificial Intelligence and Interactive …, 2020•aaai.org

Abstract

Deep reinforcement learning (DRL) has gained a lot of attention in recent years, and has been proven to be able to play Atari games and Go at or above human levels. However, those games are assumed to have a small fixed number of actions and could be trained with a simple CNN network. In this paper, we study a special class of Asian popular card games called Dou Di Zhu, in which two adversarial groups of agents must consider numerous card combinations at each time step, leading to huge number of actions. We propose a novel method to handle combinatorial actions, which we call combinatorial Q-learning (CQL). We employ a two-stage network to reduce action space and also leverage order-invariant max-pooling operations to extract relationships between primitive actions. Results show that our method prevails over other baseline learning algorithms like naive Q-learning and A3C. We develop an easy-to-use card game environments and train all agents adversarially from sractch, with only knowledge of game rules and verify that our agents are comparative to humans. Our code to reproduce all reported results is available on github. com/qq456cvb/doudizhu-C.

aaai.org

展开收起

被引用次数：30 相关文章所有 9 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

Google学术搜索按钮

安装不用了

example.edu/paper.pdf

搜索

获取 PDF 文件

引用

References

高级搜索

QQ 群

Combinatorial q-learning for dou di zhu

引用