作者
Yijie Zhang, Roxana Radulescu, Patrick Mannion, Diederik Martin Roijers, Ann Nowé
发表日期
2020/5/11
研讨会论文
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
简介
In this paper, we investigate the effects of opponent modelling on multi-objective multi-agent interactions with non-linear utilities. Specifically, we consider multi-objective normal form games (MON-FGs) with non-linear utility functions under the scalarised expected returns optimisation criterion. We contribute a novel actor-critic formulation to allow reinforcement learning of mixed strategies in this setting, along with an extension that incorporates opponent policy reconstruction using conditional action frequencies. Our empirical results demonstrate that opponent modelling can drastically alter the learning dynamics in this setting.
引用总数
20202021202220235454