多智能体博弈学习研究进展.

罗俊仁, 张万鹏, 苏炯铭, 袁唯淋… - Systems Engineering & …, 2024 - search.ebscohost.com
随着深度学习和强化学习而来的人工智能新浪潮, 为智能体从感知输入到行动决策输出提供了“
端到端” 解决方案. 多智能体学习是研究智能博弈对抗的前沿课题, 面临着对抗性环境 …

Learning in games: a systematic review

RJ Qin, Y Yu - Science China Information Sciences, 2024 - Springer
Game theory studies the mathematical models for self-interested individuals. Nash
equilibrium is arguably the most central solution in game theory. While finding the Nash …

Estimating -Rank from A Few Entries with Low Rank Matrix Completion

Y Du, X Yan, X Chen, J Wang… - … Conference on Machine …, 2021 - proceedings.mlr.press
Multi-agent evaluation aims at the assessment of an agent's strategy on the basis of
interaction with others. Typically, existing methods such as $\alpha $-rank and its …

Game Theoretic Rating in N-player general-sum games with Equilibria

L Marris, M Lanctot, I Gemp, S Omidshafiei… - arXiv preprint arXiv …, 2022 - arxiv.org
Rating strategies in a game is an important area of research in game theory and artificial
intelligence, and can be applied to any real-world competitive or cooperative setting …

Learning to identify top elo ratings: A dueling bandits approach

X Yan, Y Du, B Ru, J Wang, H Zhang… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
The Elo rating system is widely adopted to evaluate the skills of (chess) game and sports
players. Recently it has been also integrated into machine learning algorithms in evaluating …

Reinforcement Nash Equilibrium Solver

X Wang, C Yang, S Li, P Li, X Huang, H Chan… - arXiv preprint arXiv …, 2024 - arxiv.org
Nash Equilibrium (NE) is the canonical solution concept of game theory, which provides an
elegant tool to understand the rationalities. Though mixed strategy NE exists in any game …

Multi-agent evaluation for energy management by practically scaling α-rank

Y Sun, S Zhang, M Liu, R Zheng, S Dong… - Frontiers of Information …, 2024 - Springer
Currently, decarbonization has become an emerging trend in the power system arena.
However, the increasing number of photovoltaic units distributed into a distribution network …

Population-based Multi-agent Evaluation for Large-scale Voltage Control

C Jin, S Zhang, M Liu, R Zheng… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Under the purpose of achieving the optimal voltage control strategy in power grid system,
multi-agent evaluation algorithms like a-rank are widely used. However, in large-scale …

-Rank-Collections: Analyzing Expected Strategic Behavior with Uncertain Utilities

FR Pieroth, M Bichler - arXiv preprint arXiv:2211.10317, 2022 - arxiv.org
Game theory largely rests on the availability of cardinal utility functions. In contrast, only
ordinal preferences are elicited in fields such as matching under preferences. The literature …

Multiagent Training in N-Player General-Sum Games

L Marris - 2024 - discovery.ucl.ac.uk
Recent successes in two-player, purely competitive (so-called zero-sum) games has made
headlines around the world. Initially, perfect information games, with vast state spaces, such …