Dynamic potential games with constraints: Fundamentals and applications in communications

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1669 相关文章所有 8 个版本

[PDF] arxiv.org

Decentralized multi-agent reinforcement learning with networked agents: Recent advances

K Zhang, Z Yang, T Başar - Frontiers of Information Technology & …, 2021 - Springer

Multi-agent reinforcement learning (MARL) has long been a significant research topic in
both machine learning and control systems. Recent development of (single-agent) deep …

被引用次数：101 相关文章所有 9 个版本

[PDF] mlr.press

Independent policy gradient for large-scale markov potential games: Sharper rates, function approximation, and game-agnostic convergence

D Ding, CY Wei, K Zhang… - … Conference on Machine …, 2022 - proceedings.mlr.press

We examine global non-asymptotic convergence properties of policy gradient methods for
multi-agent reinforcement learning (RL) problems in Markov potential games (MPGs). To …

被引用次数：85 相关文章所有 8 个版本

A potential game approach to distributed operational optimization for microgrid energy management with renewable energy and demand response

J Zeng, Q Wang, J Liu, J Chen… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org

In view of Internet of Energy, advanced operational optimization in microgrid energy
management system (MEMS) is expected to be scalable to accommodate various …

被引用次数：103 相关文章

[PDF] arxiv.org

Gradient play in stochastic games: stationary points, convergence, and sample complexity

R Zhang, Z Ren, N Li - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org

We study the performance of the gradient play algorithm for stochastic games (SGs), where
each agent tries to maximize its own total discounted reward by making decisions …

被引用次数：48 相关文章所有 4 个版本

Dynamic cluster formation game for attributed graph clustering

Z Bu, HJ Li, J Cao, Z Wang… - IEEE transactions on …, 2017 - ieeexplore.ieee.org

Besides the topological structure, there are additional information, ie, node attributes, on top
of the plain graphs. Usually, these systems can be well modeled by attributed graphs, where …

被引用次数：108 相关文章所有 3 个版本

[PDF] arxiv.org

Distributed potential ilqr: Scalable game-theoretic trajectory planning for multi-agent interactions

Z Williams, J Chen, N Mehr - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

In this work, we develop a scalable, local tra-jectory optimization algorithm that enables
robots to interact with other robots. It has been shown that agents' interactions can be …

被引用次数：21 相关文章所有 5 个版本

[PDF] neurips.cc

On the global convergence rates of decentralized softmax gradient play in markov potential games

R Zhang, J Mei, B Dai… - Advances in Neural …, 2022 - proceedings.neurips.cc

Softmax policy gradient is a popular algorithm for policy optimization in single-agent
reinforcement learning, particularly since projection is not needed for each gradient update …

被引用次数：27 相关文章所有 8 个版本

[PDF] qut.edu.au

Solution sets for inverse non-cooperative linear-quadratic differential games

J Inga, E Bischoff, TL Molloy, M Flad… - IEEE Control Systems …, 2019 - ieeexplore.ieee.org

This letter addresses the inverse problem of differential games, where the aim is to compute
cost functions which lead to observed Nash equilibrium trajectories. The solution of this …

被引用次数：49 相关文章所有 5 个版本

[PDF] aaai.org

Learning game-theoretic models of multiagent trajectories using implicit layers

P Geiger, CN Straehle - Proceedings of the AAAI Conference on …, 2021 - ojs.aaai.org

For prediction of interacting agents' trajectories, we propose an end-to-end trainable
architecture that hybridizes neural nets with game-theoretic reasoning, has interpretable …

被引用次数：31 相关文章所有 5 个版本

高级搜索

QQ 群