Learning Nash equilibria in zero-sum stochastic games via entropy-regularized policy approximation

PK Sharma, R Fernandez, E Zaroukian… - … learning for multi …, 2021 - spiedigitallibrary.org

Much work has been dedicated to the exploration of Multi-Agent Reinforcement Learning
(MARL) paradigms implementing a centralized learning with decentralized execution …

被引用次数：50 相关文章所有 7 个版本

[PDF] neurips.cc

Finite-sample analysis of off-policy TD-learning via generalized Bellman operators

Z Chen, ST Maguluri, S Shakkottai… - Advances in Neural …, 2021 - proceedings.neurips.cc

In TD-learning, off-policy sampling is known to be more practical than on-policy sampling,
and by decoupling learning from data collection, it enables data reuse. It is known that policy …

被引用次数：13 相关文章所有 9 个版本

[PDF] northumbria.ac.uk

Solving robotic trajectory sequential writing problem via learning character's structural and sequential information

Q Li, Z Guo, F Chao, X Chang, L Yang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org

The writing sequence of numerals or letters often affects aesthetic aspects of the writing
outcomes. As such, it remains a challenge for robotic calligraphy systems to perform …

被引用次数：4 相关文章所有 7 个版本

[PDF] arxiv.org

Exploratory control with Tsallis entropy for latent factor models

R Donnelly, S Jaimungal - SIAM Journal on Financial Mathematics, 2024 - SIAM

We study optimal control in models with latent factors where the agent controls the
distribution over actions, rather than actions themselves, in both discrete and continuous …

被引用次数：6 相关文章所有 7 个版本

[PDF] arxiv.org

Approximate Nash Equilibrium Learning for n-Player Markov Games in Dynamic Pricing

L Liu - EPIA Conference on Artificial Intelligence, 2024 - Springer

We investigate Nash equilibrium learning in a competitive Markov Game (MG) environment,
where multiple agents compete, and multiple Nash equilibria can exist. In particular, for an …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

A Coupled Optimization Framework for Correlated Equilibria in Normal-Form Game

SHQ Li, Y Yu, F Dörfler, J Lygeros - arXiv preprint arXiv:2403.16223, 2024 - arxiv.org

In competitive multi-player interactions, simultaneous optimality is a key requirement for
establishing strategic equilibria. This property is explicit when the game-theoretic …

Shaping large population agent behaviors through entropy-regularized mean-field games

Y Guan, M Zhou, A Pakniyat… - 2022 American Control …, 2022 - ieeexplore.ieee.org

Mean-field games (MFG) were introduced to efficiently analyze approximate Nash equilibria
in large population settings. In this work, we consider entropy-regularized mean-field games …

被引用次数：3 相关文章所有 6 个版本

[PDF] gatech.edu

[PDF][PDF] I. Earned Degrees

P TSIOTRAS - dcsl.gatech.edu

Coordinated all campus activities in the area of autonomy. Administered the IRIM Seed
Grants program, established and administered the IRIM Visiting Faculty and Resident …

高级搜索

QQ 群