Put your money where your mouth is: Evaluating strategic planning and execution of llm agents in an auction arena

J Chen, S Yuan, R Ye, BP Majumder… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advancements in Large Language Models (LLMs) showcase advanced reasoning,
yet NLP evaluations often depend on static benchmarks. Evaluating this necessitates …

LLM as a Mastermind: A Survey of Strategic Reasoning with Large Language Models

Y Zhang, S Mao, T Ge, X Wang, A de Wynter… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents a comprehensive survey of the current status and opportunities for
Large Language Models (LLMs) in strategic reasoning, a sophisticated form of reasoning …

Efficacy of Language Model Self-Play in Non-Zero-Sum Games

A Liao, N Tomlin, D Klein - arXiv preprint arXiv:2406.18872, 2024 - arxiv.org
Game-playing agents like AlphaGo have achieved superhuman performance through self-
play, which is theoretically guaranteed to yield optimal policies in competitive games …

Large model strategic thinking, small model efficiency: transferring theory of mind in large language models

N Lore, S Ilami, B Heydari - arXiv preprint arXiv:2408.05241, 2024 - arxiv.org
As the performance of larger, newer Large Language Models continues to improve for
strategic Theory of Mind (ToM) tasks, the demand for these state-of-the-art models increases …

Policy Space Response Oracles: A Survey

A Bighashdel, Y Wang, S McAleer, R Savani… - arXiv preprint arXiv …, 2024 - arxiv.org
In game theory, a game refers to a model of interaction among rational decision-makers or
players, making choices with the goal of achieving their individual objectives. Understanding …

Generative AI for Game Theory-based Mobile Networking

L He, G Sun, D Niyato, H Du, F Mei, J Kang… - arXiv preprint arXiv …, 2024 - arxiv.org
With the continuous advancement of network technology, various emerging complex
networking optimization problems opened up a wide range of applications utilizating of …

Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models

RR Maura-Rivero, C Nagpal, R Patel… - arXiv preprint arXiv …, 2025 - arxiv.org
Current methods that train large language models (LLMs) with reinforcement learning
feedback, often resort to averaging outputs of multiple rewards functions during training. This …

Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning

Y Zhang, S Mao, W Wu, Y Xia, T Ge, M Lan… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper introduces BI-Directional DEliberation Reasoning (BIDDER), a novel reasoning
approach to enhance the decision rationality of language models. Traditional reasoning …

Game-theoretic LLM: Agent Workflow for Negotiation Games

W Hua, O Liu, L Li, A Amayuelas, J Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper investigates the rationality of large language models (LLMs) in strategic decision-
making contexts, specifically within the framework of game theory. We evaluate several state …

Mechanism Design for LLM Fine-tuning with Multiple Reward Models

H Sun, Y Chen, S Wang, W Chen, X Deng - arXiv preprint arXiv …, 2024 - arxiv.org
Recent research on fine-tuning large language models (LLMs) through the aggregation of
multiple preferences has attracted considerable attention. However, the existing literature …