A unified view on multi-class support vector classification

T Glasmachers, C Igel - Journal of Machine Learning Research, 2016 - jmlr.org
Q-learning (QL) is a popular reinforcement learning algorithm that is guaranteed to converge
to optimal policies in Markov decision processes. However, QL exhibits an artifact: in …

Emotional multiagent reinforcement learning in spatial social dilemmas

C Yu, M Zhang, F Ren, G Tan - IEEE transactions on neural …, 2015 - ieeexplore.ieee.org
Social dilemmas have attracted extensive interest in the research of multiagent systems in
order to study the emergence of cooperative behaviors among selfish agents …

Addressing environment non-stationarity by repeating Q-learning updates

S Abdallah, M Kaisers - Journal of Machine Learning Research, 2016 - jmlr.org
In this paper, we present a new framework for large scale online kernel learning, making
kernel methods efficient and scalable for large-scale online learning applications. Unlike the …

Classes of Dilemma Problems and Their Multi-Agent Reinforcement Learning Method

Y Kuroe, H Iima - IEEE Access, 2024 - ieeexplore.ieee.org
Multi-agent systems appear in a wide variety of fields and there have been several studies
on multi-agent reinforcement learning. Dilemma problems are typical classes of multi-agent …

Spread of cooperation in complex agent networks based on expectation of cooperation

R Shibusawa, T Otsuka, T Sugawara - … on Principles and Practice of Multi …, 2016 - Springer
This paper proposes a behavioral strategy called expectation of cooperation with which
cooperation in the prisoner's dilemma game spreads over agent networks by incorporating …

Robust spread of cooperation by expectation-of-cooperation strategy with simple labeling method

T Otsuka, T Sugawara - Proceedings of the International Conference on …, 2017 - dl.acm.org
This paper proposes an interaction strategy called the extended expectation-of-cooperation
(EEoC) that is intended to spread cooperative activities in prisoner's dilemma situations over …

Evolution Direction of Reward Appraisal in Reinforcement Learning Agents

M Miyawaki, K Moriyama, A Mutoh, T Matsui… - Agents and Multi-Agent …, 2019 - Springer
Humans appraise the environment in daily life. We are implementing appraisal mechanisms
into reinforcement learning agents. One of such mechanisms we proposed is the utility …

[PDF][PDF] Emergence of cooperation in complex agent networks based on expectation of cooperation

R Shibusawa, T Otsuka, T Sugawara - AAMAS, 2016 - academia.edu
This paper proposes a behavioral strategy called expectation of cooperation strategy with
which cooperation in the prisoner's dilemma game emerges in agent networks by …

Running reinforcement learning agents on GPU for many simulations of two-person simultaneous games

K Moriyama, Y Kurogi, A Mutoh… - … on Agents (ICA), 2019 - ieeexplore.ieee.org
It is desirable for multi-agent simulation to be run in parallel; if many agents run
simultaneously, the total run time is reduced. It is popular to use GPGPU technology as an …

[图书][B] A context-based approach to detecting miscreant agent behavior in open mulitagent systems

LT Whitsel - 2013 - search.proquest.com
Agents in open multiagent systems (OMAS) are likely to have a social trust responsibility that
is not found when multiagent societies are primarily closed. Open systems are those that …