Exploring the first-move balance point of Go-Moku based on reinforcement learning and Monte Carlo tree search

P Liu, J Zhou, J Lv - Knowledge-Based Systems, 2023 - Elsevier
In most chess games without additional rule restrictions, the side that makes the first move
(ie, the first-move side) has an absolute advantage, which affects the game's balance to a …

Efficiently mastering the game of nogo with deep reinforcement learning supported by domain knowledge

Y Gao, L Wu - Electronics, 2021 - mdpi.com
Computer games have been regarded as an important field of artificial intelligence (AI) for a
long time. The AlphaZero structure has been successful in the game of Go, beating the top …

Increasing biases can be more efficient than increasing weights

C Metta, M Fantozzi, A Papini… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce a novel computational unit for neural networks that features multiple biases,
challenging the traditional perceptron structure. This unit emphasizes the importance of …

Transfer of fully convolutional policy-value networks between games and game variants

DJNJ Soemers, V Mella, E Piette… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we use fully convolutional architectures in AlphaZero-like self-play training
setups to facilitate transfer between variants of board games as well as distinct games. We …

Manipulating the distributions of experience used for self-play learning in expert iteration

DJNJ Soemers, E Piette… - … IEEE Conference on …, 2020 - ieeexplore.ieee.org
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-
play. ExIt involves training a policy to mimic the search behaviour of a tree search algorithm …

GloNets: globally connected neural networks

A Di Cecco, C Metta, M Fantozzi, F Morandin… - … on Intelligent Data …, 2024 - Springer
Deep learning architectures suffer from depth-related performance degradation, limiting the
effective depth of neural networks. Approaches like ResNet are able to mitigate this, but they …

Derived metrics for the game of Go–intrinsic network strength assessment and cheat-detection

A Egri-Nagy, A Törmänen - 2020 Eighth International …, 2020 - ieeexplore.ieee.org
The widespread availability of superhuman AI engines is changing how we play the ancient
game of Go. The open-source software packages developed after the AlphaGo series …

[HTML][HTML] Spatial state-action features for general games

DJNJ Soemers, É Piette, M Stephenson, C Browne - Artificial Intelligence, 2023 - Elsevier
In many board games and other abstract games, patterns have been used as features that
can guide automated game-playing agents. Such patterns or features often represent …

Can Go AIs be adversarially robust?

T Tseng, E McLean, K Pelrine, TT Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Prior work found that superhuman Go AIs like KataGo can be defeated by simple adversarial
strategies. In this paper, we study if simple defenses can improve KataGo's worst-case …

A statistical approach for detecting AI-assisted cheating in the game of Go

J Park, J Im, S On, SJ Lee, J Lee - Journal of the Korean Physical Society, 2022 - Springer
Abstract Since the AlphaGo Zero paper was published, many superhuman Go AIs followed,
some with open sources and easy-to-use interfaces. Currently, it is impossible even for the …