Skynet: A top deep rl agent in the inaugural pommerman team competition

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer

Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

被引用次数：664 相关文章所有 8 个版本

[PDF] researchgate.net

[PDF][PDF] Is multiagent deep reinforcement learning the answer or the question? A brief survey

P Hernandez-Leal, B Kartal, ME Taylor - learning, 2018 - researchgate.net

Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

被引用次数：103 相关文章所有 2 个版本

[PDF] cell.com Full View

Progress and summary of reinforcement learning on energy management of MPS-EV

Y Lin, L Chu, J Hu, Z Hou, J Li, J Jiang, Y Zhang - Heliyon, 2023 - cell.com

The escalating environmental concerns and energy crisis caused by internal combustion
engines (ICE) have become unacceptable under environmental regulations and the energy …

被引用次数：1 相关文章所有 6 个版本

[PDF] mlr.press

Towards robust and domain agnostic reinforcement learning competitions: MineRL 2020

WH Guss, S Milani, N Topin… - NeurIPS 2020 …, 2021 - proceedings.mlr.press

Reinforcement learning competitions have formed the basis for standard research
benchmarks, galvanized advances in the state-of-the-art, and shaped the direction of the …

被引用次数：12 相关文章所有 4 个版本

[PDF] wiley.com

Bombalytics: Visualization of competition and collaboration strategies of players in a bomb laying game

S Agarwal, G Wallner, F Beck - Computer Graphics Forum, 2020 - Wiley Online Library

Competition and collaboration form complex interaction patterns between the agents and
objects involved. Only by understanding these interaction patterns, we can reveal the …

被引用次数：21 相关文章所有 9 个版本

[PDF] aaai.org

On hard exploration for reinforcement learning: A case study in pommerman

C Gao, B Kartal, P Hernandez-Leal… - Proceedings of the AAAI …, 2019 - ojs.aaai.org

How to best explore in domains with sparse, delayed, and deceptive rewards is an important
open problem for reinforcement learning (RL). This paper considers one such domain, the …

被引用次数：16 相关文章所有 9 个版本

[PDF] aaai.org

Analysis of statistical forward planning methods in Pommerman

D Perez-Liebana, RD Gaina, O Drageset… - Proceedings of the …, 2019 - ojs.aaai.org

Pommerman is a complex multi-player and partially observable game where agents try to be
the last standing to win. This game poses very interesting challenges to AI, such as …

被引用次数：10 相关文章所有 14 个版本

[PDF] arxiv.org

Accelerating training in pommerman with imitation and reinforcement learning

H Meisheri, O Shelke, R Verma, H Khadilkar - arXiv preprint arXiv …, 2019 - arxiv.org

The Pommerman simulation was recently developed to mimic the classic Japanese game
Bomberman, and focuses on competitive gameplay in a multi-agent setting. We focus on the …

被引用次数：9 相关文章所有 3 个版本

[PDF] ieee.org

Efficient searching with MCTS and imitation learning: a case study in Pommerman

H Yang, S Li, X Xu, X Liu, Z Meng, Y Zhang - IEEE Access, 2021 - ieeexplore.ieee.org

Pommerman is a popular reinforcement learning environment because it imposes several
challenges such as sparse and deceptive rewards and delayed action effects. In this paper …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Developing a Successful Bomberman Agent

D Kowalczyk, J Kowalski, H Obrzut, M Maras… - arXiv preprint arXiv …, 2022 - arxiv.org

In this paper, we study AI approaches to successfully play a 2-4 players, full information,
Bomberman variant published on the CodinGame platform. We compare the behavior of …

被引用次数：1 相关文章所有 7 个版本

高级搜索

QQ 群