Sample-based tree search with fixed and adaptive state abstractions

S Yang-Zhao, T Wang, KS Ng - Advances in Neural …, 2022 - proceedings.neurips.cc

We propose a practical integration of logical state abstraction with AIXI, a Bayesian
optimality notion for reinforcement learning agents, to significantly expand the model class …

被引用次数：4 相关文章所有 4 个版本

[PDF] aaai.org

Dynamic Knowledge Injection for AIXI Agents

S Yang-Zhao, KS Ng, M Hutter - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

Prior approximations of AIXI, a Bayesian optimality notion for general reinforcement
learning, can only approximate AIXI's Bayesian environment model using an a-priori defined …

被引用次数：1 相关文章所有 2 个版本

[PDF] neurips.cc

Monte carlo tree search with iteratively refining state abstractions

S Sokota, CY Ho, Z Ahmad… - Advances in Neural …, 2021 - proceedings.neurips.cc

Decision-time planning is the process of constructing a transient, local policy with the intent
of using it to make the immediate decision. Monte Carlo tree search (MCTS), which has …

被引用次数：10 相关文章所有 5 个版本

[PDF] arxiv.org

Elastic monte carlo tree search with state abstraction for strategy game playing

L Xu, J Hurtado-Grueso, D Jeurissen… - … IEEE Conference on …, 2022 - ieeexplore.ieee.org

Strategy video games challenge AI agents with their combinatorial search space caused by
complex game elements. State abstraction is a popular technique that reduces the state …

被引用次数：6 相关文章所有 10 个版本

[PDF] justinsvegliato.com

Selecting the partial state abstractions of MDPs: A metareasoning approach with deep reinforcement learning

SB Nashed, J Svegliato, A Bhatia… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org

Markov decision processes (MDPs) are a common general-purpose model used in robotics
for representing sequential decision-making problems. Given the complexity of robotics …

被引用次数：4 相关文章所有 9 个版本

[PDF] qmul.ac.uk

Game state and action abstracting monte carlo tree search for general strategy game-playing

A Dockhorn, J Hurtado-Grueso… - … IEEE Conference on …, 2021 - ieeexplore.ieee.org

When implementing intelligent agents for strategy games, we observe that search-based
methods struggle with the complexity of such games. To tackle this problem, we propose a …

被引用次数：7 相关文章所有 9 个版本

[PDF] qmul.ac.uk

Elastic Monte Carlo Tree Search

L Xu, A Dockhorn… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Strategy games are a challenge for the design of artificial intelligence agents due to their
complexity and the combinatorial search space they produce. State abstraction has been …

被引用次数：1 相关文章所有 5 个版本

[PDF] oajaiml.com

[PDF][PDF] Goal Agnostic Learning and Planning without Reward Functions.

CK Robinson, J Lancaster - Adv. Artif. Intell. Mach. Learn., 2023 - oajaiml.com

In this paper we present an algorithm, the Goal Agnostic Planner (GAP), which combines
elements of Reinforcement Learning (RL) and Markov Decision Processes (MDPs) into an …

Learning to Solve Sequential Planning Problems Without Rewards

C Robinson - Proceedings of the Future Technologies Conference, 2022 - Springer

In this paper we present an algorithm, the Goal Agnostic Planner (GAP), which combines
elements of Reinforcement Learning (RL) and Markov Decision Processes (MDPs) into an …

On state aggregation in a class of cyber physical energy systems

J Wu, QS Jia - 2018 37th Chinese Control Conference (CCC), 2018 - ieeexplore.ieee.org

Various cyber physical energy systems (CPES) are discrete event dynamic systems (DEDS),
in which the states are discrete and the dynamics of state transitions are triggered by events …

被引用次数：1 相关文章所有 2 个版本

高级搜索

QQ 群