Hierarchies of reward machines

D Furelos-Blanco, M Law, A Jonsson… - International …, 2023 - proceedings.mlr.press
Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …

Learning task automata for reinforcement learning using hidden Markov models

A Abate, Y Almulla, J Fox, D Hyland, M Wooldridge - ECAI 2023, 2023 - ebooks.iospress.nl
Training reinforcement learning (RL) agents using scalar reward signals is often infeasible
when an environment has sparse and non-Markovian rewards. Moreover, handcrafting …

Noisy symbolic abstractions for deep RL: A case study with reward machines

AC Li, Z Chen, P Vaezipoor, TQ Klassen… - arXiv preprint arXiv …, 2022 - arxiv.org
Natural and formal languages provide an effective mechanism for humans to specify
instructions and reward functions. We investigate how to generate policies via RL when …

Exploration in reward machines with low regret

H Bourel, A Jonsson, OA Maillard… - International …, 2023 - proceedings.mlr.press
We study reinforcement learning (RL) for decision processes with non-Markovian reward, in
which high-level knowledge in the form of reward machines is available to the learner …

Grounding LTLf specifications in image sequences

E Umili, R Capobianco… - Proceedings of the …, 2023 - proceedings.kr.org
A critical challenge in neuro-symbolic (NeSy) approaches is to handle the symbol grounding
problem without direct supervision. That is mapping high-dimensional raw data into an …

Reward Machines for Deep RL in Noisy and Uncertain Environments

AC Li, Z Chen, TQ Klassen, P Vaezipoor… - arXiv preprint arXiv …, 2024 - arxiv.org
Reward Machines provide an automata-inspired structure for specifying instructions, safety
constraints, and other temporally extended reward-worthy behaviour. By exposing complex …

Neural Reward Machines

E Umili, F Argenziano, R Capobianco - arXiv preprint arXiv:2408.08677, 2024 - arxiv.org
Non-markovian Reinforcement Learning (RL) tasks are very hard to solve, because agents
must consider the entire history of state-action pairs to act rationally in the environment. Most …

Learning Robust Reward Machines from Noisy Labels

R Parac, L Nodari, L Ardon, D Furelos-Blanco… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents PROB-IRM, an approach that learns robust reward machines (RMs) for
reinforcement learning (RL) agents from noisy execution traces. The key aspect of RM …

Discovering logical knowledge in non-symbolic domains

E Umili - 2023 - iris.uniroma1.it
Deep learning and symbolic artificial intelligence remain the two main paradigms in Artificial
Intelligence (AI), each presenting their own strengths and weaknesses. Artificial agents …