MICo: Improved representations via sampling-based state similarity for Markov decision processes

PS Castro, T Kastner… - Advances in Neural …, 2021 - proceedings.neurips.cc
We present a new behavioural distance over the state space of a Markov decision process,
and demonstrate the use of this distance as an effective means of shaping the learnt …

[图书][B] Distributional reinforcement learning

MG Bellemare, W Dabney, M Rowland - 2023 - books.google.com
The first comprehensive guide to distributional reinforcement learning, providing a new
mathematical formalism for thinking about decisions from a probabilistic perspective …

Learning model checking and the kernel trick for signal temporal logic on stochastic processes

L Bortolussi, GM Gallo, J Křetínský, L Nenzi - International Conference on …, 2022 - Springer
We introduce a similarity function on formulae of signal temporal logic (STL). It comes in the
form of a kernel function, well known in machine learning as a conceptually and …

stl2vec: Semantic and Interpretable Vector Representation of Temporal Logic

G Saveri, L Nenzi, L Bortolussi, J Křetínský - arXiv preprint arXiv …, 2024 - arxiv.org
Integrating symbolic knowledge and data-driven learning algorithms is a longstanding
challenge in Artificial Intelligence. Despite the recognized importance of this task, a notable …