Temporally Extended Metrics for Markov Decision Processes.

[PDF] neurips.cc

MICo: Improved representations via sampling-based state similarity for Markov decision processes

PS Castro, T Kastner… - Advances in Neural …, 2021 - proceedings.neurips.cc

We present a new behavioural distance over the state space of a Markov decision process,
and demonstrate the use of this distance as an effective means of shaping the learnt …

Cited by 57 · Related articles · All 6 versions

[PDF] enseeiht.fr

[BOOK][B] Distributional reinforcement learning

MG Bellemare, W Dabney, M Rowland - 2023 - books.google.com

The first comprehensive guide to distributional reinforcement learning, providing a new
mathematical formalism for thinking about decisions from a probabilistic perspective …

Cited by 124 · Related articles · All 9 versions

[PDF] springer.com

Learning model checking and the kernel trick for signal temporal logic on stochastic processes

L Bortolussi, GM Gallo, J Křetínský, L Nenzi - International Conference on …, 2022 - Springer

We introduce a similarity function on formulae of signal temporal logic (STL). It comes in the
form of a kernel function, well known in machine learning as a conceptually and …

Cited by 10 · Related articles · All 6 versions

[PDF] arxiv.org

stl2vec: Semantic and Interpretable Vector Representation of Temporal Logic

G Saveri, L Nenzi, L Bortolussi, J Křetínský - arXiv preprint arXiv …, 2024 - arxiv.org

Integrating symbolic knowledge and data-driven learning algorithms is a longstanding
challenge in Artificial Intelligence. Despite the recognized importance of this task, a notable …
