Scaling up q-learning via exploiting state–action equivalence

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

我的图书馆

Scaling up q-learning via exploiting state–action equivalence

在引用文章中搜索

[PDF] researchsquare.com

An efficient algorithm for optimal route node sensing in smart tourism Urban traffic based on priority constraints

X Ding, R Yao, E Khezri - Wireless Networks, 2023 - Springer

The public transportation system is now dealing with a number of problems brought on by
the sharp increase in automobile ownership in cities as well as the buildup of vehicles as a …

被引用次数：33 相关文章所有 2 个版本

[PDF] arxiv.org

A simple approach for state-action abstraction using a learned mdp homomorphism

AN Mavor-Parker, MJ Sargent, A Banino… - arXiv preprint arXiv …, 2022 - arxiv.org

Animals are able to rapidly infer from limited experience when sets of state action pairs have
equivalent reward and transition dynamics. On the other hand, modern reinforcement …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?

OA Maillard, MS Talebi - arXiv preprint arXiv:2407.15662, 2024 - arxiv.org

We consider the situation when a learner faces a set of unknown discrete distributions
$(p_k) _ {k\in\mathcal K} $ defined over a common alphabet $\mathcal X $, and can build for …

Using Forwards-Backwards Models to Approximate MDP Homomorphisms

AN Mavor-Parker, MJ Sargent, A Banino, L Griffin… - openreview.net

Animals are able to rapidly infer, from limited experience, when sets of state-action pairs
have equivalent reward and transition dynamics. On the other hand, modern reinforcement …

高级搜索

QQ 群

Scaling up q-learning via exploiting state–action equivalence

An efficient algorithm for optimal route node sensing in smart tourism Urban traffic based on priority constraints

A simple approach for state-action abstraction using a learned mdp homomorphism

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?

Using Forwards-Backwards Models to Approximate MDP Homomorphisms

引用