An efficient algorithm for optimal route node sensing in smart tourism Urban traffic based on priority constraints

X Ding, R Yao, E Khezri - Wireless Networks, 2023 - Springer
The public transportation system is now dealing with a number of problems brought on by
the sharp increase in automobile ownership in cities as well as the buildup of vehicles as a …

A simple approach for state-action abstraction using a learned MDP homomorphism

AN Mavor-Parker, MJ Sargent, A Banino… - arXiv preprint arXiv …, 2022 - arxiv.org
Animals are able to rapidly infer from limited experience when sets of state action pairs have
equivalent reward and transition dynamics. On the other hand, modern reinforcement …

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?

OA Maillard, MS Talebi - arXiv preprint arXiv:2407.15662, 2024 - arxiv.org
We consider the situation when a learner faces a set of unknown discrete distributions
$(p_k)_{k\in\mathcal{K}}$ defined over a common alphabet $\mathcal{X}$, and can build for …

Using Forwards-Backwards Models to Approximate MDP Homomorphisms

Animals are able to rapidly infer, from limited experience, when sets of state-action pairs
have equivalent reward and transition dynamics. On the other hand, modern reinforcement …