Kernel-based reinforcement learning on representative states

G Theocharous, A Hallak - 2013 - Citeseer

In many marketing applications, companies use technology for interacting with their
customers and making product or services recommendations. Today, these marketing …

被引用次数：154 相关文章

[PDF] ieee.org

Barrier-certified adaptive reinforcement learning with applications to brushbot navigation

M Ohnishi, L Wang, G Notomista… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

This paper presents a safe learning framework that employs an adaptive model learning
algorithm together with barrier certificates for systems with possibly nonstationary agent …

被引用次数：83 相关文章所有 6 个版本

[PDF] mlr.press

A kernel-based approach to non-stationary reinforcement learning in metric spaces

OD Domingues, P Ménard, M Pirotta… - International …, 2021 - proceedings.mlr.press

In this work, we propose KeRNS: an algorithm for episodic reinforcement learning in non-
stationary Markov Decision Processes (MDPs) whose state-action set is endowed with a …

被引用次数：39 相关文章所有 11 个版本

[PDF] arxiv.org

Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains

Y Pan, M Zaheer, A White, A Patterson… - arXiv preprint arXiv …, 2018 - arxiv.org

Model-based strategies for control are critical to obtain sample efficient learning. Dyna is a
planning paradigm that naturally interleaves learning and planning, by simulating one-step …

被引用次数：55 相关文章所有 8 个版本

[PDF] arxiv.org

Unlocking the power of representations in long-term novelty-based exploration

A Saade, S Kapturowski, D Calandriello… - arXiv preprint arXiv …, 2023 - arxiv.org

We introduce Robust Exploration via Clustering-based Online Density Estimation
(RECODE), a non-parametric method for novelty-based exploration that estimates visitation …

被引用次数：4 相关文章所有 3 个版本

[PDF] jmlr.org

Practical kernel-based reinforcement learning

AMS Barreto, D Precup, J Pineau - Journal of Machine Learning Research, 2016 - jmlr.org

Kernel-based reinforcement learning (KBRL) stands out among approximate reinforcement
learning algorithms for its strong theoretical guarantees. By casting the learning problem as …

被引用次数：52 相关文章所有 9 个版本

Quantized attention-gated kernel reinforcement learning for brain–machine interface decoding

F Wang, Y Wang, K Xu, H Li, Y Liao… - IEEE transactions on …, 2015 - ieeexplore.ieee.org

Reinforcement learning (RL)-based decoders in brain-machine interfaces (BMIs) interpret
dynamic neural activity without patients' real limb movements. In conventional RL, the goal …

被引用次数：36 相关文章所有 4 个版本

[PDF] psu.edu

Off-policy reinforcement learning with gaussian processes

G Chowdhary, M Liu, R Grande, T Walsh… - IEEE/CAA Journal of …, 2014 - ieeexplore.ieee.org

An off-policy Bayesian nonparameteric approximate reinforcement learning framework,
termed as GPQ, that employs a Gaussian processes (GP) model of the value (Q) function is …

被引用次数：43 相关文章所有 12 个版本

[PDF] aaai.org

Compressed conditional mean embeddings for model-based reinforcement learning

G Lever, J Shawe-Taylor, R Stafford… - Proceedings of the AAAI …, 2016 - ojs.aaai.org

We present a model-based approach to solving Markov decision processes (MDPs) in which
the system dynamics are learned using conditional mean embeddings (CMEs). This class of …

被引用次数：31 相关文章所有 6 个版本

[PDF] arxiv.org

A self-taught artificial agent for multi-physics computational model personalization

D Neumann, T Mansi, L Itu, B Georgescu… - Medical image …, 2016 - Elsevier

Personalization is the process of fitting a model to patient data, a critical step towards
application of multi-physics computational models in clinical practice. Designing robust …

被引用次数：31 相关文章所有 14 个版本

高级搜索

QQ 群