Ad recommendation systems for life-time value optimization

MM Afsar, T Crump, B Far - ACM Computing Surveys, 2022 - dl.acm.org

Recommender systems (RSs) have become an inseparable part of our everyday lives. They
help us find our favorite items to purchase, our friends on social networks, and our favorite …

被引用次数：406 相关文章所有 3 个版本

[PDF] arxiv.org

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

被引用次数：1769 相关文章所有 3 个版本

[PDF] 176.9.41.242

Autonomous navigation of stratospheric balloons using reinforcement learning

MG Bellemare, S Candido, PS Castro, J Gong… - Nature, 2020 - nature.com

Efficiently navigating a superpressure balloon in the stratosphere requires the integration of
a multitude of cues, such as wind speed and solar elevation, and the process is complicated …

被引用次数：362 相关文章所有 7 个版本

[PDF] arxiv.org

Reinforcement knowledge graph reasoning for explainable recommendation

Y Xian, Z Fu, S Muthukrishnan, G De Melo… - Proceedings of the 42nd …, 2019 - dl.acm.org

Recent advances in personalized recommendation have sparked great interest in the
exploitation of rich structured information provided by knowledge graphs. Unlike most …

被引用次数：509 相关文章所有 11 个版本

[PDF] arxiv.org

Deep reinforcement learning: An overview

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org

We give an overview of recent exciting achievements of deep reinforcement learning (RL).
We discuss six core elements, six important mechanisms, and twelve applications. We start …

被引用次数：1844 相关文章所有 6 个版本

[PDF] ieee.org

A gentle introduction to reinforcement learning and its application in different fields

M Naeem, STH Rizvi, A Coronato - IEEE access, 2020 - ieeexplore.ieee.org

Due to the recent progress in Deep Neural Networks, Reinforcement Learning (RL) has
become one of the most important and useful technology. It is a learning method where a …

被引用次数：172 相关文章所有 3 个版本

[PDF] arxiv.org

Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer

Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

被引用次数：338 相关文章所有 9 个版本

[PDF] mlr.press

Data-efficient off-policy policy evaluation for reinforcement learning

P Thomas, E Brunskill - International Conference on …, 2016 - proceedings.mlr.press

In this paper we present a new way of predicting the performance of a reinforcement
learning policy given historical data that may have been generated by a different policy. The …

被引用次数：728 相关文章所有 14 个版本

[PDF] neurips.cc

Bridging the gap between value and policy based reinforcement learning

O Nachum, M Norouzi, K Xu… - Advances in neural …, 2017 - proceedings.neurips.cc

We establish a new connection between value and policy based reinforcement learning
(RL) based on a relationship between softmax temporal value consistency and policy …

被引用次数：512 相关文章所有 14 个版本

[PDF] neurips.cc

Conservative data sharing for multi-task offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - Advances in …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …

被引用次数：75 相关文章所有 6 个版本

高级搜索

QQ 群