Reinforcement learning can be more efficient with multiple rewards

C Dann, Y Mansour, M Mohri - International Conference on …, 2023 - proceedings.mlr.press
Reward design is one of the most critical and challenging aspects of formulating a task
as a reinforcement learning (RL) problem. In practice, it often takes several attempts of …

Multi-task representation learning for pure exploration in linear bandits

Y Du, L Huang, W Sun - International Conference on …, 2023 - proceedings.mlr.press
Despite the recent success of representation learning in sequential decision making, the
study of the pure exploration scenario (i.e., identify the best option and minimize the sample …

Can Q-learning be improved with advice?

N Golowich, A Moitra - Conference on Learning Theory, 2022 - proceedings.mlr.press
Despite rapid progress in theoretical reinforcement learning (RL) over the last few years,
most of the known guarantees are worst-case in nature, failing to take advantage of structure …

Horizon-free and variance-dependent reinforcement learning for latent Markov decision processes

R Zhou, R Wang, SS Du - International Conference on …, 2023 - proceedings.mlr.press
We study regret minimization for reinforcement learning (RL) in Latent Markov Decision
Processes (LMDPs) with context in hindsight. We design a novel model-based algorithmic …

On the power of pre-training for generalization in RL: provable benefits and hardness

H Ye, X Chen, L Wang, SS Du - International Conference on …, 2023 - proceedings.mlr.press
Generalization in Reinforcement Learning (RL) aims to train an agent that
generalizes to the target environment. In this work, we first point out that RL …

Provably efficient offline reinforcement learning with perturbed data sources

C Shi, W Xiong, C Shen, J Yang - … Conference on Machine …, 2023 - proceedings.mlr.press
Existing theoretical studies on offline reinforcement learning (RL) mostly consider a dataset
sampled directly from the target task. In practice, however, data often come from several …

Thompson sampling for robust transfer in multi-task bandits

Z Wang, C Zhang, K Chaudhuri - arXiv preprint arXiv:2206.08556, 2022 - arxiv.org
We study the problem of online multi-task learning where the tasks are performed within
similar but not necessarily identical multi-armed bandit environments. In particular, we study …

Multitask transfer learning with kernel representation

Y Zhang, S Ying, Z Wen - Neural Computing and Applications, 2022 - Springer
In many real-world applications, collecting and labeling data is expensive and time-
consuming. Thus, there is a need to obtain a high-performance learner by leveraging the …

Efficient multi-task reinforcement learning via selective behavior sharing

G Zhang, A Jain, I Hwang, SH Sun, JJ Lim - arXiv preprint arXiv …, 2023 - arxiv.org
The ability to leverage shared behaviors between tasks is critical for sample-efficient multi-
task reinforcement learning (MTRL). While prior methods have primarily explored parameter …

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

Z Xu, Z Xu, R Jiang, P Stone, A Tewari - arXiv preprint arXiv:2403.01636, 2024 - arxiv.org
Multitask Reinforcement Learning (MTRL) approaches have gained increasing attention for
their wide applications in many important Reinforcement Learning (RL) tasks. However, while …