Y Du, L Huang, W Sun - International Conference on …, 2023 - proceedings.mlr.press
Despite the recent success of representation learning in sequential decision making, the study of the pure exploration scenario (ie, identify the best option and minimize the sample …
N Golowich, A Moitra - Conference on Learning Theory, 2022 - proceedings.mlr.press
Despite rapid progress in theoretical reinforcement learning (RL) over the last few years, most of the known guarantees are worst-case in nature, failing to take advantage of structure …
R Zhou, R Wang, SS Du - International Conference on …, 2023 - proceedings.mlr.press
We study regret minimization for reinforcement learning (RL) in Latent Markov Decision Processes (LMDPs) with context in hindsight. We design a novel model-based algorithmic …
H Ye, X Chen, L Wang, SS Du - International Conference on …, 2023 - proceedings.mlr.press
Abstract Generalization in Reinforcement Learning (RL) aims to train an agent during training that generalizes to the target environment. In this work, we first point out that RL …
Existing theoretical studies on offline reinforcement learning (RL) mostly consider a dataset sampled directly from the target task. In practice, however, data often come from several …
We study the problem of online multi-task learning where the tasks are performed within similar but not necessarily identical multi-armed bandit environments. In particular, we study …
Y Zhang, S Ying, Z Wen - Neural Computing and Applications, 2022 - Springer
In many real-world applications, collecting and labeling the data is expensive and time- consuming. Thus, there is a need to obtain a high-performance learner by leveraging the …
The ability to leverage shared behaviors between tasks is critical for sample-efficient multi- task reinforcement learning (MTRL). While prior methods have primarily explored parameter …
Multitask Reinforcement Learning (MTRL) approaches have gained increasing attention for its wide applications in many important Reinforcement Learning (RL) tasks. However, while …