S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision- making tasks. However, safety concerns are raised during deploying RL in real-world …
J He, H Zhao, D Zhou, Q Gu - International Conference on …, 2023 - proceedings.mlr.press
We study reinforcement learning (RL) with linear function approximation. For episodic time- inhomogeneous linear Markov decision processes (linear MDPs) whose transition …
Achieving sample efficiency in online episodic reinforcement learning (RL) requires optimally balancing exploration and exploitation. When it comes to a finite-horizon episodic …
In online reinforcement learning (RL), efficient exploration remains particularly challenging in high-dimensional environments with sparse rewards. In low-dimensional environments …
Y Min, J He, T Wang, Q Gu - International Conference on …, 2022 - proceedings.mlr.press
We study the stochastic shortest path (SSP) problem in reinforcement learning with linear function approximation, where the transition kernel is represented as a linear mixture of …
C Zhao, R Yang, B Wang… - Advances in Neural …, 2024 - proceedings.neurips.cc
In this work, we study the low-rank MDPs with adversarially changed losses in the full- information feedback setting. In particular, the unknown transition probability kernel admits a …
Y Chen, J He, Q Gu - International Conference on Machine …, 2022 - proceedings.mlr.press
We study reinforcement learning for infinite-horizon discounted linear kernel MDPs, where the transition probability function is linear in a predefined feature mapping. Existing …
The exploration-exploitation dilemma has been a central challenge in reinforcement learning (RL) with complex model classes. In this paper, we propose a new algorithm …