Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL R Yang, Y Lu, W Li, H Sun, M Fang, Y Du, X Li, L Han, C Zhang International Conference on Learning Representations (ICLR) 2022, 2022 | 59 | 2022 |
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization L Li, R Yang, D Luo International Conference on Learning Representations (ICLR) 2021, 2020 | 57 | 2020 |
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing R Yang*, C Bai*, X Ma, Z Wang, C Zhang, L Han Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022 | 47 | 2022 |
Exploiting Reward Shifting in Value-Based Deep RL H Sun, L Han, R Yang, X Ma, J Guo, B Zhou Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022 | 27* | 2022 |
MHER: Model-based Hindsight Experience Replay R Yang, M Fang, L Han, Y Du, F Luo, X Li NeurIPS 2021 Deep RL Workshop, 2021 | 21 | 2021 |
A survey on sparse reward algorithms in reinforcement learning: theory and experiment R Yang (杨瑞), J Yan (严江鹏), X Li (李秀) CAAI Transactions on Intelligent Systems (智能系统学报) 15 (5), 888-899, 2020 | 16* | 2020 |
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? R Yang, Y Lin, X Ma, H Hu, C Zhang, T Zhang International Conference on Machine Learning (ICML) 2023, 2023 | 12 | 2023 |
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards H Wang, Y Lin, W Xiong, R Yang, S Diao, S Qiu, H Zhao, T Zhang arXiv preprint arXiv:2402.18571, 2024 | 10 | 2024 |
Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning R Yang, J Lyu, Y Yang, J Yan, F Luo, D Luo, L Li, X Li arXiv preprint arXiv:2102.12962, 2021 | 8* | 2021 |
Corruption-Robust Offline Reinforcement Learning with General Function Approximation C Ye*, R Yang*, Q Gu, T Zhang Advances in Neural Information Processing Systems (NeurIPS) 2023, 2023 | 7 | 2023 |
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment R Yang, X Pan, F Luo, S Qiu, H Zhong, D Yu, J Chen International Conference on Machine Learning (ICML) 2024, 2024 | 6 | 2024 |
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness X Wen, X Yu, R Yang, C Bai, Z Wang arXiv preprint arXiv:2309.16973, 2023 | 5 | 2023 |
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models M Wang*, R Yang*, X Chen, H Sun, M Fang, M Giovanni Transactions on Machine Learning Research (TMLR) 2024, 2023 | 3 | 2023 |
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption R Yang*, H Zhong*, J Xu*, A Zhang, C Zhang, L Han, T Zhang International Conference on Learning Representations (ICLR) 2024, 2023 | 3 | 2023 |
Efficient multi-goal reinforcement learning via value consistency prioritization J Xu, S Li, R Yang, C Yuan, L Han Journal of Artificial Intelligence Research 77, 355-376, 2023 | 2 | 2023 |
Combining hindsight with goal-enhanced prediction for multi-goal reinforcement learning R Yang, F Luo, X Li 2021 IEEE 33rd International Conference on Tools with Artificial …, 2021 | 2 | 2021 |
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs R Yang, R Ding, Y Lin, H Zhang, T Zhang arXiv preprint arXiv:2406.10216, 2024 | | 2024 |
Robot control method, apparatus and device, storage medium and program product R Yang, L Li, D Luo US Patent App. 17/957,710, 2023 | | 2023 |