MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL F Ni, J Hao, Y Mu, Y Yuan, Y Zheng, B Wang, Z Liang The 40th International Conference on Machine Learning (ICML), 2023 | 25 | 2023 |
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Z Dong*, Y Yuan*, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, T Lv, C Fan, Z Hu The 12th International Conference on Learning Representations (ICLR), 2023 | 15 | 2023 |
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, J Liu, Y Chen, C Fan The 11th International Conference on Learning Representations (ICLR), 2022 | 11 | 2022 |
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models Y Chen*, Y Yuan*, Z Zhang, Y Zheng, J Liu, F Ni, J Hao arXiv preprint arXiv:2403.03636, 2024 | 6 | 2024 |
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback Y Yuan, J Hao, Y Ma, Z Dong, H Liang, J Liu, Z Feng, K Zhao, Y Zheng The 12th International Conference on Learning Representations (ICLR), 2024 | 6 | 2024 |
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models J Liu*, Y Yuan*, J Hao, F Ni, L Fu, Y Chen, Y Zheng arXiv preprint arXiv:2402.14245, 2024 | 4 | 2024 |
DiffuserLite: Towards Real-time Diffusion Planning Z Dong, J Hao, Y Yuan, F Ni, Y Wang, P Li, Y Zheng arXiv preprint arXiv:2401.15443, 2024 | 3 | 2024 |
A Method on Searching Better Activation Functions H Sun, Z Wu, B Xia, P Chang, Z Dong, Y Yuan, Y Chang, X Wang arXiv preprint arXiv:2405.12954, 2024 | 2 | 2024 |
KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations L Kou, F Ni, Y Zheng, J Liu, Y Yuan, Z Dong, HAO Jianye The 41st International Conference on Machine Learning (ICML), 0 | 1 | |
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning Y Yuan, Z Zheng, Z Dong, J Hao arXiv preprint arXiv:2408.15501, 2024 | | 2024 |
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making Z Dong*, Y Yuan*, J Hao, F Ni, Y Ma, P Li, Y Zheng arXiv preprint arXiv:2406.09509, 2024 | | 2024 |
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint X Zhou, Y Yuan, S Yang, J Hao arXiv preprint arXiv:2402.14244, 2024 | | 2024 |
ED2: Environment Dynamics Decomposition World Models for Continuous Control J Hao, Y Yuan, C Wang, Z Wang arXiv preprint arXiv: 2112.02817, 2023 | | 2023 |