Title | Authors | Venue | Cited by | Year |
Using human feedback to fine-tune diffusion models without any reward model | K Yang, J Tao, J Lyu, C Ge, J Chen, W Shen, X Zhu, X Li | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 12 | 2024 |
Exploration and anti-exploration with distributional random network distillation | K Yang, J Tao, J Lyu, X Li | arXiv preprint arXiv:2401.09750, 2024 | 3 | 2024 |
A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation | A Gong, K Yang, J Lyu, X Li | arXiv preprint arXiv:2407.00496, 2024 | | 2024 |
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Z Liu, K Yang, X Li | arXiv preprint arXiv:2406.07541, 2024 | | 2024 |
A novel ensemble approach for road traffic carbon emission prediction: a case in Canada | Y Liu, C Tang, A Zhou, K Yang | Environment, Development and Sustainability, 1-37, 2024 | | 2024 |
BATON: Aligning Text-to-Audio Model with Human Preference Feedback | H Liao, H Han, K Yang, T Du, R Yang, Z Xu, Q Xu, J Liu, J Lu, X Li | arXiv preprint arXiv:2402.00744, 2024 | | 2024 |
GTLMA: Generalizable Hierarchical Learning for Tasks with Variable Entities | K Yang, A Gong, J Tao, Y Zhang, X Li | 2023 International Conference on Frontiers of Robotics and Software …, 2023 | | 2023 |