关注
Kai Yang
Kai Yang
Tsinghua Shenzhen International Graduate School
在 mails.tsinghua.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Using human feedback to fine-tune diffusion models without any reward model
K Yang, J Tao, J Lyu, C Ge, J Chen, W Shen, X Zhu, X Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
122024
Exploration and anti-exploration with distributional random network distillation
K Yang, J Tao, J Lyu, X Li
arXiv preprint arXiv:2401.09750, 2024
32024
A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation
A Gong, K Yang, J Lyu, X Li
arXiv preprint arXiv:2407.00496, 2024
2024
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning
Z Liu, K Yang, X Li
arXiv preprint arXiv:2406.07541, 2024
2024
A novel ensemble approach for road traffic carbon emission prediction: a case in Canada
Y Liu, C Tang, A Zhou, K Yang
Environment, Development and Sustainability, 1-37, 2024
2024
BATON: Aligning Text-to-Audio Model with Human Preference Feedback
H Liao, H Han, K Yang, T Du, R Yang, Z Xu, Q Xu, J Liu, J Lu, X Li
arXiv preprint arXiv:2402.00744, 2024
2024
GTLMA: Generalizable Hierarchical Learning for Tasks with Variable Entities
K Yang, A Gong, J Tao, Y Zhang, X Li
2023 International Conference on Frontiers of Robotics and Software …, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–7