Xiangyu Zhu
Institute for AI Industry Research, Tsinghua University
Verified email at air.tsinghua.edu.cn
Title | Cited by | Year
Constraints penalized Q-learning for safe offline reinforcement learning
H Xu, X Zhan, X Zhu
Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8753-8760, 2022
Cited by: 64; Year: 2022
DeepThermal: Combustion optimization for thermal power generating units using offline reinforcement learning
X Zhan, H Xu, Y Zhang, Y Huo, X Zhu, H Yin, Y Zheng
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 4680-4688, 2022
Cited by: 59; Year: 2022
Model-based offline planning with trajectory pruning
X Zhan, X Zhu, H Xu
Proceedings of the Thirty-First International Joint Conference on Artificial …, 2022
Cited by: 26; Year: 2022
Three-layer graph framework with the sumD feature for alpha matting
C Li, P Wang, X Zhu, H Pi
Computer Vision and Image Understanding 162, 34-45, 2017
Cited by: 25; Year: 2017
When data geometry meets deep function: Generalizing offline reinforcement learning
J Li, X Zhan, H Xu, X Zhu, J Liu, YQ Zhang
The Eleventh International Conference on Learning Representations, 2023
Cited by: 19; Year: 2023
Distance-sensitive offline reinforcement learning
J Li, X Zhan, H Xu, X Zhu, J Liu, YQ Zhang
arXiv preprint arXiv:2205.11027 3, 2022
Cited by: 10; Year: 2022
ECoalVis: visual analysis of control strategies in coal-fired power plants
S Liu, D Weng, Y Tian, Z Deng, H Xu, X Zhu, H Yin, X Zhan, Y Wu
IEEE Transactions on Visualization and Computer Graphics 29 (1), 1091-1101, 2022
Cited by: 8; Year: 2022
Adaptive propagation matting based on transparency of image
X Zhu, P Wang, Z Huang
Multimedia Tools and Applications 77, 19089-19112, 2018
Cited by: 7; Year: 2018
H2O+: an improved framework for hybrid offline-and-online RL with dynamics gaps
H Niu, T Ji, B Liu, H Zhao, X Zhu, J Zheng, P Huang, G Zhou, J Hu, X Zhan
arXiv preprint arXiv:2309.12716, 2023
Cited by: 1; Year: 2023
Articles 1–9