Constraints penalized Q-learning for safe offline reinforcement learning H Xu, X Zhan, X Zhu AAAI 2022, 2022 | 65 | 2022 |
Discriminator-weighted offline imitation learning from suboptimal demonstrations H Xu, X Zhan, H Yin, H Qin ICML 2022, 2022 | 60 | 2022 |
DeepThermal: Combustion optimization for thermal power generating units using offline reinforcement learning X Zhan, H Xu, Y Zhang, X Zhu, H Yin, Y Zheng AAAI 2022, 2022 | 60 | 2022 |
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization H Xu, L Jiang, J Li, Z Yang, Z Wang, VWK Chan, X Zhan ICLR 2023, 2023 | 54 | 2023 |
A Policy-Guided Imitation Approach for Offline Reinforcement Learning H Xu, L Jiang, J Li, X Zhan NeurIPS 2022, 2022 | 39 | 2022 |
When data geometry meets deep function: Generalizing offline reinforcement learning J Li, X Zhan, H Xu, X Zhu, J Liu, YQ Zhang ICLR 2023, 2023 | 30 | 2023 |
Offline reinforcement learning with soft behavior regularization H Xu, X Zhan, J Li, H Yin NeurIPS 2021 Offline Reinforcement Learning Workshop, 2021 | 27 | 2021 |
Model-based offline planning with trajectory pruning X Zhan, X Zhu, H Xu IJCAI 2022, 2021 | 26 | 2021 |
Mind the Gap: Offline Policy Optimization for Imperfect Rewards J Li, X Hu, H Xu, J Liu, X Zhan, QS Jia, YQ Zhang ICLR 2023, 2023 | 16 | 2023 |
SaFormer: A conditional sequence modeling approach to offline safe reinforcement learning Q Zhang, L Zhang, H Xu, L Shen, B Wang, Y Chang, X Wang, B Yuan, ... arXiv preprint arXiv:2301.12203, 2023 | 15 | 2023 |
Discriminator-Guided Model-Based Offline Imitation Learning W Zhang, H Xu, H Niu, P Cheng, M Li, H Zhang, G Zhou, X Zhan CoRL 2022, 2022 | 15 | 2022 |
Robust spatio-temporal purchase prediction via deep meta learning H Qin, S Ke, X Yang, H Xu, X Zhan, Y Zheng AAAI 2021, 2021 | 13 | 2021 |
PROTO: Iterative policy regularized offline-to-online reinforcement learning J Li, X Hu, H Xu, J Liu, X Zhan, YQ Zhang arXiv preprint arXiv:2305.15669, 2023 | 12 | 2023 |
ECoalVis: visual analysis of control strategies in coal-fired power plants S Liu, D Weng, Y Tian, Z Deng, H Xu, X Zhu, H Yin, X Zhan, Y Wu IEEE Transactions on Visualization and Computer Graphics 29 (1), 1091-1101, 2022 | 8 | 2022 |
Offline multi-agent reinforcement learning with implicit global-to-local value regularization X Wang, H Xu, Y Zheng, X Zhan NeurIPS 2023, 2023 | 6 | 2023 |
Offline reinforcement learning with imbalanced datasets L Jiang, S Cheng, J Qiu, H Xu, WK Chan, Z Ding arXiv preprint arXiv:2307.02752, 2023 | 3 | 2023 |
Curriculum goal-conditioned imitation for offline reinforcement learning X Feng, L Jiang, X Yu, H Xu, X Sun, J Wang, X Zhan, WK Chan IEEE Transactions on Games 16 (1), 102-112, 2022 | 3 | 2022 |
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update L Mao, H Xu, W Zhang, X Zhan ICLR 2024, 2024 | 2 | 2024 |
MetaFS: An Effective Wrapper Feature Selection via Meta Learning Z Pan, C Li, S Ke, H Xu, J Zhang, Y Yuan, Y Zheng | | |