A survey on model-based reinforcement learning FM Luo, T Xu, H Lai, XH Chen, W Zhang, Y Yu Science China Information Sciences 67 (2), 121101, 2024 | 92 | 2024 |
Offline Model-based Adaptable Policy Learning XH Chen, Y Yu, Q Li, FM Luo, Z Qin, W Shang, J Ye Advances in Neural Information Processing Systems 34, 8432-8443, 2021 | 37 | 2021 |
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy FM Luo, S Jiang, Y Yu, Z Zhang, YF Zhang Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7637-7646, 2022 | 28 | 2022 |
COVID-19 asymptomatic infection estimation Y Yu, YR Liu, FM Luo, WW Tu, DC Zhan, G Yu, ZH Zhou medRxiv, 2020.04. 19.20068072, 2020 | 26 | 2020 |
Improve generated adversarial imitation learning with reward variance regularization YF Zhang, FM Luo, Y Yu Machine Learning, 1-19, 2022 | 14 | 2022 |
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning FM Luo, T Xu, X Cao, Y Yu arXiv preprint arXiv:2310.05422, 2023 | 7 | 2023 |
Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions XH Chen, FM Luo, Y Yu, Q Li, Z Qin, W Shang, J Ye IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 7 | 2023 |
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games RJ Qin, FM Luo, H Qian, Y Yu arXiv preprint arXiv:2208.09452, 2022 | 1 | 2022 |
Model predictive complex system control from observational and interventional data M Mou, Y Guo, F Luo, Y Yu, J Zhang Chaos: An Interdisciplinary Journal of Nonlinear Science 34 (9), 2024 | | 2024 |
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate FM Luo, Z Tu, Z Huang, Y Yu arXiv preprint arXiv:2405.15384, 2024 | | 2024 |
Limited Preference Aided Imitation Learning from Imperfect Demonstrations X Cao, FM Luo, J Ye, T Xu, Z Zhang, Y Yu Forty-first International Conference on Machine Learning, 2024 | | 2024 |
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble FM Luo, X Cao, RJ Qin, Y Yu arXiv preprint arXiv:2206.00238, 2022 | | 2022 |