关注
Fan-Ming Luo
Fan-Ming Luo
在 lamda.nju.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
A survey on model-based reinforcement learning
FM Luo, T Xu, H Lai, XH Chen, W Zhang, Y Yu
Science China Information Sciences 67 (2), 121101, 2024
922024
Offline Model-based Adaptable Policy Learning
XH Chen, Y Yu, Q Li, FM Luo, Z Qin, W Shang, J Ye
Advances in Neural Information Processing Systems 34, 8432-8443, 2021
372021
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy
FM Luo, S Jiang, Y Yu, Z Zhang, YF Zhang
Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7637-7646, 2022
282022
COVID-19 asymptomatic infection estimation
Y Yu, YR Liu, FM Luo, WW Tu, DC Zhan, G Yu, ZH Zhou
medRxiv, 2020.04. 19.20068072, 2020
262020
Improve generated adversarial imitation learning with reward variance regularization
YF Zhang, FM Luo, Y Yu
Machine Learning, 1-19, 2022
142022
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
FM Luo, T Xu, X Cao, Y Yu
arXiv preprint arXiv:2310.05422, 2023
72023
Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions
XH Chen, FM Luo, Y Yu, Q Li, Z Qin, W Shang, J Ye
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
72023
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games
RJ Qin, FM Luo, H Qian, Y Yu
arXiv preprint arXiv:2208.09452, 2022
12022
Model predictive complex system control from observational and interventional data
M Mou, Y Guo, F Luo, Y Yu, J Zhang
Chaos: An Interdisciplinary Journal of Nonlinear Science 34 (9), 2024
2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
FM Luo, Z Tu, Z Huang, Y Yu
arXiv preprint arXiv:2405.15384, 2024
2024
Limited Preference Aided Imitation Learning from Imperfect Demonstrations
X Cao, FM Luo, J Ye, T Xu, Z Zhang, Y Yu
Forty-first International Conference on Machine Learning, 2024
2024
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble
FM Luo, X Cao, RJ Qin, Y Yu
arXiv preprint arXiv:2206.00238, 2022
2022
系统目前无法执行此操作,请稍后再试。
文章 1–12