关注
Qingyuan Wu
Qingyuan Wu
在 liverpool.ac.uk 的电子邮件经过验证
标题
引用次数
引用次数
年份
State-wise safe reinforcement learning with pixel observations
SS Zhan, Y Wang, Q Wu, R Jiao, C Huang, Q Zhu
6th Learning for Dynamics & Control Conference (L4DC 2024), 2023
52023
Highway reinforcement learning
Y Wang, M Strupl, F Faccio, Q Wu, H Liu, M Grudzień, X Tan, ...
arXiv preprint arXiv:2405.18289, 2024
22024
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays
Q Wu, SS Zhan, Y Wang, Y Wang, CW Lin, C Lv, Q Zhu, J Schmidhuber, ...
Forty-first International Conference on Machine Learning, 2024
1*2024
Highway Value Iteration Networks
Y Wang, W Li, F Faccio, Q Wu, J Schmidhuber
Forty-first International Conference on Machine Learning, 2024
12024
Greedy-Step Off-Policy Reinforcement Learning
Y Wang, Q Wu, P He, X Tan
arXiv preprint arXiv:2102.11717, 2021
12021
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
Y Wang, Q Wu, W Li, DR Ashley, F Faccio, C Huang, J Schmidhuber
arXiv preprint arXiv:2406.08404, 2024
2024
Variational Delayed Policy Optimization
Q Wu, SS Zhan, Y Wang, Y Wang, CW Lin, C Lv, Q Zhu, C Huang
arXiv preprint arXiv:2405.14226, 2024
2024
Learning Downstream Task by Selectively Capturing Complementary Knowledge from Multiple Self-supervisedly Learning Pretexts
J Yao, Q Wu, Q Feng, S Chen
arXiv preprint arXiv:2204.05248, 2022
2022
Expected-Max Ensembled Q-learning with Temporally-Varying Exploration
Q Wu, Y Wang
2022
系统目前无法执行此操作,请稍后再试。
文章 1–9