Jinyi Liu
Verified email at tju.edu.cn
Title
Cited by
Year
Exploration in deep reinforcement learning: From single-agent to multiagent domain
J Hao, T Yang, H Tang, C Bai, J Liu, Z Meng, P Liu, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2023
Cited by 172* · 2023
Euclid: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model
Y Yuan, J Hao, F Ni, Y Mu, Y Zheng, Y Hu, J Liu, Y Chen, C Fan
arXiv preprint arXiv:2210.00498, 2022
Cited by 10 · 2022
FIGCPS: Effective failure-inducing input generation for cyber-physical systems with deep reinforcement learning
S Zhang, S Liu, J Sun, Y Chen, W Huang, J Liu, J Liu, J Hao
2021 36th IEEE/ACM International Conference on Automated Software …, 2021
Cited by 10 · 2021
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
K Zhao, Y Ma, J Liu, HAO Jianye, Y Zheng, Z Meng
ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 2023
Cited by 8* · 2023
Ovd-explorer: Optimism should not be the sole pursuit of exploration in noisy environments
J Liu, Z Wang, Y Zheng, J Hao, C Bai, J Ye, Z Wang, H Piao, Y Sun
Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 13954 …, 2024
Cited by 3 · 2024
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models
Y Chen, Y Yuan, Z Zhang, Y Zheng, J Liu, F Ni, J Hao
arXiv preprint arXiv:2403.03636, 2024
Cited by 2 · 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Y Yuan, J Hao, Y Ma, Z Dong, H Liang, J Liu, Z Feng, K Zhao, Y Zheng
arXiv preprint arXiv:2402.02423, 2024
Cited by 2 · 2024
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning
J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv, C Fan
Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024
Cited by 1 · 2024
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models
J Liu, Y Yuan, J Hao, F Ni, L Fu, Y Chen, Y Zheng
arXiv preprint arXiv:2402.14245, 2024
Cited by 1 · 2024
ED2: an environment dynamics decomposition framework for world model construction
C Wang, T Yang, HAO Jianye, Y Zheng, H Tang, F Barez, J Liu, J Peng, ...
Cited by 1 · 2021
A Policy-Decoupled Method for High-Quality Data Augmentation in Offline Reinforcement Learning
S Lian, Y Ma, J Liu, HAO Jianye, Y Zheng, Z Meng
ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems
Cited by 1*
CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis
Y Xiao, J Liu, Y Zheng, X Xie, J Hao, M Li, R Wang, F Ni, Y Li, J Luo, ...
bioRxiv, 2024.05.13.593861, 2024
2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
Y Zhu, J Liu, W Wei, Q Fu, Y Hu, Z Fang, B An, J Hao, T Lv, C Fan
arXiv preprint arXiv:2405.08638, 2024
2024
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
K Zhao, J Hao, Y Ma, J Liu, Y Zheng, Z Meng
Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024
2024
vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms
Y Zhu, J Liu, W Wei, Q Fu, Y Hu, Z Fang, B An, J Hao, T Lv, C Fan
Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024
2024
OSCAR: OOD State-Conservative Offline Reinforcement Learning for Sequential Decision Making
Y Ma, C Wang, C Chen, J Liu, Z Meng, Y Zheng, J Hao
CAAI Artificial Intelligence Research 2, 2023
2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
J Liu, Y Ma, J Hao, Y Hu, Y Zheng, T Lv, C Fan
Data-centric Machine Learning Research (DMLR) Workshop at ICML 2023, 2023
2023
Articles 1–17