关注
Jie Liu (刘杰)
Jie Liu (刘杰)
在 link.cuhk.edu.hk 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Inception convolution with efficient dilation search
J Liu, C Li, F Liang, C Lin, M Sun, J Yan, W Ouyang, D Xu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
372021
Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization for Language Models
Z Zhou, J Liu, C Yang, J Shao, Y Liu, X Yue, W Ouyang, Y Qiao
arXiv e-prints, arXiv: 2310.03708, 2023
17*2023
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
C Li*, J Liu*, Y Zhang, Y Wei, Y Niu, Y Yang, Y Liu, W Ouyang
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2023), 2023
172023
Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
L Wang, J Liu, H Shao, W Wang, R Chen, Y Liu, SL Waslander
Robotics: Science and Systems (RSS 2023), 2023
132023
MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
G Bai*, J Liu*, X Bu, Y He, J Liu, Z Zhou, Z Lin, W Su, T Ge, B Zheng, ...
arXiv preprint arXiv:2402.14762, 2024
6*2024
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Y Wu*, J Liu*, X Bu, J Liu, Z Zhou, Y Zhang, C Zhang, Z Bai, H Chen, T Ge, ...
arXiv preprint arXiv:2402.14660, 2024
4*2024
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
G Zhang, S Qu, J Liu, C Zhang, C Lin, CL Yu, D Pan, E Cheng, J Liu, ...
arXiv preprint arXiv:2405.19327, 2024
22024
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
Z Zhou, J Liu, Z Dong, J Liu, C Yang, W Ouyang, Y Qiao
arXiv preprint arXiv:2402.12343, 2024
12024
Masked Pretraining for Multi-Agent Decision Making
J Liu, Y Zhang, C Li, C Yang, Y Yang, Y Liu, W Ouyang
arXiv preprint arXiv:2310.11846, 2023
12023
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning
C Li, R Jia, J Liu, Y Zhang, Y Niu, Y Yang, Y Liu, W Ouyang
Proceedings of the European Conference on Artificial Intelligence, 2023
12023
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
Z Zhou, Z Liu, J Liu, Z Dong, C Yang, Y Qiao
arXiv preprint arXiv:2405.19262, 2024
2024
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Y Zhang*, J Liu*, C Li, Y Niu, Y Yang, Y Liu, W Ouyang
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2024), 2023
2023
Adaptive Gradient Method with Resilience and Momentum
J Liu, C Lin, C Li, L Sheng, M Sun, J Yan, W Ouyang
arXiv preprint arXiv:2010.11041, 2020
2020
系统目前无法执行此操作,请稍后再试。
文章 1–13