关注
Zhiqing Sun
Zhiqing Sun
OpenAI
在 openai.com 的电子邮件经过验证 - 首页
标题
引用次数
年份
Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding
S Sun, GM Goldgof, A Schubert, Z Sun, T Hartvigsen, AJ Butte, A Alaa
arXiv preprint arXiv:2405.19567, 2024
2024
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
P Ma, TH Wang, M Guo, Z Sun, JB Tenenbaum, D Rus, C Gan, W Matusik
Forty-first International Conference on Machine Learning (ICML), 2024
2024
Self-Play Preference Optimization for Language Model Alignment
Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu
arXiv preprint arXiv:2405.00675, 2024
152024
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
R Zhang, L Gui, Z Sun, Y Feng, K Xu, Y Zhang, D Fu, C Li, A Hauptmann, ...
arXiv preprint arXiv:2404.01258, 2024
2024
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning
Z Chen, Q Zhou, Y Shen, Y Hong, Z Sun, D Gutfreund, C Gan
Proceedings of the AAAI Conference on Artificial Intelligence 38 (2), 1254-1262, 2024
22024
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Z Sun, L Yu, Y Shen, W Liu, Y Yang, S Welleck, C Gan
arXiv preprint arXiv:2403.09472, 2024
52024
HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
Z Zhu, Z Sun, Y Yang
arXiv preprint arXiv:2403.04307, 2024
12024
Instruction-tuned Language Models are Better Knowledge Learners
Z Jiang, Z Sun, W Shi, P Rodriguez, C Zhou, G Neubig, XV Lin, W Yih, ...
2024 Annual Conference of the Association for Computational Linguistics, 2024
22024
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
S Zhang, Z Chen, S Chen, Y Shen, Z Sun, C Gan
arXiv preprint arXiv:2401.16635, 2024
22024
SALMON: Self-Alignment with Instructable Reward Models
Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, DD Cox, Y Yang, C Gan
The Twelfth International Conference on Learning Representations, 2024
31*2024
Aligning Large Multimodal Models with Factually Augmented RLHF
Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ...
2024 Annual Conference of the Association for Computational Linguistics …, 2023
902023
Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation
J Huang, Z Sun, Y Yang
arXiv preprint arXiv:2308.06644, 2023
2023
Active Retrieval Augmented Generation
Z Jiang, FF Xu, L Gao, Z Sun, Q Liu, J Dwivedi-Yu, Y Yang, J Callan, ...
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
1702023
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan
Advances in Neural Information Processing Systems (NeurIPS), 2023
1982023
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization
Z Sun, Y Yang
Advances in Neural Information Processing Systems (NeurIPS), 2023
512023
A Neural PDE Solver with Temporal Stencil Modeling
Z Sun, Y Yang, S Yoo
Fortieth International Conference on Machine Learning (ICML), 2023
62023
Recitation-Augmented Language Models
Z Sun, X Wang, Y Tay, Y Yang, D Zhou
International Conference on Learning Representations (ICLR), 2023
772023
Bloom: A 176b-parameter open-access multilingual language model
TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
arXiv preprint arXiv:2211.05100, 2022
13082022
DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems
R Qiu, Z Sun, Y Yang
Advances in Neural Information Processing Systems (NeurIPS), 2022
522022
Sparse Attention with Learning to Hash
Z Sun, Y Yang, S Yoo
International Conference on Learning Representations (ICLR), 2022
172022
系统目前无法执行此操作,请稍后再试。
文章 1–20