Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding S Sun, GM Goldgof, A Schubert, Z Sun, T Hartvigsen, AJ Butte, A Alaa arXiv preprint arXiv:2405.19567, 2024 | | 2024 |
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery P Ma, TH Wang, M Guo, Z Sun, JB Tenenbaum, D Rus, C Gan, W Matusik Forty-first International Conference on Machine Learning (ICML), 2024 | | 2024 |
Self-Play Preference Optimization for Language Model Alignment Y Wu, Z Sun, H Yuan, K Ji, Y Yang, Q Gu arXiv preprint arXiv:2405.00675, 2024 | 15 | 2024 |
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward R Zhang, L Gui, Z Sun, Y Feng, K Xu, Y Zhang, D Fu, C Li, A Hauptmann, ... arXiv preprint arXiv:2404.01258, 2024 | | 2024 |
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning Z Chen, Q Zhou, Y Shen, Y Hong, Z Sun, D Gutfreund, C Gan Proceedings of the AAAI Conference on Artificial Intelligence 38 (2), 1254-1262, 2024 | 2 | 2024 |
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Z Sun, L Yu, Y Shen, W Liu, Y Yang, S Welleck, C Gan arXiv preprint arXiv:2403.09472, 2024 | 5 | 2024 |
HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild Z Zhu, Z Sun, Y Yang arXiv preprint arXiv:2403.04307, 2024 | 1 | 2024 |
Instruction-tuned Language Models are Better Knowledge Learners Z Jiang, Z Sun, W Shi, P Rodriguez, C Zhou, G Neubig, XV Lin, W Yih, ... 2024 Annual Conference of the Association for Computational Linguistics, 2024 | 2 | 2024 |
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble S Zhang, Z Chen, S Chen, Y Shen, Z Sun, C Gan arXiv preprint arXiv:2401.16635, 2024 | 2 | 2024 |
SALMON: Self-Alignment with Instructable Reward Models Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen, DD Cox, Y Yang, C Gan The Twelfth International Conference on Learning Representations, 2024 | 31* | 2024 |
Aligning Large Multimodal Models with Factually Augmented RLHF Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ... 2024 Annual Conference of the Association for Computational Linguistics …, 2023 | 90 | 2023 |
Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation J Huang, Z Sun, Y Yang arXiv preprint arXiv:2308.06644, 2023 | | 2023 |
Active Retrieval Augmented Generation Z Jiang, FF Xu, L Gao, Z Sun, Q Liu, J Dwivedi-Yu, Y Yang, J Callan, ... Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 | 170 | 2023 |
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan Advances in Neural Information Processing Systems (NeurIPS), 2023 | 198 | 2023 |
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization Z Sun, Y Yang Advances in Neural Information Processing Systems (NeurIPS), 2023 | 51 | 2023 |
A Neural PDE Solver with Temporal Stencil Modeling Z Sun, Y Yang, S Yoo Fortieth International Conference on Machine Learning (ICML), 2023 | 6 | 2023 |
Recitation-Augmented Language Models Z Sun, X Wang, Y Tay, Y Yang, D Zhou International Conference on Learning Representations (ICLR), 2023 | 77 | 2023 |
Bloom: A 176b-parameter open-access multilingual language model TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022 | 1308 | 2022 |
DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems R Qiu, Z Sun, Y Yang Advances in Neural Information Processing Systems (NeurIPS), 2022 | 52 | 2022 |
Sparse Attention with Learning to Hash Z Sun, Y Yang, S Yoo International Conference on Learning Representations (ICLR), 2022 | 17 | 2022 |