A survey of multimodal large language model from a data-centric perspective T Bai, H Liang, B Wan, Y Xu, X Li, S Li, L Yang, B Li, Y Wang, B Cui, ... arXiv preprint arXiv:2405.16640, 2024 | 21 | 2024 |
Cfbench: A comprehensive constraints-following benchmark for llms T Zhang, Y Shen, W Luo, Y Zhang, H Liang, F Yang, M Lin, Y Qiao, ... arXiv preprint arXiv:2408.01122, 2024 | 5 | 2024 |
Synth-empathy: Towards high-quality synthetic empathy data H Liang, L Sun, J Wei, X Huang, L Sun, B Yu, C He, W Zhang arXiv preprint arXiv:2407.21669, 2024 | 3 | 2024 |
Mathscape: Evaluating mllms in multimodal math scenarios through a hierarchical benchmark M Zhou, H Liang, T Li, Z Wu, M Lin, L Sun, Y Zhou, Y Zhang, X Huang, ... arXiv preprint arXiv:2408.07543, 2024 | 2 | 2024 |
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System M Zheng, H Liang, F Yang, H Sun, T Li, L Xiong, Y Zhang, Y Wu, K Li, ... arXiv preprint arXiv:2407.06027, 2024 | 2 | 2024 |
Keyvideollm: Towards large-scale video keyframe selection H Liang, J Li, T Bai, X Huang, L Sun, Z Wang, C He, B Cui, C Chen, ... arXiv preprint arXiv:2407.03104, 2024 | 2 | 2024 |
Efficient-Empathy: Towards Efficient and Effective Selection of Empathy Data L Sun, H Liang, J Wei, L Sun, B Yu, B Cui, W Zhang arXiv preprint arXiv:2407.01937, 2024 | 2 | 2024 |
Synthvlm: High-efficiency and high-quality synthetic data for vision language models Z Liu, H Liang, X Huang, W Xiong, Q Yu, L Sun, C Chen, C He, B Cui, ... arXiv preprint arXiv:2407.20756, 2024 | 1 | 2024 |
EVQAScore: Efficient Video Question Answering Data Evaluation H Liang, Z Chen, W Zhang arXiv preprint arXiv:2411.06908, 2024 | | 2024 |
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction Q Zhang, VSJ Huang, B Wang, J Zhang, Z Wang, H Liang, S Wang, M Lin, ... arXiv preprint arXiv:2410.21169, 2024 | | 2024 |
Baichuan Alignment Technical Report M Lin, F Yang, Y Shen, H Sun, T Li, T Zhang, C Zhu, M Zheng, X Li, ... arXiv preprint arXiv:2410.14940, 2024 | | 2024 |
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning M Chen, H Sun, T Li, F Yang, H Liang, K Lu, B Cui, W Zhang, Z Zhou, ... arXiv preprint arXiv:2410.12952, 2024 | | 2024 |
Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models B Li, H Liang, Y Li, F Fu, H Yin, C He, W Zhang arXiv preprint arXiv:2410.05802, 2024 | | 2024 |
Data Proportion Detection for Optimized Data Management for Large Language Models H Liang, K Zhao, Y Yang, B Cui, G Dong, Z Zhou, W Zhang arXiv preprint arXiv:2409.17527, 2024 | | 2024 |
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search L Sun, H Liang, J Wei, B Yu, C He, Z Zhou, W Zhang arXiv preprint arXiv:2409.17972, 2024 | | 2024 |
Are Bigger Encoders Always Better in Vision Large Models? B Li, H Liang, Z Meng, W Zhang arXiv preprint arXiv:2408.00620, 2024 | | 2024 |