A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L Jin - arXiv preprint arXiv:2402.18041, 2024 - arxiv.org
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

Dress: Instructing large vision-language models to align and interact with humans via natural language feedback

Y Chen, K Sikka, M Cogswell, H Ji… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present DRESS a large vision language model (LVLM) that innovatively exploits Natural
Language feedback (NLF) from Large Language Models to enhance its alignment and …

Openagents: An open platform for language agents in the wild

T Xie, F Zhou, Z Cheng, P Shi, L Weng, Y Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Language agents show potential in being capable of utilizing natural language for varied
and intricate tasks in diverse environments, particularly when built upon large language …

Chatgpt's one-year anniversary: are open-source large language models catching up?

H Chen, F Jiao, X Li, C Qin, M Ravaut, R Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Upon its release in late 2022, ChatGPT has brought a seismic shift in the entire landscape of
AI, both in research and commerce. Through instruction-tuning a large language model …

Removing rlhf protections in gpt-4 via fine-tuning

Q Zhan, R Fang, R Bindu, A Gupta, T Hashimoto… - arXiv preprint arXiv …, 2023 - arxiv.org
As large language models (LLMs) have increased in their capabilities, so does their
potential for dual use. To reduce harmful outputs, produces and vendors of LLMs have used …

Lemur: Harmonizing natural language and code for language agents

Y Xu, H Su, C Xing, B Mi, Q Liu, W Shi, B Hui… - arXiv preprint arXiv …, 2023 - arxiv.org
We introduce Lemur and Lemur-Chat, openly accessible language models optimized for
both natural language and coding capabilities to serve as the backbone of versatile …

If llm is the wizard, then code is the wand: A survey on how code empowers large language models to serve as intelligent agents

K Yang, J Liu, J Wu, C Yang, YR Fung, S Li… - arXiv preprint arXiv …, 2024 - arxiv.org
The prominent large language models (LLMs) of today differ from past language models not
only in size, but also in the fact that they are trained on a combination of natural language …

Llama pro: Progressive llama with block expansion

C Wu, Y Gan, Y Ge, Z Lu, J Wang, Y Feng… - arXiv preprint arXiv …, 2024 - arxiv.org
Humans generally acquire new skills without compromising the old; however, the opposite
holds for Large Language Models (LLMs), eg, from LLaMA to CodeLLaMA. To this end, we …