InternLM2 technical report

Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked
discussions on the advent of Artificial General Intelligence (AGI). However, replicating such …

ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?

H Chen, F Jiao, X Li, C Qin, M Ravaut, R Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Upon its release in late 2022, ChatGPT has brought a seismic shift in the entire landscape of
AI, both in research and commerce. Through instruction-tuning a large language model …

Augmenting interpretable models with large language models during training

C Singh, A Askari, R Caruana, J Gao - Nature Communications, 2023 - nature.com
Recent large language models (LLMs), such as ChatGPT, have demonstrated remarkable
prediction performance for a growing array of tasks. However, their proliferation into high …

AutoML-GPT: Automatic machine learning with GPT

S Zhang, C Gong, L Wu, X Liu, M Zhou - arXiv preprint arXiv:2305.02499, 2023 - arxiv.org
AI tasks encompass a wide range of domains and fields. While numerous AI models have
been designed for specific tasks and applications, they often require considerable human …

OverPrompt: Enhancing ChatGPT capabilities through an efficient in-context learning approach

J Li, R Zhao, Y He, L Gui - arXiv preprint arXiv:2305.14973, 2023 - arxiv.org
The exceptional performance of pre-trained large language models has revolutionised
various applications, but their adoption in production environments is hindered by …

LlamaFactory: Unified efficient fine-tuning of 100+ language models

Y Zheng, R Zhang, J Zhang, Y Ye, Z Luo - arXiv preprint arXiv:2403.13372, 2024 - arxiv.org
Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks.
However, it requires non-trivial efforts to implement these methods on different models. We …

Can we trust the evaluation on ChatGPT?

R Aiyappa, J An, H Kwak, YY Ahn - arXiv preprint arXiv:2303.12767, 2023 - arxiv.org
ChatGPT, the first large language model (LLM) with mass adoption, has demonstrated
remarkable performance in numerous natural language tasks. Despite its evident …

ChatGPT in the age of generative AI and large language models: a concise survey

S Mohamadi, G Mujtaba, N Le, G Doretto… - arXiv preprint arXiv …, 2023 - arxiv.org
ChatGPT is a large language model (LLM) created by OpenAI that has been carefully
trained on a large amount of data. It has revolutionized the field of natural language …

ChatGPT: Fundamentals, applications and social impacts

M Abdullah, A Madain… - 2022 Ninth International …, 2022 - ieeexplore.ieee.org
Recent progress in large language models has pushed the boundaries of natural language
processing, setting new standards for performance. It is remarkable how artificial intelligence …

CPM-2: Large-scale cost-effective pre-trained language models

Z Zhang, Y Gu, X Han, S Chen, C Xiao, Z Sun, Y Yao… - AI Open, 2021 - Elsevier
In recent years, the size of pre-trained language models (PLMs) has grown by leaps and
bounds. However, efficiency issues of these large-scale PLMs limit their utilization in real …