Continual learning of large language models: A comprehensive survey

H Shi, Z Xu, H Wang, W Qin, W Wang, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent success of large language models (LLMs) trained on static, pre-collected,
general datasets has sparked numerous research directions and applications. One such …

Scibench: Evaluating college-level scientific problem-solving abilities of large language models

X Wang, Z Hu, P Lu, Y Zhu, J Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances in large language models (LLMs) have demonstrated notable progress on
many mathematical benchmarks. However, most of these benchmarks only feature problems …

Rest-mcts*: Llm self-training via process reward guided tree search

D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent methodologies in LLM self-training mostly rely on LLM generating responses and
filtering those with correct output answers as training data. This approach often yields a low …

Scientific large language models: A survey on biological & chemical domains

Q Zhang, K Ding, T Lyv, X Wang, Q Yin… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have emerged as a transformative power in enhancing
natural language comprehension, representing a significant stride toward artificial general …

A survey on knowledge distillation of large language models

X Xu, M Li, C Tao, T Shen, R Cheng, J Li, C Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
This survey presents an in-depth exploration of knowledge distillation (KD) techniques
within the realm of Large Language Models (LLMs), spotlighting the pivotal role of KD in …

A survey on symbolic knowledge distillation of large language models

K Acharya, A Velasquez… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
This survey article delves into the emerging and critical area of symbolic knowledge
distillation in large language models (LLMs). As LLMs such as generative pretrained …

Sciagent: Tool-augmented language models for scientific reasoning

Y Ma, Z Gou, J Hao, R Xu, S Wang, L Pan… - arXiv preprint arXiv …, 2024 - arxiv.org
Scientific reasoning poses an excessive challenge for even the most advanced Large
Language Models (LLMs). To make this task more practical and solvable for LLMs, we …

AutoRE: document-level relation extraction with large language models

L Xue, D Zhang, Y Dong, J Tang - … of the 62nd Annual Meeting of …, 2024 - aclanthology.org
Abstract Large Language Models (LLMs) have demonstrated exceptional abilities in
comprehending and generating text, motivating numerous researchers to utilize them for …

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

S Li, J Huang, J Zhuang, Y Shi, X Cai, M Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Scientific literature understanding is crucial for extracting targeted information and garnering
insights, thereby significantly advancing scientific discovery. Despite the remarkable …

Chimera: Improving generalist model with domain-specific experts

T Peng, M Li, H Zhou, R Xia, R Zhang, L Bai… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in Large Multi-modal Models (LMMs) underscore the importance of
scaling by increasing image-text paired data, achieving impressive performance on general …