- 学术资源搜索

Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

被引用次数：454 相关文章所有 3 个版本

[PDF] arxiv.org

Parameter-efficient fine-tuning for large models: A comprehensive survey

Z Han, C Gao, J Liu, J Zhang, SQ Zhang - arXiv preprint arXiv:2403.14608, 2024 - arxiv.org

Large models represent a groundbreaking advancement in multiple application fields,
enabling remarkable achievements across various tasks. However, their unprecedented …

被引用次数：203 相关文章所有 2 个版本

[PDF] arxiv.org

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org

Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

被引用次数：3342 相关文章所有 4 个版本

[PDF] arxiv.org

Phi-3 technical report: A highly capable language model locally on your phone

M Abdin, J Aneja, H Awadalla, A Awadallah… - arXiv preprint arXiv …, 2024 - arxiv.org

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion
tokens, whose overall performance, as measured by both academic benchmarks and …

被引用次数：664 相关文章所有 2 个版本

[PDF] arxiv.org

Qwen technical report

J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng… - arXiv preprint arXiv …, 2023 - arxiv.org

Large language models (LLMs) have revolutionized the field of artificial intelligence,
enabling natural language processing tasks that were previously thought to be exclusive to …

被引用次数：1700 相关文章所有 2 个版本

[PDF] arxiv.org

Llamafactory: Unified efficient fine-tuning of 100+ language models

Y Zheng, R Zhang, J Zhang, Y Ye, Z Luo… - arXiv preprint arXiv …, 2024 - arxiv.org

Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks.
However, it requires non-trivial efforts to implement these methods on different models. We …

被引用次数：219 相关文章所有 2 个版本

[PDF] amax.com

Mistral 7B

AQ Jiang, A Sablayrolles, A Mensch, C Bamford… - arXiv preprint arXiv …, 2023 - arxiv.org

We introduce Mistral 7B v0. 1, a 7-billion-parameter language model engineered for
superior performance and efficiency. Mistral 7B outperforms Llama 2 13B across all …

被引用次数：1123 相关文章所有 3 个版本

[PDF] acm.org

Recommendation as instruction following: A large language model empowered recommendation approach

J Zhang, R Xie, Y Hou, X Zhao, L Lin… - ACM Transactions on …, 2023 - dl.acm.org

In the past decades, recommender systems have attracted much attention in both research
and industry communities. Existing recommendation models mainly learn the underlying …

被引用次数：195 相关文章所有 2 个版本

[PDF] aclanthology.org

Math-shepherd: Verify and reinforce llms step-by-step without human annotations

P Wang, L Li, Z Shao, R Xu, D Dai, Y Li… - Proceedings of the …, 2024 - aclanthology.org

In this paper, we present an innovative process-oriented math process reward model called
Math-shepherd, which assigns a reward score to each step of math problem solutions. The …

被引用次数：89 相关文章

[PDF] arxiv.org

Efficient large language models: A survey

Z Wan, X Wang, C Liu, S Alam, Y Zheng, J Liu… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) have demonstrated remarkable capabilities in important
tasks such as natural language understanding and language generation, and thus have the …

被引用次数：118 相关文章所有 7 个版本

高级搜索

QQ 群

Challenges and applications of large language models

Parameter-efficient fine-tuning for large models: A comprehensive survey

A survey of large language models

Phi-3 technical report: A highly capable language model locally on your phone

Qwen technical report

Llamafactory: Unified efficient fine-tuning of 100+ language models

Mistral 7B

Recommendation as instruction following: A large language model empowered recommendation approach

Math-shepherd: Verify and reinforce llms step-by-step without human annotations

Efficient large language models: A survey

引用