A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

Recent advances in deep learning based dialogue systems: A systematic survey

J Ni, T Young, V Pandelea, F Xue… - Artificial intelligence review, 2023 - Springer
Dialogue systems are a popular natural language processing (NLP) task, as they are promising for
real-life applications. They are also complicated, since many NLP tasks deserving study are …

Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

Diffusion-LM improves controllable text generation

X Li, J Thickstun, I Gulrajani… - Advances in Neural …, 2022 - proceedings.neurips.cc
Controlling the behavior of language models (LMs) without re-training is a major open
problem in natural language generation. While recent works have demonstrated successes …
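
To make the idea behind the title concrete: Diffusion-LM runs a Gaussian diffusion process over continuous word-embedding latents and decodes by rounding each latent back to its nearest word embedding. The sketch below illustrates only that forward-noising and rounding machinery; the schedule, dimensions, and toy setup are assumptions for illustration, not the paper's implementation.

# Illustrative sketch of continuous diffusion over word embeddings (not the paper's code).
import torch
import torch.nn as nn

vocab_size, dim, T = 1000, 16, 100
embed = nn.Embedding(vocab_size, dim)            # maps tokens to continuous latents x_0

# Linear noise schedule; alpha_bar[t] is the cumulative product of (1 - beta).
betas = torch.linspace(1e-4, 0.02, T)
alpha_bar = torch.cumprod(1.0 - betas, dim=0)

def forward_noise(x0, t):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(alpha_bar_t) x_0, (1 - alpha_bar_t) I)."""
    eps = torch.randn_like(x0)
    a = alpha_bar[t].sqrt().view(-1, 1, 1)
    s = (1.0 - alpha_bar[t]).sqrt().view(-1, 1, 1)
    return a * x0 + s * eps, eps

def round_to_words(x):
    """Decode by snapping each latent to its nearest word embedding (the rounding step)."""
    dists = torch.cdist(x.reshape(-1, dim), embed.weight)     # (batch*seq, vocab)
    return dists.argmin(dim=-1).view(x.shape[0], x.shape[1])

tokens = torch.randint(0, vocab_size, (2, 8))
x0 = embed(tokens)
t = torch.randint(0, T, (2,))
xt, eps = forward_noise(x0, t)                   # training pairs for a denoiser
print(round_to_words(x0).equal(tokens))          # rounding clean latents recovers the tokens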

PaLM: Scaling language modeling with Pathways

A Chowdhery, S Narang, J Devlin, M Bosma… - Journal of Machine …, 2023 - jmlr.org
Large language models have been shown to achieve remarkable performance across a
variety of natural language tasks using few-shot learning, which drastically reduces the …
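
The few-shot learning mentioned in the snippet is in-context prompting: a handful of input-output demonstrations are concatenated ahead of the query so the model performs the task without any parameter updates. A minimal illustrative sketch (the task and formatting are assumptions, unrelated to PaLM's actual evaluation setup):

# Minimal sketch of few-shot (in-context) prompting: demonstrations are concatenated
# into the prompt so the model can infer the task without gradient updates.
def build_few_shot_prompt(examples, query):
    """examples: list of (input, output) pairs shown to the model; query: new input."""
    shots = "\n".join(f"Input: {x}\nOutput: {y}" for x, y in examples)
    return f"{shots}\nInput: {query}\nOutput:"

demo = [("the movie was wonderful", "positive"),
        ("the plot made no sense", "negative")]
print(build_few_shot_prompt(demo, "acting was superb"))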

Super-NaturalInstructions: Generalization via declarative instructions on 1600+ NLP tasks

Y Wang, S Mishra, P Alipoormolabashi, Y Kordi… - arXiv preprint arXiv …, 2022 - arxiv.org
How well can NLP models generalize to a variety of unseen tasks when provided with task
instructions? To address this question, we first introduce Super-NaturalInstructions, a …

LoRA: Low-rank adaptation of large language models

EJ Hu, Y Shen, P Wallis, Z Allen-Zhu, Y Li… - arXiv preprint arXiv …, 2021 - arxiv.org
An important paradigm of natural language processing consists of large-scale pre-training
on general domain data and adaptation to particular tasks or domains. As we pre-train larger …
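
The adaptation method the title names freezes the pretrained weight W0 and trains only a low-rank update, so the effective weight becomes W0 + (alpha/r) * B A. A minimal PyTorch sketch under common conventions (dimensions, rank, and initialization here are illustrative assumptions):

# Minimal sketch of a LoRA-style linear layer: W0 is frozen and only the low-rank
# factors A and B are trained.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features, bias=False)
        self.base.weight.requires_grad_(False)                        # frozen pretrained weight W0
        self.A = nn.Parameter(torch.randn(r, in_features) * 0.01)     # trainable, small random init
        self.B = nn.Parameter(torch.zeros(out_features, r))           # trainable, zero init: no change at start
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + self.scaling * (x @ self.A.T @ self.B.T)

layer = LoRALinear(768, 768)
y = layer(torch.randn(4, 768))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(y.shape, trainable)    # torch.Size([4, 768]) and 2 * r * 768 = 12288 trainable parameters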

VeRA: Vector-based random matrix adaptation

DJ Kopiczko, T Blankevoort, YM Asano - arXiv preprint arXiv:2310.11454, 2023 - arxiv.org
Low-rank adaptation (LoRA) is a popular method that reduces the number of trainable
parameters when finetuning large language models, but still faces acute storage challenges …
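
VeRA addresses that storage issue by sharing a single pair of frozen random matrices across all adapted layers and training only two small per-layer scaling vectors. A minimal sketch of that idea, assuming illustrative shapes and initializations rather than the paper's exact configuration:

# Minimal sketch of the VeRA idea: frozen random matrices (A, B) are shared across
# layers; each layer trains only a rank-wise vector d and an output-wise vector b.
import torch
import torch.nn as nn

in_f, out_f, r = 768, 768, 8
shared_A = torch.randn(r, in_f)          # frozen, shared across all adapted layers
shared_B = torch.randn(out_f, r)         # frozen, shared across all adapted layers

class VeRALinear(nn.Module):
    def __init__(self, A, B):
        super().__init__()
        self.base = nn.Linear(in_f, out_f, bias=False)
        self.base.weight.requires_grad_(False)       # frozen pretrained weight W0
        self.register_buffer("A", A)                  # frozen random projections
        self.register_buffer("B", B)
        self.d = nn.Parameter(torch.ones(r))          # trainable per-layer scaling over rank dims
        self.b = nn.Parameter(torch.zeros(out_f))     # trainable per-layer scaling over output dims

    def forward(self, x):
        # delta(x) = Lambda_b B Lambda_d A x, realised with elementwise scalings
        z = (x @ self.A.T) * self.d
        return self.base(x) + (z @ self.B.T) * self.b

layer1, layer2 = VeRALinear(shared_A, shared_B), VeRALinear(shared_A, shared_B)
print(sum(p.numel() for p in layer1.parameters() if p.requires_grad))   # r + out_f = 776 per layer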

Prefix-tuning: Optimizing continuous prompts for generation

XL Li, P Liang - arXiv preprint arXiv:2101.00190, 2021 - arxiv.org
Fine-tuning is the de facto way to leverage large pretrained language models to perform
downstream tasks. However, it modifies all the language model parameters and therefore …
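
Prefix-tuning instead keeps the language model frozen and optimizes a small set of continuous prefix vectors that are prepended to the keys and values of each attention layer. A toy single-layer sketch of that mechanism (the layer and sizes are illustrative assumptions, not the paper's implementation):

# Minimal sketch of prefix-tuning: only the prepended prefix key/value vectors train,
# while the (toy) pretrained attention projection stays frozen.
import torch
import torch.nn as nn

class PrefixAttention(nn.Module):
    def __init__(self, dim=64, prefix_len=5):
        super().__init__()
        self.qkv = nn.Linear(dim, 3 * dim, bias=False)
        self.qkv.weight.requires_grad_(False)                  # frozen pretrained projection
        self.prefix_k = nn.Parameter(torch.randn(prefix_len, dim) * 0.02)   # trainable prefix keys
        self.prefix_v = nn.Parameter(torch.randn(prefix_len, dim) * 0.02)   # trainable prefix values
        self.dim = dim

    def forward(self, x):                                      # x: (batch, seq, dim)
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        bsz = x.shape[0]
        k = torch.cat([self.prefix_k.expand(bsz, -1, -1), k], dim=1)   # prepend prefix keys
        v = torch.cat([self.prefix_v.expand(bsz, -1, -1), v], dim=1)   # prepend prefix values
        attn = torch.softmax(q @ k.transpose(1, 2) / self.dim ** 0.5, dim=-1)
        return attn @ v

layer = PrefixAttention()
print(layer(torch.randn(2, 10, 64)).shape)                     # torch.Size([2, 10, 64])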

Large language models can be strong differentially private learners

X Li, F Tramer, P Liang, T Hashimoto - arXiv preprint arXiv:2110.05679, 2021 - arxiv.org
Differentially Private (DP) learning has seen limited success for building large deep learning
models of text, and straightforward attempts at applying Differentially Private Stochastic …
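
The method alluded to, Differentially Private Stochastic Gradient Descent (DP-SGD), clips each per-example gradient to a norm bound and adds calibrated Gaussian noise before the update. The sketch below shows only that generic recipe, not the paper's large-model-specific techniques; the toy model, clipping bound C, and noise multiplier sigma are illustrative assumptions:

# Minimal sketch of one DP-SGD step: per-example clipping, Gaussian noise, averaging.
import torch
import torch.nn as nn

model = nn.Linear(10, 1)
loss_fn = nn.MSELoss()
C, sigma, lr = 1.0, 0.5, 0.1

def dp_sgd_step(xb, yb):
    per_example_grads = []
    for x, y in zip(xb, yb):                                   # per-example gradients
        model.zero_grad()
        loss_fn(model(x.unsqueeze(0)), y.unsqueeze(0)).backward()
        g = torch.cat([p.grad.flatten() for p in model.parameters()])
        g = g * torch.clamp(C / (g.norm() + 1e-12), max=1.0)   # clip to norm bound C
        per_example_grads.append(g)
    noisy = torch.stack(per_example_grads).sum(0)
    noisy = noisy + sigma * C * torch.randn_like(noisy)        # Gaussian noise scaled by sigma * C
    noisy = noisy / len(xb)                                    # average over the batch
    with torch.no_grad():                                      # manual SGD update
        offset = 0
        for p in model.parameters():
            p -= lr * noisy[offset:offset + p.numel()].view_as(p)
            offset += p.numel()

dp_sgd_step(torch.randn(8, 10), torch.randn(8, 1))
print([p.shape for p in model.parameters()])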