A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

AMMUS: A survey of transformer-based pretrained models in natural language processing

KS Kalyan, A Rajasekharan, S Sangeetha - arXiv preprint arXiv …, 2021 - arxiv.org
Transformer-based pretrained language models (T-PTLMs) have achieved great success in
almost every NLP task. The evolution of these models started with GPT and BERT. These …

NusaCrowd: Open source initiative for Indonesian NLP resources

S Cahyawijaya, H Lovenia, AF Aji… - Findings of the …, 2023 - aclanthology.org
We present NusaCrowd, a collaborative initiative to collect and unify existing resources for
Indonesian languages, including opening access to previously non-public resources …

SemEval-2022 Task 11: Multilingual complex named entity recognition (MultiCoNER)

S Malmasi, A Fang, B Fetahu, S Kar… - Proceedings of the …, 2022 - aclanthology.org
We present the findings of SemEval-2022 Task 11 on Multilingual Complex Named Entity
Recognition (MultiCoNER). Divided into 13 tracks, the task focused on methods to identify …

Language models are few-shot multilingual learners

GI Winata, A Madotto, Z Lin, R Liu, J Yosinski… - arXiv preprint arXiv …, 2021 - arxiv.org
General-purpose language models have demonstrated impressive capabilities, performing
on par with state-of-the-art approaches on a range of downstream natural language …

JGLUE: Japanese general language understanding evaluation

K Kurihara, D Kawahara, T Shibata - Proceedings of the Thirteenth …, 2022 - aclanthology.org
To develop high-performance natural language understanding (NLU) models, it is
necessary to have a benchmark to evaluate and analyze NLU ability from various …

On the effect of pretraining corpora on in-context learning by a large-scale language model

S Shin, SW Lee, H Ahn, S Kim, HS Kim, B Kim… - arXiv preprint arXiv …, 2022 - arxiv.org
Many recent studies on large-scale language models have reported successful in-context
zero- and few-shot learning ability. However, the in-depth analysis of when in-context …

IGLUE: A benchmark for transfer learning across modalities, tasks, and languages

E Bugliarello, F Liu, J Pfeiffer, S Reddy… - International …, 2022 - proceedings.mlr.press
Reliable evaluation benchmarks designed for replicability and comprehensiveness have
driven progress in machine learning. Due to the lack of a multilingual benchmark, however …

End-to-end transformer-based models in textual-based NLP

A Rahali, MA Akhloufi - AI, 2023 - mdpi.com
Transformer architectures are highly expressive because they use self-attention
mechanisms to encode long-range dependencies in the input sequences. In this paper, we …
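As a rough illustration of the self-attention mechanism this abstract refers to (a minimal sketch, not code from the cited paper), a single attention head can be written as a scaled dot-product over query, key, and value projections; the softmax weights are what allow any position to draw on any other, however distant.

    # Minimal sketch of single-head scaled dot-product self-attention (illustrative only).
    import numpy as np

    def self_attention(x, w_q, w_k, w_v):
        """x: (seq_len, d_model); w_q, w_k, w_v: (d_model, d_k) projections."""
        q, k, v = x @ w_q, x @ w_k, x @ w_v            # queries, keys, values
        scores = q @ k.T / np.sqrt(k.shape[-1])        # pairwise similarities, scaled by sqrt(d_k)
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True) # softmax over key positions
        return weights @ v                             # each output mixes values from all positions

    # Toy usage: 5 tokens, model width 8, head width 4.
    rng = np.random.default_rng(0)
    x = rng.normal(size=(5, 8))
    w_q, w_k, w_v = (rng.normal(size=(8, 4)) for _ in range(3))
    print(self_attention(x, w_q, w_k, w_v).shape)      # (5, 4)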

BLOOM+1: Adding language support to BLOOM for zero-shot prompting

ZX Yong, H Schoelkopf, N Muennighoff, AF Aji… - arXiv preprint arXiv …, 2022 - arxiv.org
The BLOOM model is a large publicly available multilingual language model, but its
pretraining was limited to 46 languages. To extend the benefits of BLOOM to other …