Embedding structure matters: Comparing methods to adapt multilingual vocabularies to new languages

P Lin, S Ji, J Tiedemann, AFT Martins… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models have advanced the state of the art in natural language processing.
However, their predominant design for English or a limited set of languages creates a …

被引用次数：17 相关文章所有 2 个版本

[PDF] arxiv.org

Exploring Design Choices for Building Language-Specific LLMs

A Tejaswi, N Gupta, E Choi - arXiv preprint arXiv:2406.14670, 2024 - arxiv.org

Despite rapid progress in large language models (LLMs), their performance on a vast
majority of languages remain unsatisfactory. In this paper, we study building language …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Language Adaptation on a Tight Academic Compute Budget: Tokenizer Swapping Works and Pure bfloat16 Is Enough

K Dobler, G de Melo - arXiv preprint arXiv:2408.15793, 2024 - arxiv.org

We investigate continued pretraining of LLMs for language adaptation on a tight academic
budget: a setting in which only a few GPUs can be used in parallel, for a heavily constrained …

[PDF] arxiv.org

Open Generative Large Language Models for Galician

P Gamallo, P Rodríguez, I de-Dios-Flores… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models (LLMs) have transformed natural language processing. Yet, their
predominantly English-centric training has led to biases and performance disparities across …

被引用次数：1 相关文章

[PDF] arxiv.org

Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?

HJ Han, A Eriguchi, H Xu, H Hoang, M Carpuat… - arXiv preprint arXiv …, 2024 - arxiv.org

Vocabulary adaptation, which integrates new vocabulary into pre-trained language models
(LMs), enables expansion to new languages and mitigates token over-fragmentation …

Multilingual Language Models: Analysis and Algorithms

T Blevins - 2024 - search.proquest.com

While large language models (LLMs) continue to grow in scale and gain new zero-shot
capabilities, their performance for languages beyond English increasingly lags behind. This …

[PDF][PDF] A Galician-Portuguese Generative Model

P Gamallo, P Rodríguez, D Santos, S Sotelo… - fegalaz.usc.es

Large language models (LLMs) have revolutionized natural language processing, but their
predominant focus on English has resulted in biases and performance differences across …

高级搜索

QQ 群