MaLA-500: Massive Language Adaptation of Large Language Models

P Lin, S Ji, J Tiedemann, AFT Martins… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models have advanced the state of the art in natural language processing.
However, their predominant design for English or a limited set of languages creates a …

Exploring Design Choices for Building Language-Specific LLMs

A Tejaswi, N Gupta, E Choi - arXiv preprint arXiv:2406.14670, 2024 - arxiv.org
Despite rapid progress in large language models (LLMs), their performance on a vast
majority of languages remains unsatisfactory. In this paper, we study building language …

Language Adaptation on a Tight Academic Compute Budget: Tokenizer Swapping Works and Pure bfloat16 Is Enough

K Dobler, G de Melo - arXiv preprint arXiv:2408.15793, 2024 - arxiv.org
We investigate continued pretraining of LLMs for language adaptation on a tight academic
budget: a setting in which only a few GPUs can be used in parallel, for a heavily constrained …

Open Generative Large Language Models for Galician

P Gamallo, P Rodríguez, I de-Dios-Flores… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have transformed natural language processing. Yet, their
predominantly English-centric training has led to biases and performance disparities across …

Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?

HJ Han, A Eriguchi, H Xu, H Hoang, M Carpuat… - arXiv preprint arXiv …, 2024 - arxiv.org
Vocabulary adaptation, which integrates new vocabulary into pre-trained language models
(LMs), enables expansion to new languages and mitigates token over-fragmentation …

Multilingual Language Models: Analysis and Algorithms

T Blevins - 2024 - search.proquest.com
While large language models (LLMs) continue to grow in scale and gain new zero-shot
capabilities, their performance for languages beyond English increasingly lags behind. This …

A Galician-Portuguese Generative Model

P Gamallo, P Rodríguez, D Santos, S Sotelo… - fegalaz.usc.es
Large language models (LLMs) have revolutionized natural language processing, but their
predominant focus on English has resulted in biases and performance differences across …