Machine culture

L Brinkmann, F Baumann, JF Bonnefon… - Nature Human …, 2023 - nature.com
The ability of humans to create and disseminate culture is often credited as the single most
important factor of our success as a species. In this Perspective, we explore the notion of …

Quantifying the dialect gap and its correlates across languages

A Kantharuban, I Vulić, A Korhonen - arXiv preprint arXiv:2310.15135, 2023 - arxiv.org
Historically, researchers and consumers have noticed a decrease in quality when applying
NLP tools to minority variants of languages (ie Puerto Rican Spanish or Swiss German), but …

Transcending language boundaries: Harnessing llms for low-resource language translation

P Shu, J Chen, Z Liu, H Wang, Z Wu, T Zhong… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable success across a wide
range of tasks and domains. However, their performance in low-resource language …

Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions

N San, M Bartelds, B Billings, E de Falco… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent research using pre-trained transformer models suggests that just 10 minutes of
transcribed speech may be enough to fine-tune such a model for automatic speech …

Neural machine translation for the indigenous languages of the Americas: An introduction

M Mager, R Bhatnagar, G Neubig, NT Vu… - arXiv preprint arXiv …, 2023 - arxiv.org
Neural models have drastically advanced state of the art for machine translation (MT)
between high-resource languages. Traditionally, these models rely on large amounts of …

Predictive typing method for Persian office automation

B Nouraei, J Shanbehzadeh, P Asghari - Engineering Applications of …, 2024 - Elsevier
Typing is a time-consuming task and predictive text is proposed as a solution. Recently,
Generative Pre-trained Transformers (GPT) have employed autoregressive deep learning to …

Natural language processing in politics

T Marwala - Artificial intelligence, game theory and mechanism …, 2023 - Springer
Natural language processing (NLP) has changed how humans interact with technology and
evaluate data. Its ability to comprehend, interpret, and generate human language has …

Too brittle to touch: comparing the stability of quantization and distillation towards developing low-resource MT models

H Diddee, S Dandapat, M Choudhury… - Proceedings of the …, 2022 - aclanthology.org
Leveraging shared learning through Massively Multilingual Models, state-of-the-art Machine
translation (MT) models are often able to adapt to the paucity of data for low-resource …

Machine culture

L Brinkmann, F Baumann, JF Bonnefon… - Nature Human …, 2023 - hal.science
The ability of humans to create and disseminate culture is often credited as the single most
important factor of our success as a species. In this Perspective, we explore the notion of …

Shifting from endangerment to rebirth in the Artificial Intelligence Age: An Ensemble Machine Learning Approach for Hawrami Text Classification

A Khaksar, H Hassani - arXiv preprint arXiv:2409.16884, 2024 - arxiv.org
Hawrami, a dialect of Kurdish, is classified as an endangered language as it suffers from the
scarcity of data and the gradual loss of its speakers. Natural Language Processing projects …