Automatic speech recognition for supporting endangered language documentation

E Prud'hommeaux, R Jimerson, R Hatcher… - 2021 - scholarspace.manoa.hawaii.edu
Generating accurate word-level transcripts of recorded speech for language documentation
is difficult and time-consuming, even for skilled speakers of the target language. Automatic …

User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis

O Adams, B Galliot, G Wisniewski… - arXiv preprint arXiv …, 2020 - arxiv.org
This paper reports on progress integrating the speech recognition toolkit ESPnet into Elpis, a
web front-end originally designed to provide access to the Kaldi automatic speech …

[PDF][PDF] Balancing Social Impact, Opportunities, and Ethical Constraints of Using AI in the Documentation and Vitalization of Indigenous Languages.

CS Pinhanez, PR Cavalin, M Vasconcelos, J Nogima - IJCAI, 2023 - ijcai.org
In this paper we discuss how AI can contribute to support the documentation and vitalization
of Indigenous languages and how that involves a delicate balancing of ensuring social …

Phonemic transcription of low-resource languages: To what extent can preprocessing be automated?

G Wisniewski, A Michaud… - 1st Joint SLTU (Spoken …, 2020 - shs.hal.science
Automatic Speech Recognition for low-resource languages has been an active field of
research for more than a decade. It holds promise for facilitating the urgent task of …

[PDF][PDF] Recognizing lexical units in low-resource language contexts with supervised and unsupervised neural networks

C Macaire - 2021 - hal.science
Automatic Speech Recognition (ASR) has made significant progress thanks to the advent of
deep neural networks (DNNs). In the context of under-resourced languages, for which few …

Fashioning local designs from generic speech technologies in an Australian aboriginal community

É Le Ferrand, S Bird, L Besacier - International Conference on …, 2022 - hal.science
An increasing number of papers have been addressing issues related to low-resource
languages and the transcription bottleneck paradigm. After several years spent in Northern …

Natural Language Processing RELIES on Linguistics

J Opitz, S Wein, N Schneider - arXiv preprint arXiv:2405.05966, 2024 - arxiv.org
Large Language Models (LLMs) have become capable of generating highly fluent text in
certain languages, without modules specially designed to capture grammar or semantic …

[PDF][PDF] Language analysis in library OPAC: designing an open source software based framework for bibliographic records in mainstream and tribal languages

P Mukhopadhyay, A Dutta - DESIDOC Journal of Library & …, 2020 - academia.edu
It reports the development of an enhanced library OPAC prototype through integration of
language analysis tool and book reader in the retrieval interface. Language analysis or text …

La transcription du linguiste au miroir de l'intelligence artificielle: réflexions à partir de la transcription phonémique automatique

A Michaud, O Adams, C Cox, S Guillaume… - Bulletin de la Société …, 2020 - shs.hal.science
Les systèmes de reconnaissance automatique de la parole atteignent désormais des
degrés de précision élevés sur la base d'un corpus d'entraînement limité à deux ou trois …

Participatory translations of oshiwambo: Towards sustainable culture preservation with language technology

W Nekoto, J Kreutzer, J Rajab, M Ochieng… - 3rd Workshop on …, 2022 - openreview.net
In this paper, we describe a participatory, collaborative, and cost-effective process for
creating translations in Oshiwambo, the most widely African language spoken in Namibia …