[PDF][PDF] Two-Stage Data Augmentation for Low-Resourced Speech Recognition.

W Hartmann, T Ng, R Hsiao, S Tsakalidis… - Interspeech, 2016 - isca-archive.org
Low resourced languages suffer from limited training data and resources. Data
augmentation is a common approach to increasing the amount of training data. Additional …

The 2016 BBN Georgian telephone speech keyword spotting system

T Alumäe, D Karakos, W Hartmann… - … , Speech and Signal …, 2017 - ieeexplore.ieee.org
In this paper we describe the 2016 BBN conversational telephone speech keyword spotting
system; the culmination of four years of research and development under the IARPA Babel …

[PDF][PDF] Improved Multilingual Training of Stacked Neural Network Acoustic Models for Low Resource Languages.

T Alumäe, S Tsakalidis, RM Schwartz - Interspeech, 2016 - isca-archive.org
This paper proposes several improvements to multilingual training of neural network
acoustic models for speech recognition and keyword spotting in the context of low-resource …

Multilingual techniques for low resource automatic speech recognition

E Chuangsuwanich - 2016 - dspace.mit.edu
Out of the approximately 7000 languages spoken around the world, there are only about
100 languages with Automatic Speech Recognition (ASR) capability. This is due to the fact …

[HTML][HTML] An introduction to pluricentric languages in speech science and technology

B Schuppler, M Adda-Decker, C Cucchiarini… - Speech …, 2024 - Elsevier
Pluricentric languages are languages that are spoken in at least two countries where they
have an official function and thus develop national varieties with specific linguistic and …

Constructing sub-word units for spoken term detection

C Van Heerden, D Karakos… - … , Speech and Signal …, 2017 - ieeexplore.ieee.org
Spoken term detection, especially of out-of-vocabulary (OOV) keywords, benefits from the
use of sub-word systems. We experiment with different language-independent approaches …

[PDF][PDF] Language Modeling for Speech Analytics in Under-Resourced Languages.

S Wills, P Uys, CJ van Heerden, E Barnard - INTERSPEECH, 2020 - isca-archive.org
Different language modeling approaches are evaluated on two under-resourced,
agglutinative, South African languages; Sesotho and isiZulu. The two languages present …

Real-time transcription, keyword spotting, archival and retrieval for telugu TV news using ASR

M Pala, L Parayitam, V Appala - International Journal of Speech …, 2019 - Springer
The main objective of this paper is to describe the system developed for transcription,
keyword spotting and alerting, archival, and retrieval for broadcasted Telugu TV news. Real …

[PDF][PDF] Comparison of Multiple System Combination Techniques for Keyword Spotting.

W Hartmann, Le Zhang 0002, K Barnes, R Hsiao… - Interspeech, 2016 - researchgate.net
Abstract System combination is a common approach to improving results for both speech
transcription and keyword spotting—especially in the context of low-resourced languages …

Code-switched English pronunciation modeling for Swahili spoken term detection

N Kleynhans, W Hartman, D Van Niekerk… - Procedia Computer …, 2016 - Elsevier
We investigate modeling strategies for English code-switched words as found in a Swahili
spoken term detection system. Code switching, where speakers switch language in a …