Data augmentation for low resource languages

A Ragni, KM Knill, SP Rath… - … 2014: 15th annual …, 2014 - eprints.whiterose.ac.uk
Recently there has been interest in the approaches for training speech recognition systems
for languages with limited resources. Under the IARPA Babel program such resources have …

Spoken content retrieval—beyond cascading speech recognition with text retrieval

L Lee, J Glass, H Lee, C Chan - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
Spoken content retrieval refers to directly indexing and retrieving spoken content based on
the audio rather than text descriptions. This potentially eliminates the requirement of …

End-to-end speech recognition and keyword search on low-resource languages

A Rosenberg, K Audhkhasi, A Sethy… - … on acoustics, speech …, 2017 - ieeexplore.ieee.org
In recent years, so-called,“end-to-end” speech recognition systems have emerged as viable
alternatives to traditional ASR frameworks. Keyword search, localizing an orthographic …

High-performance query-by-example spoken term detection on the SWS 2013 evaluation

LJ Rodriguez-Fuentes, A Varona… - … , Speech and Signal …, 2014 - ieeexplore.ieee.org
In the last years, the task of Query-by-Example Spoken Term Detection (QbE-STD), which
aims to find occurrences of a spoken query in a set of audio documents, has gained the …

[PDF][PDF] Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages.

Z Tüske, P Golik, D Nolden, R Schlüter, H Ney - Interspeech, 2014 - academia.edu
This paper presents the progress of acoustic models for lowresourced languages
(Assamese, Bengali, Haitian Creole, Lao, Zulu) developed within the second evaluation …

[PDF][PDF] Subword and phonetic search for detecting out-of-vocabulary keywords

D Karakos, R Schwartz - Fifteenth Annual Conference of the …, 2014 - isca-archive.org
We compare several approaches, separately and together, for spotting of out-of-vocabulary
(OOV) keywords, in terms of their ATWV scores. We considered three types of recognition …

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

B Yusuf, J Černocký, M Saraçlar - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
Conventional keyword search systems operate on automatic speech recognition (ASR)
outputs, which causes them to have a complex indexing and search pipeline. This has led to …

Constructing sub-word units for spoken term detection

C Van Heerden, D Karakos… - … , Speech and Signal …, 2017 - ieeexplore.ieee.org
Spoken term detection, especially of out-of-vocabulary (OOV) keywords, benefits from the
use of sub-word systems. We experiment with different language-independent approaches …

Using pronunciation-based morphological subword units to improve OOV handling in keyword search

Y He, P Baumann, H Fang… - … on Audio, Speech …, 2015 - ieeexplore.ieee.org
Out-of-vocabulary (OOV) keywords present a challenge for keyword search (KWS) systems
especially in the low-resource setting. Previous research has centered around approaches …

Joint learning of distance metric and query model for posteriorgram-based keyword search

B Gündoğdu, B Yusuf, M Saraçlar - IEEE Journal of Selected …, 2017 - ieeexplore.ieee.org
In this paper, we propose a novel approach to keyword search (KWS) in low-resource
languages, which provides an alternative method for retrieving the terms of interest …