Speech recognition for resource deficient languages using frugal speech corpus

A Imran, K Sunil - 2012 IEEE International Conference on …, 2012 - ieeexplore.ieee.org
use online audio news archives to build a frugal speech corpus. We then use this speech
corpus … of deviation from the conventional approach in terms of making it frugal. In the rest of …

Crowdsourcing speech data for low-resource languages from low-income workers

B Abraham, D Goel, D Siddarth, K Bali… - … Language Resources …, 2020 - aclanthology.org
… to the speech dataset, we believe this approach can also provide … or manual gold standard
annotations for the speech data … Cheap, fast and good enough: Automatic speech recognition …

Phrase detectives: Utilizing collective intelligence for internet-scale language resource creation

M Poesio, J Chamberlain, U Kruschwitz… - … Intelligent Systems  …, 2013 - dl.acm.org
… that the collaborative approach to resource creation can also … 4, and the methods used to
create a corpus in Section 5. The … refer to as markables using standard annotation terminology) …

The GUM corpus: Creating multilayer resources in the classroom

A Zeldes - Language Resources and Evaluation, 2017 - Springer
methodology, design principles and detailed evaluation of a new freely available multilayer
corpus, collected and edited via classroom annotation using … relatively standard English Web

Speech recognition for illiterate access to information and technology

M Plauche, U Nallasamy, J Pal… - … on information and …, 2006 - ieeexplore.ieee.org
… inexpensive approach for gathering the linguistic resources … This paper describes an
SDS built from standard speech … We constructed a cheap and modifiable flex button system (…

Dirt cheap web-scale parallel text from the common crawl

JR Smith, H Saint-Amand, M Plamada, P Koehn… - 2013 - zora.uzh.ch
… differ from standard government and news training text, web-… of each corpus (percentage
of n-grams from the test corpus … a two-step process, first obtaining parallel data from the web, …

[PDF][PDF] Cheap, fast and good enough: Automatic speech recognition with non-expert transcription

S Novotney, C Callison-Burch - … of the North American Chapter of …, 2010 - aclanthology.org
… For each method of selection, we build an acoustic and … use for quality control without
gold standard reference data. … conversational telephone speech corpus. In RT04 Workshop. …

Using mechanical turk to create a corpus of arabic summaries

M El-Haj, U Kruschwitz, C Fox - 2010 - repository.essex.ac.uk
… intelligence—to generate our own reference standard for … of the impact of the aggregation
method on the results of the … here was to create a relatively small but usable resource. We …

AGH corpus of Polish speech

P Żelasko, B Ziółko, T Jadczyk, D Skurzok - Language Resources and …, 2016 - Springer
… lots of recorded speech can be found on the Internet, it is … require as much resources as
traditional training techniques. It … ) is various: some students use cheap PC microphones and cell …

Cheap translation for cross-lingual named entity recognition

S Mayhew, CT Tsai, D Roth - … conference on empirical methods in …, 2017 - aclanthology.org
… several high resource language(s) into the target language, and learns a standard monolingual
… English corpus into the target script. These results are in Table 2, row “Baseline”. In our …