A mechanism for personalized Automatic Speech Recognition for less frequently spoken languages: the Greek case

P Antoniadis, E Tsardoulias, A Symeonidis - Multimedia Tools and …, 2022 - Springer
Abstract Automatic Speech Recognition (ASR) has become increasingly popular since it
significantly simplifies human-computer interaction, providing a more intuitive way of …

[PDF][PDF] Korpus in baza Gos Videolectures

D Verdonik - V D. Fišer in A. Pančur (ur.), Zbornik 11. konference …, 2018 - sdjt.si
Leta 2011 je bil dokončan prvi sklop referenčnega govornega korpusa Gos. Zajetih je bilo
120 ur posnetkov govora oz. 1 mio. besed. S tem obsegom se korpus Gos uvršča na …

Interface for smart audiovisual data archive

T Koctúr, M Pleva, J Juhár - 2015 25th International Conference …, 2015 - ieeexplore.ieee.org
This paper describes selected concepts and solutions that were used to build an interface
for smart audiovisual data archive (ISADA). Automatic speech recognition (ASR) cloud …

[PDF][PDF] Vprašanja zapisovanja govora v govornem korpusu Gos

D Verdonik - V T. Erjavec in J. Žganec Gros (ur.): Jezikovne …, 2014 - nl.ijs.si
Prispevek obravnava vprašanja, povezana z morebitno nadgradnjo referenčnega
govornega korpusa slovenščine Gos, s poudarkom na nekaterih težavnejših vprašanjih …

The SI TEDx-UM speech database: A new Slovenian spoken language resource

A Žgank, MS Maucec, D Verdonik - Proceedings of the Tenth …, 2016 - aclanthology.org
This paper presents a new Slovenian spoken language resource built from TEDx Talks. The
speech database contains 242 talks in total duration of 54 hours. The annotation and …

Gos 2: A New Reference Corpus of Spoken Slovenian

D Verdonik, K Dobrovoljc, T Erjavec… - Proceedings of the …, 2024 - aclanthology.org
This paper introduces a new version of the Gos reference corpus of spoken Slovenian,
which was recently extended to more than double the original size (300 hours, 2.4 million …

[PDF][PDF] Application of Fischer semi discriminant analysis for speaker diarization in costa rican radio broadcasts

R Sánchez Cárdenas, M Coto-Jiménez - 2022 - scholar.archive.org
Automatic segmentation and classification of audio streams is a challenging problem, with
many applications, such as indexing multimedia digital libraries, information retrieving, and …

[PDF][PDF] Razpoznavanje tekočega govora v slovenščini z bazo predavanj SI TEDx-UM

A Žgank, D Verdonik, MS Maučec - Proceedings of the Conference on …, 2016 - sdjt.si
V članku bomo predstavili novi slovenski govorni vir, nastal na osnovi posnetkov predavanj
TEDx. Govorna baza vsebuje posnetke 242 predavanj, v skupni dolžini 54 ur …

Application of Fischer semi discriminant analysis for speaker diarization in costa rican radio broadcasts

RS Cárdenas, MC Jiménez - Tecnología en Marcha, 2022 - dialnet.unirioja.es
Automatic segmentation and classification of audio streams is a challenging problem, with
many applications, such as indexing multimedia digital libraries, information retrieving, and …

[PDF][PDF] Govorni, dialoški in multimodalni jezikovni viri: pregled stanja

D Verdonik, A Žgank, S Majhenič, I Mlakar - 2020 - dsplab.feri.um.si
Iz govornih korpusov lahko izluščimo informacije o jeziku, ki jih iz pisnih korpusov ne
moremo dobiti. Pisni vir ne more v celoti ustrezno zastopati govorjene rabe. Iz samo 1 …