[HTML][HTML] Advances in subword-based HMM-DNN speech recognition across languages

P Smit, S Virpioja, M Kurimo - Computer Speech & Language, 2021 - Elsevier
We describe a novel way to implement subword language models in speech recognition
systems based on weighted finite state transducers, hidden Markov models, and deep …

[PDF][PDF] Finnish parliament on the semantic web: Using ParliamentSampo data service and semantic portal for studying political culture and language

E Hyvönen, L Sinikallio, P Leskinen… - … Data in Action, 2022 - researchportal.helsinki.fi
This paper introduces the system ParliamentSampo–Parliament of Finland on the Semantic
Web, a Linked Open Data (LOD) service, data infrastructure, and semantic portal for …

Finnish parliament ASR corpus: Analysis, benchmarks and statistics

A Virkkunen, A Rouhe, N Phan, M Kurimo - Language Resources and …, 2023 - Springer
Public sources like parliament meeting recordings and transcripts provide ever-growing
material for the training and evaluation of automatic speech recognition (ASR) systems. In …

[PDF][PDF] Neural text-to-speech adaptation from low quality public recordings

Q Hu, E Marchi, D Winarsky, Y Stylianou… - Speech Synthesis …, 2019 - isca-archive.org
Abstract Neural Text-to-Speech (TTS) synthesis is able to generate highquality speech with
natural prosody. However, these systems typically require a large amount of data, preferably …

Plenary Speeches of the Parliament of Finland as Linked Open Data and Data Services

E Hyvönen, L Sinikallio, P Leskinen… - CEUR Workshop …, 2023 - research.aalto.fi
This paper presents a new open infrastructure called ParliamentSampo for studying the
parliamentary culture, language, and activities of politicians in Finland. For the first time, the …

Computational intelligence in processing of speech acoustics: a survey

A Singh, N Kaur, V Kukreja, V Kadyan… - Complex & Intelligent …, 2022 - Springer
Speech recognition of a language is a key area in the field of pattern recognition. This paper
presents a comprehensive survey on the speech recognition techniques for non-Indian and …

Publishing and using parliamentary Linked Data on the Semantic Web: ParliamentSampo system for Parliament of Finland

E Hyvönen, L Sinikallio, P Leskinen, S Drobac… - Semantic …, 2024 - content.iospress.com
This paper presents a new infrastructure and semantic portal called ParliamentSampo for
studying parliamentary speeches, culture, language, and activities in Finland. For the first …

ParlaSpeech-HR-a freely available ASR dataset for croatian bootstrapped from the parlaMint corpus

N Ljubešić, D Koržinek, P Rupnik… - Proceedings of the …, 2022 - aclanthology.org
This paper presents our bootstrapping efforts of producing the first large freely available
Croatian automatic speech recognition (ASR) dataset, 1,816 hours in size, obtained from …

Low resource comparison of attention-based and hybrid ASR exploiting wav2vec 2.0

A Rouhe, A Virkkunen, J Leinonen, M Kurimo - Interspeech, 2022 - research.aalto.fi
Low resource speech recognition can potentially benefit a lot from exploiting a pretrained
model such as wav2vec 2.0. These pretrained models have learned useful representations …

Preparation of bangla speech corpus from publicly available audio & text

S Ahmed, N Sadeq, SS Shubha, MN Islam… - Proceedings of the …, 2020 - aclanthology.org
Automatic speech recognition systems require large annotated speech corpus. The manual
annotation of a large corpus is very difficult. In this paper, we focus on the automatic …