Automatic construction of the Finnish parliament speech corpus

P Smit, S Virpioja, M Kurimo - Computer Speech & Language, 2021 - Elsevier

We describe a novel way to implement subword language models in speech recognition
systems based on weighted finite state transducers, hidden Markov models, and deep …

被引用次数：52 相关文章所有 12 个版本

[PDF] helsinki.fi

[PDF][PDF] Finnish parliament on the semantic web: Using ParliamentSampo data service and semantic portal for studying political culture and language

E Hyvönen, L Sinikallio, P Leskinen… - … Data in Action, 2022 - researchportal.helsinki.fi

This paper introduces the system ParliamentSampo–Parliament of Finland on the Semantic
Web, a Linked Open Data (LOD) service, data infrastructure, and semantic portal for …

被引用次数：28 相关文章所有 13 个版本

[PDF] springer.com

Finnish parliament ASR corpus: Analysis, benchmarks and statistics

A Virkkunen, A Rouhe, N Phan, M Kurimo - Language Resources and …, 2023 - Springer

Public sources like parliament meeting recordings and transcripts provide ever-growing
material for the training and evaluation of automatic speech recognition (ASR) systems. In …

被引用次数：13 相关文章所有 10 个版本

[PDF] isca-archive.org

[PDF][PDF] Neural text-to-speech adaptation from low quality public recordings

Q Hu, E Marchi, D Winarsky, Y Stylianou… - Speech Synthesis …, 2019 - isca-archive.org

Abstract Neural Text-to-Speech (TTS) synthesis is able to generate highquality speech with
natural prosody. However, these systems typically require a large amount of data, preferably …

被引用次数：35 相关文章所有 4 个版本

[PDF] aalto.fi

Plenary Speeches of the Parliament of Finland as Linked Open Data and Data Services

E Hyvönen, L Sinikallio, P Leskinen… - CEUR Workshop …, 2023 - research.aalto.fi

This paper presents a new open infrastructure called ParliamentSampo for studying the
parliamentary culture, language, and activities of politicians in Finland. For the first time, the …

被引用次数：6 相关文章所有 12 个版本

[PDF] springer.com

Computational intelligence in processing of speech acoustics: a survey

A Singh, N Kaur, V Kukreja, V Kadyan… - Complex & Intelligent …, 2022 - Springer

Speech recognition of a language is a key area in the field of pattern recognition. This paper
presents a comprehensive survey on the speech recognition techniques for non-Indian and …

被引用次数：17 相关文章所有 7 个版本

[PDF] iospress.com

Publishing and using parliamentary Linked Data on the Semantic Web: ParliamentSampo system for Parliament of Finland

E Hyvönen, L Sinikallio, P Leskinen, S Drobac… - Semantic …, 2024 - content.iospress.com

This paper presents a new infrastructure and semantic portal called ParliamentSampo for
studying parliamentary speeches, culture, language, and activities in Finland. For the first …

被引用次数：4 相关文章所有 3 个版本

[PDF] aclanthology.org

ParlaSpeech-HR-a freely available ASR dataset for croatian bootstrapped from the parlaMint corpus

N Ljubešić, D Koržinek, P Rupnik… - Proceedings of the …, 2022 - aclanthology.org

This paper presents our bootstrapping efforts of producing the first large freely available
Croatian automatic speech recognition (ASR) dataset, 1,816 hours in size, obtained from …

被引用次数：16 相关文章所有 6 个版本

[PDF] aalto.fi

Low resource comparison of attention-based and hybrid ASR exploiting wav2vec 2.0

A Rouhe, A Virkkunen, J Leinonen, M Kurimo - Interspeech, 2022 - research.aalto.fi

Low resource speech recognition can potentially benefit a lot from exploiting a pretrained
model such as wav2vec 2.0. These pretrained models have learned useful representations …

被引用次数：9 相关文章所有 5 个版本

[PDF] aclanthology.org

Preparation of bangla speech corpus from publicly available audio & text

S Ahmed, N Sadeq, SS Shubha, MN Islam… - Proceedings of the …, 2020 - aclanthology.org

Automatic speech recognition systems require large annotated speech corpus. The manual
annotation of a large corpus is very difficult. In this paper, we focus on the automatic …

被引用次数：18 相关文章所有 4 个版本

高级搜索

QQ 群