Open-source high quality speech datasets for Basque, Catalan and Galician

O Kjartansson, A Gutkin, A Butryna… - Proceedings of the …, 2020 - aclanthology.org
This paper introduces new open speech datasets for three of the languages of Spain:
Basque, Catalan and Galician. Catalan is furthermore the official language of the Principality …

Enhancing Crowdsourced Audio for Text-to-Speech Models

J Giraldo, M Llopart-Font, A Peiró-Lilja… - arXiv preprint arXiv …, 2024 - arxiv.org
High-quality audio data is a critical prerequisite for training robust text-to-speech models,
which often limits the use of opportunistic or crowdsourced datasets. This paper presents an …

Building an Open Source Automatic Speech Recognition System for Catalan

B Külebi, A Öktem - IberSPEECH, 2018 - isca-archive.org
Catalan is recognized as the largest stateless language in Europe; hence it is a language
well studied in the field of speech, and there exist various solutions for Automatic Speech …

Polish unit selection speech synthesis with BOSS: extensions and speech corpora

G Demenko, K Klessa, M Szymański, S Breuer… - International Journal of …, 2010 - Springer
This article presents research and development aimed at creating a Polish speech database
for speech synthesis and adapting BOSS (The Bonn Open Synthesis System) to the Polish …

LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech

A Peiró-Lilja, M Llopart-Font… - … : Aveiro, Portugal, 11 …, 2024 - linguistica.ub.edu
Current generative text-to-speech (TTS) models are very robust and capable of learning the
phonetics of a language almost perfectly. To do so, it remains crucial that the speech data …

Sistema de conversión texto a voz de código abierto para lenguas ibéricas [Open-source text-to-speech conversion system for Iberian languages]

A Alonso, I Sainz, D Erro, E Navas… - … del lenguaje natural, 2013 - journal.sepln.org
This article presents a text-to-speech converter based on statistical synthesis that, for
the first time, brings together in a single system the four official languages in …

Language-independent acoustic cloning of HTS voices

C Magariños, D Erro, ER Banga - Computer Speech & Language, 2019 - Elsevier
Speaker adaptation techniques can be classified as intra-lingual or cross-lingual depending
on whether or not the source model and the target speaker employ the same language. Most …

Synthesis using speaker adaptation from speech recognition db

S Oller, A Moreno, A Bonafonte - FALA-2010. ISCA Special Interest …, 2010 - dl.sergioller.com
This paper deals with the creation of multiple voices from a Hidden Markov Model based
speech synthesis system (HTS). More than 150 Catalan synthetic voices were built using …

Language-Independent Acoustic Cloning of HTS Voices: An Objective Evaluation

C Magariños, D Erro, P Lopez-Otero… - Advances in Speech and …, 2016 - Springer
In a previous work we presented a method to combine the acoustic characteristics of a
speech synthesis model with the linguistic characteristics of another one. This paper …

Discourse-givenness of noun phrases: theoretical and computational models

J Ritz - 2013 - publishup.uni-potsdam.de
This thesis gives formal definitions of discourse-givenness, coreference and reference, and
reports on experiments with computational models of discourse-givenness of noun phrases …