Recent work on the FESTCAT database for speech synthesis

O Kjartansson, A Gutkin, A Butryna… - Proceedings of the …, 2020 - aclanthology.org

This paper introduces new open speech datasets for three of the languages of Spain:
Basque, Catalan and Galician. Catalan is furthermore the official language of the Principality …

被引用次数：39 相关文章所有 7 个版本

[PDF] arxiv.org

Enhancing Crowdsourced Audio for Text-to-Speech Models

J Giraldo, M Llopart-Font, A Peiró-Lilja… - arXiv preprint arXiv …, 2024 - arxiv.org

High-quality audio data is a critical prerequisite for training robust text-to-speech models,
which often limits the use of opportunistic or crowdsourced datasets. This paper presents an …

[PDF][PDF] Building an Open Source Automatic Speech Recognition System for Catalan.

B Külebi, A Öktem - IberSPEECH, 2018 - isca-archive.org

Catalan is recognized as the largest stateless language in Europe hence it is a language
well studied in the field of speech, and there exists various solutions for Automatic Speech …

被引用次数：10 相关文章所有 3 个版本

Polish unit selection speech synthesis with BOSS: extensions and speech corpora

G Demenko, K Klessa, M Szymański, S Breuer… - International Journal of …, 2010 - Springer

This article presents research and development aimed at creating a Polish speech database
for speech synthesis and adapting BOSS (The Bonn Open Synthesis System) to the Polish …

被引用次数：13 相关文章所有 9 个版本

[PDF] ub.edu

[PDF][PDF] LaFresCat: A Catalan Multi-Accent Speech Dataset for Text-to-Speech

A Peiró-Lilja, M Llopart-Font… - … : Aveiro, Portugal, 11 …, 2024 - linguistica.ub.edu

Current generative text-to-speech (TTS) models are very robust and capable of learning the
phonetics of a language almost perfectly. To do so, it remains crucial that the speech data …

Sistema de conversión texto a voz de código abierto para lenguas ibéricas

A Alonso, I Sainz, D Erro, E Navas… - … del lenguaje natural, 2013 - journal.sepln.org

Este artículo presenta un conversor texto a voz basado en síntesis estadística que por
primera vez permite disponer en un único sistema de las cuatro lenguas oficiales en …

被引用次数：3 相关文章所有 8 个版本

[PDF] researchgate.net

Language-independent acoustic cloning of HTS voices

C Magariños, D Erro, ER Banga - Computer Speech & Language, 2019 - Elsevier

Speaker adaptation techniques can be classified as intra-lingual or cross-lingual depending
on whether or not the source model and the target speaker employ the same language. Most …

被引用次数：1 相关文章所有 2 个版本

[PDF] sergioller.com

[PDF][PDF] Synthesis using speaker adaptation from speech recognition db

S Oller, A Moreno, A Bonafonte - FALA-2010. ISCA Special Interest …, 2010 - dl.sergioller.com

This paper deals with the creation of multiple voices from a Hidden Markov Model based
speech synthesis system (HTS). More than 150 Catalan synthetic voices were built using …

被引用次数：3 相关文章

Language-Independent Acoustic Cloning of HTS Voices: An Objective Evaluation

C Magariños, D Erro, P Lopez-Otero… - Advances in Speech and …, 2016 - Springer

In a previous work we presented a method to combine the acoustic characteristics of a
speech synthesis model with the linguistic characteristics of another one. This paper …

被引用次数：1 相关文章所有 4 个版本

[PDF] uni-potsdam.de

Discourse-givenness of noun phrases: theoretical and computational models

J Ritz - 2013 - publishup.uni-potsdam.de

This thesis gives formal definitions of discourse-givenness, coreference and reference, and
reports on experiments with computational models of discourse-givenness of noun phrases …

被引用次数：1 相关文章

高级搜索

QQ 群