Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple...

[PDF][PDF] Text to speech synthesis: a systematic review, deep learning based architecture and future research direction

F Khanam, FA Munmun, NA Ritu, AK Saha… - Journal of Advances in …, 2022 - academia.edu

Text to Speech (TTS) synthesis is a process of translating natural language text into speech.
Pieces of recorded speech generate synthesized speech and a database is maintained for …

被引用次数：32 相关文章所有 4 个版本

[PDF] aclanthology.org

The SIGMORPHON 2020 shared task on multilingual grapheme-to-phoneme conversion

K Gorman, LFE Ashby, A Goyzueta… - Proceedings of the …, 2020 - aclanthology.org

We describe the design and findings of the SIGMORPHON 2020 shared task on multilingual
grapheme-to-phoneme conversion. Participants were asked to submit systems which take in …

被引用次数：63 相关文章所有 6 个版本

[PDF] ed.ac.uk

TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

A Stan, O Watts, Y Mamiya, M Giurgiu… - … 2013, 14th Annual …, 2013 - research.ed.ac.uk

Abstract Simple4All Tundra (version 1.0) is the first release of a standardised multilingual
corpus designed for text-to-speech research with imperfect or found data. The corpus …

被引用次数：54 相关文章所有 17 个版本

[PDF] google.com

ALISA: An automatic lightly supervised speech segmentation and alignment tool

A Stan, Y Mamiya, J Yamagishi, P Bell, O Watts… - Computer Speech & …, 2016 - Elsevier

This paper describes the ALISA tool, which implements a lightly supervised method for
sentence-level alignment of speech with imperfect transcripts. Its intended use is to enable …

被引用次数：41 相关文章所有 7 个版本

Unsupervised language identification based on Latent Dirichlet Allocation

W Zhang, RAJ Clark, Y Wang, W Li - Computer Speech & Language, 2016 - Elsevier

To automatically build, from scratch, the language processing component for a speech
synthesis system in a new language, a purified text corpora is needed where any words and …

被引用次数：27 相关文章所有 3 个版本

[PDF] cmu.edu

[PDF][PDF] Utterance Selection Techniques for TTS Systems Using Found Speech.

P Baljekar, AW Black - SSW, 2016 - cs.cmu.edu

The goal in this paper is to investigate data selection techniques for found speech. Found
speech unlike clean, phoneticallybalanced datasets recorded specifically for synthesis …

被引用次数：24 相关文章所有 7 个版本

[PDF] ed.ac.uk

The CSTR entry to the Blizzard Challenge 2016

T Merritt, S Ronanki, Z Wu, O Watts - Blizzard Challenge 2016, 2016 - research.ed.ac.uk

This paper describes the text-to-speech system entered by The Centre for Speech
Technology Research into the 2016 Blizzard Challenge. This system is a hybrid synthesis …

被引用次数：18 相关文章所有 5 个版本

[PDF] cmu.edu

Speech synthesis from found data

P Baljekar - 2018 - kilthub.cmu.edu

Text-to-speech synthesis (TTS) has progressed to such a stage that given a large, clean,
phonetically balanced dataset from a single speaker, it can produce intelligible, almost …

被引用次数：17 相关文章所有 5 个版本

[PDF] isca-archive.org

[PDF][PDF] HMM based myanmar text to speech system.

YK Thu, WP Pa, J Ni, Y Shiga, AM Finch, C Hori… - …, 2015 - isca-archive.org

This paper presents a complete statistical speech synthesizer for Myanmar which includes a
syllable segmenter, text normalizer, grapheme-to-phoneme convertor, and an HMM-based …

被引用次数：19 相关文章所有 4 个版本

[PDF] researchgate.net

Data selection for improving naturalness of tts voices trained on small found corpuses

FY Kuo, S Aryal, G Degottex, S Kang… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org

This work investigates techniques that select training data from small, found corpuses in
order to improve the naturalness of synthesized text-to-speech voices. The approach …

被引用次数：14 相关文章所有 2 个版本

高级搜索

QQ 群