[PDF][PDF] Text to speech synthesis: a systematic review, deep learning based architecture and future research direction

F Khanam, FA Munmun, NA Ritu, AK Saha… - Journal of Advances in …, 2022 - academia.edu
Text to Speech (TTS) synthesis is a process of translating natural language text into speech.
Pieces of recorded speech generate synthesized speech and a database is maintained for …

The SIGMORPHON 2020 shared task on multilingual grapheme-to-phoneme conversion

K Gorman, LFE Ashby, A Goyzueta… - Proceedings of the …, 2020 - aclanthology.org
We describe the design and findings of the SIGMORPHON 2020 shared task on multilingual
grapheme-to-phoneme conversion. Participants were asked to submit systems which take in …

TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

A Stan, O Watts, Y Mamiya, M Giurgiu… - … 2013, 14th Annual …, 2013 - research.ed.ac.uk
Abstract Simple4All Tundra (version 1.0) is the first release of a standardised multilingual
corpus designed for text-to-speech research with imperfect or found data. The corpus …

ALISA: An automatic lightly supervised speech segmentation and alignment tool

A Stan, Y Mamiya, J Yamagishi, P Bell, O Watts… - Computer Speech & …, 2016 - Elsevier
This paper describes the ALISA tool, which implements a lightly supervised method for
sentence-level alignment of speech with imperfect transcripts. Its intended use is to enable …

Unsupervised language identification based on Latent Dirichlet Allocation

W Zhang, RAJ Clark, Y Wang, W Li - Computer Speech & Language, 2016 - Elsevier
To automatically build, from scratch, the language processing component for a speech
synthesis system in a new language, a purified text corpora is needed where any words and …

[PDF][PDF] Utterance Selection Techniques for TTS Systems Using Found Speech.

P Baljekar, AW Black - SSW, 2016 - cs.cmu.edu
The goal in this paper is to investigate data selection techniques for found speech. Found
speech unlike clean, phoneticallybalanced datasets recorded specifically for synthesis …

The CSTR entry to the Blizzard Challenge 2016

T Merritt, S Ronanki, Z Wu, O Watts - Blizzard Challenge 2016, 2016 - research.ed.ac.uk
This paper describes the text-to-speech system entered by The Centre for Speech
Technology Research into the 2016 Blizzard Challenge. This system is a hybrid synthesis …

Speech synthesis from found data

P Baljekar - 2018 - kilthub.cmu.edu
Text-to-speech synthesis (TTS) has progressed to such a stage that given a large, clean,
phonetically balanced dataset from a single speaker, it can produce intelligible, almost …

[PDF][PDF] HMM based myanmar text to speech system.

YK Thu, WP Pa, J Ni, Y Shiga, AM Finch, C Hori… - …, 2015 - isca-archive.org
This paper presents a complete statistical speech synthesizer for Myanmar which includes a
syllable segmenter, text normalizer, grapheme-to-phoneme convertor, and an HMM-based …

Data selection for improving naturalness of tts voices trained on small found corpuses

FY Kuo, S Aryal, G Degottex, S Kang… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org
This work investigates techniques that select training data from small, found corpuses in
order to improve the naturalness of synthesized text-to-speech voices. The approach …