A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Delightfultts: The microsoft speech synthesis system for blizzard challenge 2021

Y Liu, Z Xu, G Wang, K Chen, B Li, X Tan, J Li… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper describes the Microsoft end-to-end neural text to speech (TTS) system:
DelightfulTTS for Blizzard Challenge 2021. The goal of this challenge is to synthesize …

Efficienttts: An efficient and high-quality text-to-speech architecture

C Miao, L Shuang, Z Liu, C Minchuan… - International …, 2021 - proceedings.mlr.press
In this work, we address the Text-to-Speech (TTS) task by proposing a non-autoregressive
architecture called EfficientTTS. Unlike the dominant non-autoregressive TTS models, which …

Fasttalker: A neural text-to-speech architecture with shallow and group autoregression

R Liu, B Sisman, Y Lin, H Li - Neural Networks, 2021 - Elsevier
Non-autoregressive architecture for neural text-to-speech (TTS) allows for parallel
implementation, thus reduces inference time over its autoregressive counterpart. However …

[PDF][PDF] EdenTTS: A Simple and Efficient Parallel Text-to-speech Architecture with Collaborative Duration-alignment Learning

Y Ma, J He, M Wu, G Hu, H Fei - Proc. Interspeech, 2023 - isca-archive.org
In pursuit of high inference speed, many non-autoregressive neural text-to-speech (TTS)
models have been proposed for parallel speech synthesis recently. A critical challenge of …

Fcl-taco2: Towards fast, controllable and lightweight text-to-speech synthesis

D Wang, L Deng, Y Zhang, N Zheng… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Sequence-to-sequence (seq2seq) learning has greatly improved text-to-speech (TTS)
synthesis performance, but effective implementation on resource-restricted devices remains …

Robust TTS

X Tan - Neural Text-to-Speech Synthesis, 2023 - Springer
In this chapter, we introduce how to address the robustness issues in TTS. We summarize
some popular techniques to improve robustness, including enhancing attention, replacing …

[引用][C] tts-tutorial/survey

G Repo, GPK Tool, X Tan, T Qin, F Soong, TY Liu

[引用][C] tts-tutorial/survey

X Tan, T Qin, F Soong, TY Liu