Moboaligner: A neural alignment model for non-autoregressive tts with monotonic boundary search

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

被引用次数：410 相关文章所有 2 个版本

[PDF] arxiv.org

Delightfultts: The microsoft speech synthesis system for blizzard challenge 2021

Y Liu, Z Xu, G Wang, K Chen, B Li, X Tan, J Li… - arXiv preprint arXiv …, 2021 - arxiv.org

This paper describes the Microsoft end-to-end neural text to speech (TTS) system:
DelightfulTTS for Blizzard Challenge 2021. The goal of this challenge is to synthesize …

被引用次数：62 相关文章所有 4 个版本

[PDF] mlr.press

Efficienttts: An efficient and high-quality text-to-speech architecture

C Miao, L Shuang, Z Liu, C Minchuan… - International …, 2021 - proceedings.mlr.press

In this work, we address the Text-to-Speech (TTS) task by proposing a non-autoregressive
architecture called EfficientTTS. Unlike the dominant non-autoregressive TTS models, which …

被引用次数：48 相关文章所有 5 个版本

Fasttalker: A neural text-to-speech architecture with shallow and group autoregression

R Liu, B Sisman, Y Lin, H Li - Neural Networks, 2021 - Elsevier

Non-autoregressive architecture for neural text-to-speech (TTS) allows for parallel
implementation, thus reduces inference time over its autoregressive counterpart. However …

被引用次数：16 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] EdenTTS: A Simple and Efficient Parallel Text-to-speech Architecture with Collaborative Duration-alignment Learning

Y Ma, J He, M Wu, G Hu, H Fei - Proc. Interspeech, 2023 - isca-archive.org

In pursuit of high inference speed, many non-autoregressive neural text-to-speech (TTS)
models have been proposed for parallel speech synthesis recently. A critical challenge of …

被引用次数：3 相关文章所有 3 个版本

[PDF] cuhk.edu.hk

Fcl-taco2: Towards fast, controllable and lightweight text-to-speech synthesis

D Wang, L Deng, Y Zhang, N Zheng… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Sequence-to-sequence (seq2seq) learning has greatly improved text-to-speech (TTS)
synthesis performance, but effective implementation on resource-restricted devices remains …

被引用次数：11 相关文章所有 2 个版本

Robust TTS

X Tan - Neural Text-to-Speech Synthesis, 2023 - Springer

In this chapter, we introduce how to address the robustness issues in TTS. We summarize
some popular techniques to improve robustness, including enhancing attention, replacing …

[引用][C] tts-tutorial/survey

G Repo, GPK Tool, X Tan, T Qin, F Soong, TY Liu

[引用][C] tts-tutorial/survey

X Tan, T Qin, F Soong, TY Liu

高级搜索

QQ 群