Conventional and contemporary approaches used in text to speech synthesis: A review

N Kaur, P Singh - Artificial Intelligence Review, 2023 - Springer
Nowadays speech synthesis or text to speech (TTS), an ability of system to produce human
like natural sounding voice from the written text, is gaining popularity in the field of speech …

A deep learning approaches in text-to-speech system: a systematic review and recent research perspective

Y Kumar, A Koul, C Singh - Multimedia Tools and Applications, 2023 - Springer
Text-to-speech systems (TTS) have come a long way in the last decade and are now a
popular research topic for creating various human-computer interaction systems. Although, a …

[HTML][HTML] A review of deep learning based speech synthesis

Y Ning, S He, Z Wu, C Xing, LJ Zhang - Applied Sciences, 2019 - mdpi.com
Speech synthesis, also known as text-to-speech (TTS), has attracted increasingly more
attention. Recent advances on speech synthesis are overwhelmingly contributed by deep …

Review of end-to-end speech synthesis technology based on deep learning

Z Mu, X Yang, Y Dong - arXiv preprint arXiv:2104.09995, 2021 - arxiv.org
As an indispensable part of modern human-computer interaction system, speech synthesis
technology helps users get the output of intelligent machine more easily and intuitively, thus …

Dynamic prosody generation for speech synthesis using linguistics-driven acoustic embedding selection

S Tyagi, M Nicolis, J Rohnke, T Drugman… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent advances in Text-to-Speech (TTS) have improved quality and naturalness to near-
human capabilities when considering isolated sentences. But something which is still …

[PDF][PDF] Deep learning based NLP techniques in text to speech synthesis for communication recognition

EEB Adam - Journal of Soft Computing Paradigm (JSCP), 2020 - researchgate.net
The computer system is developing the model for speech synthesis of various aspects for
natural language processing. The speech synthesis explores by articulatory, formant and …

Hierarchical prosody modeling for non-autoregressive speech synthesis

CM Chien, H Lee - 2021 IEEE Spoken Language Technology …, 2021 - ieeexplore.ieee.org
Prosody modeling is an essential component in modern text-to-speech (TTS) frameworks.
By explicitly providing prosody features to the TTS model, the style of synthesized utterances …

Improving prosody modelling with cross-utterance bert embeddings for end-to-end speech synthesis

G Xu, W Song, Z Zhang, C Zhang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Although speech prosody is related to the linguistic information up to the discourse structure,
most text-to-speech (TTS) systems only take into account the information within each …

Neural speech synthesis with transformer network

N Li, S Liu, Y Liu, S Zhao, M Liu - … of the AAAI conference on artificial …, 2019 - ojs.aaai.org
Although end-to-end neural text-to-speech (TTS) methods (such as Tacotron2) are proposed
and achieve state-of-theart performance, they still suffer from two problems: 1) low efficiency …

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …