JSUT and JVS: Free Japanese voice corpora for accelerating speech synthesis research

S Takamichi, R Sonobe, K Mitsui, Y Saito… - Acoustical Science …, 2020 - jstage.jst.go.jp
In this paper, we develop two corpora for speech synthesis research. Thanks to
improvements in machine learning techniques, including deep learning, speech synthesis is …

JVS corpus: free Japanese multi-speaker voice corpus

S Takamichi, K Mitsui, Y Saito, T Koriyama… - arXiv preprint arXiv …, 2019 - arxiv.org
Thanks to improvements in machine learning techniques, including deep learning, speech
synthesis is becoming a machine learning task. To accelerate speech synthesis research …

JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis

R Sonobe, S Takamichi, H Saruwatari - arXiv preprint arXiv:1711.00354, 2017 - arxiv.org
Thanks to improvements in machine learning techniques including deep learning, a free
large-scale speech corpus that can be shared between academic institutions and …

Text-to-speech synthesis

Y Shiga, J Ni, K Tachibana, T Okamoto - Speech-to-Speech Translation, 2020 - Springer
The recent progress of text-to-speech synthesis (TTS) technology has allowed computers to
read any written text aloud with voice that is artificial but almost indistinguishable from real …

A review of deep learning based speech synthesis

Y Ning, S He, Z Wu, C Xing, LJ Zhang - Applied Sciences, 2019 - mdpi.com
Speech synthesis, also known as text-to-speech (TTS), has attracted increasingly more
attention. Recent advances on speech synthesis are overwhelmingly contributed by deep …

Review of end-to-end speech synthesis technology based on deep learning

Z Mu, X Yang, Y Dong - arXiv preprint arXiv:2104.09995, 2021 - arxiv.org
As an indispensable part of modern human-computer interaction system, speech synthesis
technology helps users get the output of intelligent machine more easily and intuitively, thus …

[PDF][PDF] Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis

T Fujimoto, K Hashimoto, K Oura… - 10th ISCA Speech …, 2019 - isca-archive.org
We investigate the impact of input linguistic feature representation on Japanese end-to-end
speech synthesis. An end-toend speech synthesis system, which directly generates natural …

[PDF][PDF] Text to speech synthesis: a systematic review, deep learning based architecture and future research direction

F Khanam, FA Munmun, NA Ritu, AK Saha… - Journal of Advances in …, 2022 - academia.edu
Text to Speech (TTS) synthesis is a process of translating natural language text into speech.
Pieces of recorded speech generate synthesized speech and a database is maintained for …

[PDF][PDF] Back to the Future: Extending the Blizzard Challenge 2013.

S Le Maguer, S King, N Harte - INTERSPEECH, 2022 - researchgate.net
Nowadays, speech synthesis technology is synonymous with the use of Deep Learning. To
understand more about how synthesis systems have progressed with the advent of Deep …

Mongolian text-to-speech system based on deep neural network

R Liu, F Bao, G Gao, Y Wang - … 2017, Lianyungang, China, October 11–13 …, 2018 - Springer
Abstract Recently, Deep Neural Network (DNN), which is a feed-forward artificial neural
network with many hidden layers, has opened a new research direction for Speech …