Merlin: An open source neural network speech synthesis system

Z Wu, O Watts, S King - 9th ISCA Speech Synthesis Workshop, 2016 - research.ed.ac.uk
We introduce the Merlin speech synthesis toolkit for neural network-based speech synthesis.
The system takes linguistic features as input, and employs neural networks to predict …

[BOOK] Biomimetics: nature-based innovation

Y Bar-Cohen - 2016 - books.google.com
A review of the current state of the art of biomimetics, this book documents key biological
solutions that provide a model for innovations in engineering and science. Leading experts …

Expressive speech synthesis: a review

D Govind, SRM Prasanna - International Journal of Speech Technology, 2013 - Springer
The objective of the present work is to provide a detailed review of expressive speech
synthesis (ESS). Among various approaches for ESS, the present paper focuses the …

Social fidelity in virtual agents: Impacts on presence and learning

AM Sinatra, KA Pollard, BT Files, AH Oiknine… - Computers in Human …, 2021 - Elsevier
Teaching and training are increasingly moving from real world venues to computerized
environments, with human instructors often being replaced or joined by virtual pedagogical …

Robust speaker-adaptive HMM-based text-to-speech synthesis

J Yamagishi, T Nose, H Zen, ZH Ling… - … on Audio, Speech …, 2009 - ieeexplore.ieee.org
This paper describes a speaker-adaptive HMM-based speech synthesis system. The new
system, called "HTS-2007," employs speaker adaptation (CSMAPLR+MAP) …

Language-agnostic meta-learning for low-resource text-to-speech with articulatory features

F Lux, NT Vu - arXiv preprint arXiv:2203.03191, 2022 - arxiv.org
While neural text-to-speech systems perform remarkably well in high-resource scenarios,
they cannot be applied to the majority of the over 6,000 spoken languages in the world due …

Requirements and motivations of low-resource speech synthesis for language revitalization

A Pine, D Wells, N Brinklow, P Littell… - Proceedings of the …, 2022 - aclanthology.org
This paper describes the motivation and development of speech synthesis systems for the
purposes of language revitalization. By building speech synthesis systems for three …

Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus

K Richmond, P Hoole, S King - Twelfth Annual Conference of the …, 2011 - researchgate.net
This paper serves as an initial announcement of the availability of a corpus of articulatory
data called mngu0. This corpus will ultimately consist of a collection of multiple sources of …

Towards a neurocomputational model of speech production and perception

BJ Kröger, J Kannampuzha… - Speech …, 2009 - Elsevier
The limitation in performance of current speech synthesis and speech recognition systems
may result from the fact that these systems are not designed with respect to the human …

Siri on-device deep learning-guided unit selection text-to-speech system

T Capes, P Coles, A Conkie, L Golipour… - Interspeech, 2017 - academia.edu
This paper describes Apple's hybrid unit selection speech synthesis system, which provides
the voices for Siri with the requirement of naturalness, personality and expressivity. It has …