A situational analysis of current speech-synthesis systems for child voices: A scoping review of qualitative and quantitative evidence

C Terblanche, M Harty, M Pascoe, BV Tucker - Applied Sciences, 2022 - mdpi.com
(1) Background: Speech synthesis has customarily focused on adult speech, but with the
rapid development of speech-synthesis technology, it is now possible to create child voices …

The development of synthetic child speech in three South African languages

C Terblanche, TT Schnoor, M Harty… - Augmentative and …, 2024 - Taylor & Francis
It is well-known that children with expressive communication difficulties have the right to
communicate, but they should also have the right to do so in whichever language they …

Do you like my voice? Stakeholder perspectives about the acceptability of synthetic child voices in three South African languages

CC Terblanche, M Pascoe… - International Journal of …, 2025 - Wiley Online Library
Background: There is a global need for synthetic speech development in multiple languages
and dialects, as many children who cannot communicate using their natural voice struggle to …

A comparison of speaker-based and utterance-based data selection for text-to-speech synthesis

KZ Lee, E Cooper - Interspeech 2018, 2018 - par.nsf.gov
Building on previous work in subset selection of training data for text-to-speech (TTS), this
work compares speaker-level and utterance-level selection of TTS training data, using …

Automated Child Voice Generation: Methodology and Implementation

S Alwaisi, MS Al-Radhi… - … Conference on Speech …, 2023 - ieeexplore.ieee.org
Significant progress has been made in the development of text-to-speech (TTS) models;
however, synthesizing child speech remains a challenging task. Limited research has been …

Adaptation and frontend features to improve naturalness in found-data synthesis

E Cooper, J Hirschberg - Proceedings of Speech Prosody 2018, 2018 - par.nsf.gov
We compare two approaches for training statistical parametric voices that make use of
acoustic and prosodic features at the utterance level with the aim of improving naturalness of …