A situational analysis of current speech-synthesis systems for child voices: A scoping review of qualitative and quantitative evidence

C Terblanche, M Harty, M Pascoe, BV Tucker - Applied Sciences, 2022 - mdpi.com
(1) Background: Speech synthesis has customarily focused on adult speech, but with the
rapid development of speech-synthesis technology, it is now possible to create child voices …

A text-to-speech pipeline, evaluation methodology, and initial fine-tuning results for child speech synthesis

R Jain, MY Yiwere, D Bigioi, P Corcoran… - IEEE Access, 2022 - ieeexplore.ieee.org
Speech synthesis has come a long way as current text-to-speech (TTS) models can now
generate natural human-sounding speech. However, most of the TTS research focuses on …

Towards personalized speech synthesis for augmentative and alternative communication

T Mills, HT Bunnell, R Patel - Augmentative and Alternative …, 2014 - Taylor & Francis
Text-to-speech options on augmentative and alternative communication (AAC) devices are
limited. Often, several individuals in a group setting use the same synthetic voice. This lack …

State-of-the-art in TEL to support social communication skill development in children with autism: a multi-disciplinary review

K Avramides, S Bernardini, ME Foster… - International …, 2012 - inderscienceonline.com
The paper reviews state-of-the-art in Technology Enhanced Learning (TEL) to support social
communication skill development in children with autism. We identify the driving research …

Voice-enabled assistive robots for handling autism spectrum conditions: an examination of the role of prosody

E Marchi, F Ringeval, B Schuller - Speech and automata in health …, 2014 - library.oapen.org
Autism spectrum conditions (ASC) are neurodevelopmental conditions, characterized by
impairments in social interaction, communication (ie, verbal and non-verbal language), and …

Adapting pretrained models for adult to child voice conversion

PN Sudro, A Ragni, T Hain - 2023 31st European Signal …, 2023 - ieeexplore.ieee.org
Due to widespread lack of parallel data for adult to child voice conversion (VC), non parallel
VC techniques have grown in popularity. Methods, such as encoder-decoder model, have …

Enhancement of cleft palate speech using temporal and spectral processing

PN Sudro, SRM Prasanna - Speech Communication, 2020 - Elsevier
The speech of the individuals with cleft palate (CP) is generally characterized by the
presence of abnormal nasal resonances during the production of voiced sounds, primarily in …

Improved particle swarm optimization and applications to hidden markov model and ackley function

S Motiian, H Soltanian-Zadeh - 2011 IEEE International …, 2011 - ieeexplore.ieee.org
Particle Swarm Optimization (PSO) is an algorithm based on social intelligence, utilized in
many fields of optimization. In applications like speech recognition, due to existence of high …

Polyglot speech synthesis based on cross-lingual frame selection using auditory and articulatory features

CP Chen, YC Huang, CH Wu… - IEEE/ACM Transactions …, 2014 - ieeexplore.ieee.org
In this paper, an approach for polyglot speech synthesis based on cross-lingual frame
selection is proposed. This method requires only mono-lingual speech data of different …

Personalized spectral and prosody conversion using frame-based codeword distribution and adaptive CRF

YC Huang, CH Wu, YT Chao - IEEE transactions on audio …, 2012 - ieeexplore.ieee.org
This study proposes a voice conversion-based approach to personalized text-to-speech
(TTS) synthesis. The conversion functions, trained using a small parallel corpus with source …