This paper investigates using objective quality measures to evaluate speaker adaptation performance in HMM-based speech synthesis. We compare several objective measures to …
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a rapid adaptation technique for statistical parametric speech synthesis. VTLN …
Z Khan, L Wihlborg, C Valentini-Botinhao… - Proc. Interspeech …, 2023 - isca-archive.org
In this paper, we present a framework for formant-controllable neural text-to-speech. We train a model that predicts formant frequencies which then condition melspectrogram …
When a synthetic voice represents a human being, its capability to facilitate social interaction becomes paramount. Extralinguistic aspects of the synthetic voice, such as age, gender …
Statistical speech synthesis (SSS) systems have the ability to adapt to a target speaker with a couple of minutes of adaptation data. Developing adaptation algorithms to further reduce …
Human voice provides the means for verbal communication and forms a part of personal identity. Unfortunately, not every individual can produce speech output. In clinical …