A hybrid text-to-speech system that combines concatenative and statistical synthesis units

S Tiomkin, D Malah, S Shechtman… - IEEE transactions on …, 2010 - ieeexplore.ieee.org
Concatenative synthesis and statistical synthesis are the two main approaches to text-to-
speech (TTS) synthesis. Concatenative TTS (CTTS) stores natural speech features …

Wrapped Gaussian mixture models for modeling and high-rate quantization of phase data of speech

Y Agiomyrgiannakis, Y Stylianou - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
The harmonic representation of speech signals has found many applications in speech
processing. This paper presents a novel statistical approach to model the behavior of …

[PDF][PDF] Voice transformation-based spoofing of text-dependent speaker verification systems.

Z Kons, H Aronowitz - INTERSPEECH, 2013 - isca-archive.org
In the past few years state-of-the-art text-dependent speaker verification technology has
improved significantly in terms of the ability to accept target speakers and reject imposters …

Corpus-based speech synthesis

T Dutoit - Springer Handbook of Speech Processing, 2008 - Springer
In this chapter, we present the main trends in corpus-based speech synthesis, assuming a
stream of phonemes and prosodic target as input. From the early diphone-based speech …

Modern methods of speech synthesis

D O'Shaughnessy - IEEE Circuits and Systems Magazine, 2007 - ieeexplore.ieee.org
We have examined various aspects of how to produce synthetic speech. There are
numerous applications for such synthetic speech, mostly when starting from a textual input …

High quality sinusoidal modeling of wideband speech for the purposes of speech synthesis and modification

D Chazan, R Hoory, A Sagi… - … on Acoustics Speech …, 2006 - ieeexplore.ieee.org
This paper describes an efficient sinusoidal modeling framework for high quality wide band
(WB) speech synthesis and modification. This technique may serve as a basis for speech …

Reducing footprint of unit selection based text-to-speech system using compressed sensing and sparse representation

P Sharma, V Abrol, AK Sao - Computer Speech & Language, 2018 - Elsevier
In this paper, we have explored the framework of compressed sensing (CS) and sparse
representation (SR) to reduce the footprint of unit selection based speech synthesis (USS) …

[PDF][PDF] Sinusoidal model parameterization for HMM-based TTS system.

S Shechtman, A Sorin - INTERSPEECH, 2010 - isca-archive.org
A sinusoidal representation of speech is an alternative to the source-filter model. It is widely
used in speech coding and unit-selection TTS, but is less common in statistical TTS …

[PDF][PDF] On feature extraction for voice pathology detection from speech signals

Z Kons, A Satt, R Hoory, V Uloza… - Proceedings of the …, 2011 - events.eventact.com
Reliable, automatic and objective detector of pathological voice disorders from speech
signals is a long sought-for tool, by voice clinicians as well as by general practitioners. Such …

Experiments on reducing footprint of unit selection TTS system

Z Hanzlíček, J Matoušek, D Tihelka - International Conference on Text …, 2013 - Springer
The quality of speech produced by modern TTS systems utilizing the unit selection approach
is very high. However, the system demands are enormous. The storage requirements are …