Small footprint concatenative text-to-speech synthesis system using complex spectral envelope...

S Tiomkin, D Malah, S Shechtman… - IEEE transactions on …, 2010 - ieeexplore.ieee.org

Concatenative synthesis and statistical synthesis are the two main approaches to text-to-
speech (TTS) synthesis. Concatenative TTS (CTTS) stores natural speech features …

被引用次数：72 相关文章所有 11 个版本

[PDF] researchgate.net

Wrapped Gaussian mixture models for modeling and high-rate quantization of phase data of speech

Y Agiomyrgiannakis, Y Stylianou - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org

The harmonic representation of speech signals has found many applications in speech
processing. This paper presents a novel statistical approach to model the behavior of …

被引用次数：62 相关文章所有 10 个版本

[PDF] isca-archive.org

[PDF][PDF] Voice transformation-based spoofing of text-dependent speaker verification systems.

Z Kons, H Aronowitz - INTERSPEECH, 2013 - isca-archive.org

In the past few years state-of-the-art text-dependent speaker verification technology has
improved significantly in terms of the ability to accept target speakers and reject imposters …

被引用次数：40 相关文章所有 5 个版本

[PDF] researchgate.net

Corpus-based speech synthesis

T Dutoit - Springer Handbook of Speech Processing, 2008 - Springer

In this chapter, we present the main trends in corpus-based speech synthesis, assuming a
stream of phonemes and prosodic target as input. From the early diphone-based speech …

被引用次数：49 相关文章所有 5 个版本

Modern methods of speech synthesis

D O'Shaughnessy - IEEE Circuits and Systems Magazine, 2007 - ieeexplore.ieee.org

We have examined various aspects of how to produce synthetic speech. There are
numerous applications for such synthetic speech, mostly when starting from a textual input …

被引用次数：37 相关文章所有 2 个版本

[PDF] academia.edu

High quality sinusoidal modeling of wideband speech for the purposes of speech synthesis and modification

D Chazan, R Hoory, A Sagi… - … on Acoustics Speech …, 2006 - ieeexplore.ieee.org

This paper describes an efficient sinusoidal modeling framework for high quality wide band
(WB) speech synthesis and modification. This technique may serve as a basis for speech …

被引用次数：39 相关文章所有 7 个版本

Reducing footprint of unit selection based text-to-speech system using compressed sensing and sparse representation

P Sharma, V Abrol, AK Sao - Computer Speech & Language, 2018 - Elsevier

In this paper, we have explored the framework of compressed sensing (CS) and sparse
representation (SR) to reduce the footprint of unit selection based speech synthesis (USS) …

被引用次数：9 相关文章所有 2 个版本

[PDF] isca-archive.org

[PDF][PDF] Sinusoidal model parameterization for HMM-based TTS system.

S Shechtman, A Sorin - INTERSPEECH, 2010 - isca-archive.org

A sinusoidal representation of speech is an alternative to the source-filter model. It is widely
used in speech coding and unit-selection TTS, but is less common in statistical TTS …

被引用次数：17 相关文章所有 7 个版本

[PDF] eventact.com

[PDF][PDF] On feature extraction for voice pathology detection from speech signals

Z Kons, A Satt, R Hoory, V Uloza… - Proceedings of the …, 2011 - events.eventact.com

Reliable, automatic and objective detector of pathological voice disorders from speech
signals is a long sought-for tool, by voice clinicians as well as by general practitioners. Such …

被引用次数：13 相关文章所有 5 个版本

Experiments on reducing footprint of unit selection TTS system

Z Hanzlíček, J Matoušek, D Tihelka - International Conference on Text …, 2013 - Springer

The quality of speech produced by modern TTS systems utilizing the unit selection approach
is very high. However, the system demands are enormous. The storage requirements are …

被引用次数：10 相关文章所有 3 个版本

高级搜索

QQ 群