COVAREP—A collaborative voice analysis repository for speech technologies

G Degottex, J Kane, T Drugman… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org
Speech processing algorithms are often developed demonstrating improvements over the
state-of-the-art, but sometimes at the cost of high complexity. This makes algorithm …

Continuous probabilistic transform for voice conversion

Y Stylianou, O Cappé… - IEEE Transactions on …, 1998 - ieeexplore.ieee.org
Voice conversion, as considered in this paper, is defined as modifying the speech signal of
one speaker (source speaker) so that it sounds as if it had been pronounced by a different …

Neural source-filter waveform models for statistical parametric speech synthesis

X Wang, S Takaki, J Yamagishi - IEEE/ACM Transactions on …, 2019 - ieeexplore.ieee.org
Neural waveform models have demonstrated better performance than conventional
vocoders for statistical parametric speech synthesis. One of the best models, called …

Spectral voice conversion for text-to-speech synthesis

A Kain, MW Macon - … of the 1998 IEEE International Conference …, 1998 - ieeexplore.ieee.org
A new voice conversion algorithm that modifies a source speaker's speech to sound as if
produced by a target speaker is presented. It is applied to a residual-excited LPC text-to …

Evaluation of speaker verification security and detection of HMM-based synthetic speech

PL De Leon, M Pucher, J Yamagishi… - … on Audio, Speech …, 2012 - ieeexplore.ieee.org
In this paper, we evaluate the vulnerability of speaker verification (SV) systems to synthetic
speech. The SV systems are based on either the Gaussian mixture model–universal …

Applying the harmonic plus noise model in concatenative speech synthesis

Y Stylianou - IEEE Transactions on speech and audio …, 2001 - ieeexplore.ieee.org
This paper describes the application of the harmonic plus noise model (HNM) for
concatenative text-to-speech (TTS) synthesis. In the context of HNM, speech signals are …

Voice pathology detection based eon short-term jitter estimations in running speech

M Vasilakis, Y Stylianou - Folia Phoniatrica et Logopaedica, 2009 - karger.com
In this paper, we investigate the use of jitter estimation over short time intervals (short-term
jitter) for voice pathology detection in the case of running or continuous speech. Short-term …

Voice conversion based on weighted frequency warping

D Erro, A Moreno, A Bonafonte - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org
Any modification applied to speech signals has an impact on their perceptual quality. In
particular, voice conversion to modify a source voice so that it is perceived as a specific …

[图书][B] Audio bandwidth extension: application of psychoacoustics, signal processing and loudspeaker design

E Larsen, RM Aarts - 2005 - books.google.com
Bandwidth extension (BWE) refers to various methods that increase either the perceived or
real frequency spectrum (bandwidth) of audio signals. Such frequency extension is …

Narrowband to wideband conversion of speech using GMM based transformation

KY Park, HS Kim - … conference on acoustics, speech, and signal …, 2000 - ieeexplore.ieee.org
Reconstruction of wideband speech from its narrowband version is an attractive issue, since
it can enhance the speech quality without modifying the existing communication networks …