Harmonic plus noise models for speech, combined with statistical methods, for speech and...

G Degottex, J Kane, T Drugman… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org

Speech processing algorithms are often developed demonstrating improvements over the
state-of-the-art, but sometimes at the cost of high complexity. This makes algorithm …

被引用次数：754 相关文章所有 17 个版本

[PDF] columbia.edu

Continuous probabilistic transform for voice conversion

Y Stylianou, O Cappé… - IEEE Transactions on …, 1998 - ieeexplore.ieee.org

Voice conversion, as considered in this paper, is defined as modifying the speech signal of
one speaker (source speaker) so that it sounds as if it had been pronounced by a different …

被引用次数：1379 相关文章所有 12 个版本

[PDF] arxiv.org

Neural source-filter waveform models for statistical parametric speech synthesis

X Wang, S Takaki, J Yamagishi - IEEE/ACM Transactions on …, 2019 - ieeexplore.ieee.org

Neural waveform models have demonstrated better performance than conventional
vocoders for statistical parametric speech synthesis. One of the best models, called …

被引用次数：150 相关文章所有 6 个版本

[PDF] mikemacon.com

Spectral voice conversion for text-to-speech synthesis

A Kain, MW Macon - … of the 1998 IEEE International Conference …, 1998 - ieeexplore.ieee.org

A new voice conversion algorithm that modifies a source speaker's speech to sound as if
produced by a target speaker is presented. It is applied to a residual-excited LPC text-to …

被引用次数：887 相关文章所有 8 个版本

[PDF] ed.ac.uk

Evaluation of speaker verification security and detection of HMM-based synthetic speech

PL De Leon, M Pucher, J Yamagishi… - … on Audio, Speech …, 2012 - ieeexplore.ieee.org

In this paper, we evaluate the vulnerability of speaker verification (SV) systems to synthetic
speech. The SV systems are based on either the Gaussian mixture model–universal …

被引用次数：283 相关文章所有 6 个版本

[PDF] columbia.edu

Applying the harmonic plus noise model in concatenative speech synthesis

Y Stylianou - IEEE Transactions on speech and audio …, 2001 - ieeexplore.ieee.org

This paper describes the application of the harmonic plus noise model (HNM) for
concatenative text-to-speech (TTS) synthesis. In the context of HNM, speech signals are …

被引用次数：496 相关文章所有 13 个版本

[PDF] karger.com

Voice pathology detection based eon short-term jitter estimations in running speech

M Vasilakis, Y Stylianou - Folia Phoniatrica et Logopaedica, 2009 - karger.com

In this paper, we investigate the use of jitter estimation over short time intervals (short-term
jitter) for voice pathology detection in the case of running or continuous speech. Short-term …

被引用次数：80 相关文章所有 9 个版本

Voice conversion based on weighted frequency warping

D Erro, A Moreno, A Bonafonte - IEEE Transactions on Audio …, 2009 - ieeexplore.ieee.org

Any modification applied to speech signals has an impact on their perceptual quality. In
particular, voice conversion to modify a source voice so that it is perceived as a specific …

被引用次数：216 相关文章所有 4 个版本

[图书][B] Audio bandwidth extension: application of psychoacoustics, signal processing and loudspeaker design

E Larsen, RM Aarts - 2005 - books.google.com

Bandwidth extension (BWE) refers to various methods that increase either the perceived or
real frequency spectrum (bandwidth) of audio signals. Such frequency extension is …

被引用次数：263 相关文章所有 5 个版本

Narrowband to wideband conversion of speech using GMM based transformation

KY Park, HS Kim - … conference on acoustics, speech, and signal …, 2000 - ieeexplore.ieee.org

Reconstruction of wideband speech from its narrowband version is an attractive issue, since
it can enhance the speech quality without modifying the existing communication networks …

被引用次数：254 相关文章所有 5 个版本

高级搜索

QQ 群