On cepstral and all-pole based spectral envelope modeling with unknown model order

G Degottex, J Kane, T Drugman… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org

Speech processing algorithms are often developed demonstrating improvements over the
state-of-the-art, but sometimes at the cost of high complexity. This makes algorithm …

Cited by 865 Related articles All 17 versions

[PDF] arxiv.org

Improving singing voice separation using deep u-net and wave-u-net with data augmentation

A Cohen-Hadria, A Roebel… - 2019 27th European …, 2019 - ieeexplore.ieee.org

State-of-the-art singing voice separation is based on deep learning making use of CNN
structures with skip connections (like U-Net model, Wave-U-Net model, or MSDENSELSTM) …

Cited by 43 Related articles All 9 versions

[PDF] mdpi.com

Neural vocoding for singing and speaking voices with the multi-band excited wavenet

A Roebel, F Bous - Information, 2022 - mdpi.com

The use of the mel spectrogram as a signal parameterization for voice generation is quite
recent and linked to the development of neural vocoders. These are deep neural networks …

Cited by 12 Related articles All 5 versions

[PDF] ircam.fr

Mixed source model and its adapted vocal tract filter estimate for voice transformation and synthesis

G Degottex, P Lanchantin, A Roebel, X Rodet - Speech Communication, 2013 - Elsevier

In current methods for voice transformation and speech synthesis, the vocal tract filter is
usually assumed to be excited by a flat amplitude spectrum. In this article, we present a …

Cited by 59 Related articles All 8 versions

[PDF] hal.science

Improved estimation of the amplitude envelope of time-domain signals using true envelope cepstral smoothing

M Caetano, X Rodet - 2011 IEEE International Conference on …, 2011 - ieeexplore.ieee.org

The amplitude modulations of musical instrument sounds and speech are important
perceptual cues. Accurate estimation of the amplitude, or equivalently energy, envelope of a …

Cited by 49 Related articles All 11 versions

[PDF] isca-archive.org

[PDF] Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis.

MS Al-Radhi, TG Csapó, G Németh - Interspeech, 2017 - isca-archive.org

In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis. Previous work has shown the advantages of adding …

Cited by 26 Related articles All 9 versions

[PDF] hal.science

Synthesis and expressive transformation of singing voice

L Ardaillon - 2017 - hal.science

This thesis aimed at conducting research on the synthesis and expressive transformations of
the singing voice, towards the development of a high-quality synthesizer that can generate a …

Cited by 24 Related articles All 6 versions

[PDF] hal.science

A shape-invariant phase vocoder for speech transformation

A Roebel - Digital Audio Effects (DAFx), 2010 - hal.science

This paper proposes a new method for shape invariant real-time modification of speech
signals. The method can be understood as a frequency domain SOLA algorithm that is us …

Cited by 45 Related articles All 8 versions

[PDF] hal.science

Glottal source and vocal-tract separation

G Degottex - 2010 - theses.hal.science

This study addresses the problem of inverting a voice production model to retrieve, for a
given recording, a representation of the sound source which is generated at the glottis level …

Cited by 45 Related articles All 11 versions

A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus

MS Al-Radhi, O Abdo, TG Csapó, S Abdou… - Computer Speech & …, 2020 - Elsevier

In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis by addressing two objectives. First, because the …

Cited by 15 Related articles All 2 versions
