COVAREP—A collaborative voice analysis repository for speech technologies

G Degottex, J Kane, T Drugman… - … on acoustics, speech …, 2014 - ieeexplore.ieee.org
Speech processing algorithms are often developed demonstrating improvements over the
state-of-the-art, but sometimes at the cost of high complexity. This makes algorithm …

Improving singing voice separation using deep u-net and wave-u-net with data augmentation

A Cohen-Hadria, A Roebel… - 2019 27th European …, 2019 - ieeexplore.ieee.org
State-of-the-art singing voice separation is based on deep learning making use of CNN
structures with skip connections (like U-Net model, Wave-U-Net model, or MSDENSELSTM) …

Neural vocoding for singing and speaking voices with the multi-band excited wavenet

A Roebel, F Bous - Information, 2022 - mdpi.com
The use of the mel spectrogram as a signal parameterization for voice generation is quite
recent and linked to the development of neural vocoders. These are deep neural networks …

Mixed source model and its adapted vocal tract filter estimate for voice transformation and synthesis

G Degottex, P Lanchantin, A Roebel, X Rodet - Speech Communication, 2013 - Elsevier
In current methods for voice transformation and speech synthesis, the vocal tract filter is
usually assumed to be excited by a flat amplitude spectrum. In this article, we present a …

Improved estimation of the amplitude envelope of time-domain signals using true envelope cepstral smoothing

M Caetano, X Rodet - 2011 IEEE International Conference on …, 2011 - ieeexplore.ieee.org
The amplitude modulations of musical instrument sounds and speech are important
perceptual cues. Accurate estimation of the amplitude, or equivalently energy, envelope of a …

Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis

MS Al-Radhi, TG Csapó, G Németh - Interspeech, 2017 - isca-archive.org
In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis. Previous work has shown the advantages of adding …

Synthesis and expressive transformation of singing voice

L Ardaillon - 2017 - hal.science
This thesis aimed at conducting research on the synthesis and expressive transformations of
the singing voice, towards the development of a high-quality synthesizer that can generate a …

A shape-invariant phase vocoder for speech transformation

A Roebel - Digital Audio Effects (DAFx), 2010 - hal.science
This paper proposes a new method for shape invariant real-time modification of speech
signals. The method can be understood as a frequency domain SOLA algorithm that is us …

Glottal source and vocal-tract separation

G Degottex - 2010 - theses.hal.science
This study addresses the problem of inverting a voice production model to retrieve, for a
given recording, a representation of the sound source which is generated at the glottis level …

A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus

MS Al-Radhi, O Abdo, TG Csapó, S Abdou… - Computer Speech & …, 2020 - Elsevier
In this paper, we present an extension of a novel continuous residual-based vocoder for
statistical parametric speech synthesis by addressing two objectives. First, because the …