Applying voice conversion to concatenative singing-voice synthesis.

F Villavicencio, J Bonada - Interspeech, 2010 - isca-archive.org
This work addresses the application of Voice Conversion to singing-voice. The GMM-based
approach was applied to VOCALOID, a concatenative singing synthesizer, to perform singer …

Real-time audio-to-score alignment of singing voice based on melody and lyric information

R Gong, P Cuvillier, N Obin, A Cont - Interspeech, 2015 - hal.science
Singing voice is specific in music: a vocal performance conveys both music (melody/pitch)
and lyrics (text/phoneme) content. This paper aims at exploiting the advantages of melody …

Analyse acoustique de la voix émotionnelle de locuteurs lors d'une interaction humain-robot

M Tahon - 2012 - theses.hal.science
My thesis work focuses on the emotional voice in a human-robot interaction context.
In a realistic interaction, we define at least four main types …

Efficient pitch estimation on natural opera-singing by a spectral correlation based strategy

F Villavicencio, J Bonada, J Yamagish… - … Processing Society of …, 2015 - ipsj.ixsq.nii.ac.jp
We present in this work a study for robust pitch estimation on signals presenting wide-range
pitch content, as is the case of opera singing. Aiming to perform automatic features …

From signal representation to representation learning: structured modeling of speech signals

N Obin - 2023 - hal.science
This habilitation presents the last ten years of my research on the structured modelling of
speech signals. Speech, as an oral language, constitutes the most elaborate communication …

Structured sparse spectral transforms and structural measures for voice conversion

Y Zhao, M Kuruvilla-Dugdale… - IEEE/ACM transactions on …, 2018 - ieeexplore.ieee.org
We investigate a structured sparse spectral transform method for voice conversion (VC) to
perform frequency warping and spectral shaping simultaneously on high-dimensional (D) …

Détection des états affectifs lors d'interactions parlées: robustesse des indices non verbaux [Automatic in-voice affective state detection in spontaneous speech …

L Devillers, M Tahon, MA Sehili… - Traitement Automatique …, 2014 - aclanthology.org
In a human-machine interaction context, systems that detect emotions in the voice
must be robust to variability and computationally efficient. This article …

Analysis and synthesis of strong vocal expressions: Extension and application of audio texture features to singing voice

H Kawahara, M Morise - 2012 IEEE International Conference …, 2012 - ieeexplore.ieee.org
Realistic reconstruction and manipulation of strong vocal expressions found in singing
voices is a challenging and exciting topic. A speech analysis, modification and resynthesis …

Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase …

H Kawahara, M Morise, T Toda, H Banno… - … Annual Conference of …, 2014 - isca-archive.org
A group delay-based excitation source analysis and design method is introduced for
extension of TANDEM-STRAIGHT, a speech analysis, modification and synthesis system …

Between physics and perception: Signal models for high level audio processing

A Roebel - Digital Audio Effects (DAFx), 2010 - hal.science
The use of signal models is one of the key factors enabling us to establish high quality signal
transformation algorithms with intuitive high level control parameters. In the present article …