Applying improved spectral modeling for high quality voice conversion

F Villavicencio, J Bonada - Interspeech, 2010 - isca-archive.org

This work address the application of Voice Conversion to singing-voice. The GMM-based
approach was applied to VOCALOID, a concatenative singing synthesizer, to perform singer …

被引用次数：88 相关文章所有 10 个版本

[PDF] hal.science

Real-time audio-to-score alignment of singing voice based on melody and lyric information

R Gong, P Cuvillier, N Obin, A Cont - Interspeech, 2015 - hal.science

Singing voice is specific in music: a vocal performance conveys both music (melody/pitch)
and lyrics (text/phoneme) content. This paper aims at exploiting the advantages of melody …

被引用次数：29 相关文章所有 15 个版本

[PDF] hal.science

Analyse acoustique de la voix émotionnelle de locuteurs lors d'une interaction humain-robot

M Tahon - 2012 - theses.hal.science

Mes travaux de thèse s' intéressent à la voix émotionnelle dans un contexte d'interaction
humain-robot. Dans une interaction réaliste, nous définissons au moins quatre grands types …

被引用次数：17 相关文章所有 9 个版本

[PDF] nii.ac.jp

Efficient pitch estimation on natural opera-singing by a spectral correlation based strategy

F Villavicencio, J Bonada, J Yamagish… - … Processing Society of …, 2015 - ipsj.ixsq.nii.ac.jp

We present in this work a study for robust pitch estimation on signals presenting wide-range
pitch content, as is the case of opera singing. Aiming to perform automatic features …

被引用次数：11 相关文章所有 10 个版本

[PDF] hal.science

From signal representation to representation learning: structured modeling of speech signals

N Obin - 2023 - hal.science

This habilitation presents the last ten years of my research on the structured modelling of
speech signals. Speech, as an oral language, constitutes the most elaborate communication …

被引用次数：1 相关文章所有 5 个版本

[HTML] nih.gov

Structured sparse spectral transforms and structural measures for voice conversion

Y Zhao, M Kuruvilla-Dugdale… - IEEE/ACM transactions on …, 2018 - ieeexplore.ieee.org

We investigate a structured sparse spectral transform method for voice conversion (VC) to
perform frequency warping and spectral shaping simultaneously on high-dimensional (D) …

被引用次数：8 相关文章所有 7 个版本

[PDF] aclanthology.org

[PDF][PDF] Détection des états affectifs lors d'interactions parlées: robustesse des indices non verbaux [Automatic in-voice affective state detection in spontaneous speech …

L Devillers, M Tahon, MA Sehili… - Traitement Automatique …, 2014 - aclanthology.org

Dans un contexte d'interaction homme-machine, les systèmes de détection des émotions
dans la voix doivent être robustes aux variabilités et efficaces en temps de calcul. Cet article …

被引用次数：12 相关文章所有 4 个版本

[PDF] academia.edu

Analysis and synthesis of strong vocal expressions: Extension and application of audio texture features to singing voice

H Kawahara, M Morise - 2012 IEEE International Conference …, 2012 - ieeexplore.ieee.org

Realistic reconstruction and manipulation of strong vocal expressions found in singing
voices is a challenging and exciting topic. A speech analysis, modification and resynthesis …

被引用次数：12 相关文章所有 4 个版本

[PDF] isca-archive.org

[PDF][PDF] Excitation source analysis for high-quality speech manipulation systems based on an interference-free representation of group delay with minimum phase …

H Kawahara, M Morise, T Toda, H Banno… - … Annual Conference of …, 2014 - isca-archive.org

A group delay-based excitation source analysis and design method is introduced for
extension of TANDEM-STRAIGHT, a speech analysis, modification and synthesis system …

被引用次数：7 相关文章所有 3 个版本

[PDF] hal.science

Between physics and perception: Signal models for high level audio processing

A Roebel - Digital Audio Effects (DAFx), 2010 - hal.science

The use of signal models is one of the key factors enabling us to establish high quality signal
transformation algorithms with intuitive high level control parameters. In the present article …

被引用次数：9 相关文章所有 8 个版本

高级搜索

QQ 群