World: a vocoder-based high-quality speech synthesis system for real-time applications

M Morise, F Yokomori, K Ozawa - IEICE TRANSACTIONS on …, 2016 - search.ieice.org
A vocoder-based speech synthesis system, named WORLD, was developed in an effort to
improve the sound quality of real-time applications using speech. Speech analysis …

Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a …

H Kawahara, I Masuda-Katsuse, A De Cheveigne - Speech communication, 1999 - Elsevier
A set of simple new procedures has been developed to enable the real-time manipulation of
speech parameters. The proposed method uses pitch-adaptive spectral analysis combined …

Speech and melody recognition in binaurally combined acoustic and electric hearing

YY Kong, GS Stickney, FG Zeng - The Journal of the Acoustical Society …, 2005 - pubs.aip.org
Speech recognition in noise and music perception is especially challenging for current
cochlear implant users. The present study utilizes the residual acoustic hearing in the …

[PDF][PDF] Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis …

H Kawahara, J Estill, O Fujimura - … on models and analysis of vocal …, 2001 - isca-archive.org
A new control paradigm of source signals for high quality speech synthesis is introduced to
handle a variety of speech quality, based on timefrequency analyses by the use of an …

Expressive speech synthesis: a review

D Govind, SRM Prasanna - International Journal of Speech Technology, 2013 - Springer
The objective of the present work is to provide a detailed review of expressive speech
synthesis (ESS). Among various approaches for ESS, the present paper focuses the …

The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise

Y Lu, M Cooke - Speech Communication, 2009 - Elsevier
Talkers modify the way they speak in the presence of noise. As well as increases in voice
level and fundamental frequency (F0), a flattening of spectral tilt is observed. The resulting …

Voice transformation: a survey

Y Stylianou - 2009 IEEE International Conference on Acoustics …, 2009 - ieeexplore.ieee.org
Voice transformation refers to the various modifications one may apply to the sound
produced by a person, speaking or singing. Voice transformation is usually seen as an add …

[PDF][PDF] Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity

H Kawahara, H Katayose, A Cheveigné… - … european conference on …, 1999 - Citeseer
An accurate fundamental frequency (F0) estimation method for non-stationary, speech-like
sounds is proposed based on the differential properties of the instantaneous frequencies of …

Cortical activity patterns predict speech discrimination ability

CT Engineer, CA Perez, YTH Chen, RS Carraway… - Nature …, 2008 - nature.com
Neural activity in the cerebral cortex can explain many aspects of sensory perception.
Extensive psychophysical and neurophysiological studies of visual motion and vibrotactile …

Prosody modification using instants of significant excitation

KS Rao, B Yegnanarayana - IEEE Transactions on Audio …, 2006 - ieeexplore.ieee.org
Prosody modification involves changing the pitch and duration of speech without affecting
the message and naturalness. This paper proposes a method for prosody (pitch and …