Signal Processing Methods for Music Transcription is the first book dedicated to uniting research related to signal processing algorithms and models for various aspects of music …
There is not one kind, but instead several kinds, of creaky voice, or creak. There is no single defining property shared by all kinds. Instead, each kind exhibits some properties but not …
An algorithm is presented for the estimation of the fundamental frequency (F 0) of speech or musical sounds. It is based on the well-known autocorrelation method with a number of …
P Belin, S Fillion-Bilodeau, F Gosselin - Behavior research methods, 2008 - Springer
Abstract The Montreal Affective Voices consist of 90 nonverbal affect bursts corresponding to the emotions of anger, disgust, fear, pain, sadness, surprise, happiness, and pleasure (plus …
H Kawahara, M Morise, T Takahashi… - … on acoustics, speech …, 2008 - ieeexplore.ieee.org
A simple new method for estimating temporally stable power spectra is introduced to provide a unified basis for computing an interference-free spectrum, the fundamental frequency (F0) …
A Camacho, JG Harris - The Journal of the Acoustical Society of …, 2008 - pubs.aip.org
A sawtooth waveform inspired pitch estimator (SWIPE) has been developed for speech and music. SWIPE estimates the pitch as the fundamental frequency of the sawtooth waveform …
T Drugman, A Alwan - arXiv preprint arXiv:2001.00459, 2019 - arxiv.org
This paper focuses on the problem of pitch tracking in noisy conditions. A method using harmonic information in the residual signal is presented. The proposed criterion is used both …
H Kawahara - Acoustical science and technology, 2006 - jstage.jst.go.jp
STRAIGHT, a speech analysis, modification synthesis system, is an extension of the classical channel VOCODER that exploits the advantages of progress in information …
H Kawahara, J Estill, O Fujimura - … on models and analysis of vocal …, 2001 - isca-archive.org
A new control paradigm of source signals for high quality speech synthesis is introduced to handle a variety of speech quality, based on timefrequency analyses by the use of an …