Extraction and utilization of excitation information of speech: A review

SR Kadiri, P Alku, B Yegnanarayana - Proceedings of the IEEE, 2021 - ieeexplore.ieee.org
Speech production can be regarded as a process where a time-varying vocal tract system
(filter) is excited by a time-varying excitation. In addition to its linguistic message, the speech …

Epoch extraction from emotional speech using single frequency filtering approach

SR Kadiri, B Yegnanarayana - Speech Communication, 2017 - Elsevier
Epochs are instants of significant excitation of the vocal tract system during production of
voiced speech. Existing methods for epoch extraction provide good results on neutral …

Analysis and Detection of Phonation Modes in Singing Voice using Excitation Source Features and Single Frequency Filtering Cepstral Coefficients (SFFCC)

SR Kadiri, B Yegnanarayana - Interspeech, 2018 - researchgate.net
In this study, classification of the phonation modes in singing voice is carried out. Phonation
modes in singing voice can be described using four categories: breathy, neutral, flow and …

Epoch estimation from emotional speech signals using variational mode decomposition

GJ Lal, EA Gopalakrishnan, D Govind - Circuits, Systems, and Signal …, 2018 - Springer
This paper presents a novel approach for the estimation of epochs from the emotional
speech signal. Epochs are the locations of significant excitation in the vocal tract during the …

Significance of incorporating excitation source parameters for improved emotion recognition from speech and electroglottographic signals

D Pravena, D Govind - International Journal of Speech Technology, 2017 - Springer
The work presented in this paper explores the effectiveness of incorporating the excitation
source parameters such as strength of excitation and instantaneous fundamental frequency …

Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing

SR Kadiri, P Alku - The Journal of the Acoustical Society of America, 2019 - pubs.aip.org
Existing studies in classification of phonation types in singing use voice source features and
Mel-frequency cepstral coefficients (MFCCs) showing poor performance due to high pitch in …

Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source

SR Kadiri, B Yegnanarayana - Interspeech, 2018 - isca-archive.org
This paper focuses on the problem of estimating fundamental frequency from singing voice.
Estimation of fundamental frequency is a well studied topic in the speech research …

Comparison of glottal closure instants detection algorithms for emotional speech

SR Kadiri, P Alku… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
In production of voiced speech, epochs or glottal closure instants (GCIs) refer to the instants
of significant excitation of the vocal tract. Extraction of GCIs is used as a pre-processing …

Detection and assessment of hypernasality in repaired cleft palate speech using vocal tract and residual features

AK Dubey, SR Prasanna, S Dandapat - The Journal of the Acoustical …, 2019 - pubs.aip.org
The presence of hypernasality in repaired cleft palate (CP) speech is a consequence of
velopharyngeal insufficiency. The coupling of the nasal tract with the oral tract adds nasal …

Analysis of aperiodicity in artistic Noh singing voice using an impulse sequence representation of excitation source

SR Kadiri, B Yegnanarayana - The Journal of the Acoustical Society of …, 2019 - pubs.aip.org
Aperiodicity in the voice source is caused by changes in the vocal fold vibrations, other than
the normal quasi-periodicity and the turbulence at the glottis. The aperiodicity appears to be …