[HTML][HTML] An ongoing review of speech emotion recognition

J de Lope, M Graña - Neurocomputing, 2023 - Elsevier
User emotional status recognition is becoming a key feature in advanced Human Computer
Interfaces (HCI). A key source of emotional information is the spoken expression, which may …

Extraction and utilization of excitation information of speech: A review

SR Kadiri, P Alku, B Yegnanarayana - Proceedings of the IEEE, 2021 - ieeexplore.ieee.org
Speech production can be regarded as a process where a time-varying vocal tract system
(filter) is excited by a time-varying excitation. In addition to its linguistic message, the speech …

The Sound of Emotional Prosody: Nearly 3 Decades of Research and Future Directions

P Larrouy-Maestri, D Poeppel… - Perspectives on …, 2024 - journals.sagepub.com
Emotional voices attract considerable attention. A search on any browser using “emotional
prosody” as a key phrase leads to more than a million entries. Such interest is evident in the …

Detection of common cold from speech signals using deep neural network

S Deb, P Warule, A Nair, H Sultan, R Dash… - Circuits, Systems, and …, 2023 - Springer
This paper presents a deep learning-based analysis and classification of cold speech
observed when a person is diagnosed with the common cold. The common cold is a viral …

Neural network-based blended ensemble learning for speech emotion recognition

B Yalamanchili, SK Samayamantula… - … Systems and Signal …, 2022 - Springer
Abstract Speech Emotion Recognition (SER) identifies human emotion from short speech
signals that enable natural Human Computer Interactions (HCI). Accurate emotion prediction …

Mexican emotional speech database based on semantic, frequency, familiarity, concreteness, and cultural shaping of affective prosody

MM Duville, LM Alonso-Valerdi, DI Ibarra-Zarate - Data, 2021 - mdpi.com
In this paper, the Mexican Emotional Speech Database (MESD) that contains single-word
emotional utterances for anger, disgust, fear, happiness, neutral and sadness with adult …

A new Amharic speech emotion dataset and classification benchmark

EA Retta, E Almekhlafi, R Sutcliffe, M Mhamed… - ACM Transactions on …, 2023 - dl.acm.org
In this article we present the Amharic Speech Emotion Dataset (ASED), which covers four
dialects (Gojjam, Wollo, Shewa, and Gonder) and five different emotions (neutral, fearful …

Hilbert Domain Analysis of Wavelet Packets for Emotional Speech Classification

B Karan, A Kumar - Circuits, Systems, and Signal Processing, 2024 - Springer
This work investigates the significance of Hilbert domain characterization of wavelet packets
in classifying different emotion of speech signal. The goal of this paper is to create a new …

Hierarchical emotion recognition from speech using source, power spectral and prosodic features

A Haque, KS Rao - Multimedia Tools and Applications, 2024 - Springer
Features related to the glottal closure instants (GCI) exhibit different patterns for different
emotions. In this work, our main objective was to explore the effectiveness of these features …

[HTML][HTML] Analysis of instantaneous frequency components of speech signals for epoch extraction

SR Kadiri, P Alku, B Yegnanarayana - Computer Speech & Language, 2023 - Elsevier
The major impulse-like excitation in the speech signal is due to abrupt closure of the vocal
folds, which takes place at the glottal closure instant (GCI) or epoch in each cycle. GCIs are …