[HTML][HTML] Deep learning techniques for speech emotion recognition, from databases to models

BJ Abbaschian, D Sierra-Sosa, A Elmaghraby - Sensors, 2021 - mdpi.com
The advancements in neural networks and the on-demand need for accurate and near real-
time Speech Emotion Recognition (SER) in human–computer interactions make it …

Automated emotion recognition: Current trends and future perspectives

M Maithri, U Raghavendra, A Gudigar… - Computer methods and …, 2022 - Elsevier
Background Human emotions greatly affect the actions of a person. The automated emotion
recognition has applications in multiple domains such as health care, e-learning …

Speech emotion recognition with deep convolutional neural networks

D Issa, MF Demirci, A Yazici - Biomedical Signal Processing and Control, 2020 - Elsevier
The speech emotion recognition (or, classification) is one of the most challenging topics in
data science. In this work, we introduce a new architecture, which extracts mel-frequency …

[HTML][HTML] A review on speech emotion recognition using deep learning and attention mechanism

E Lieskovská, M Jakubec, R Jarina, M Chmulík - Electronics, 2021 - mdpi.com
Emotions are an integral part of human interactions and are significant factors in determining
user satisfaction or customer opinion. speech emotion recognition (SER) modules also play …

Survey of deep representation learning for speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Traditionally, speech emotion recognition (SER) research has relied on manually
handcrafted acoustic features using feature engineering. However, the design of …

An ensemble 1D-CNN-LSTM-GRU model with data augmentation for speech emotion recognition

MR Ahmed, S Islam, AKMM Islam… - Expert Systems with …, 2023 - Elsevier
Precise recognition of emotion from speech signals aids in enhancing human–computer
interaction (HCI). The performance of a speech emotion recognition (SER) system depends …

[HTML][HTML] CLSTM: Deep feature-based speech emotion recognition using the hierarchical ConvLSTM network

Mustaqeem, S Kwon - Mathematics, 2020 - mdpi.com
Artificial intelligence, deep learning, and machine learning are dominant sources to use in
order to make a system smarter. Nowadays, the smart speech emotion recognition (SER) …

Generative adversarial networks for speech processing: A review

A Wali, Z Alamgir, S Karim, A Fawaz, MB Ali… - Computer Speech & …, 2022 - Elsevier
Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …

Self supervised adversarial domain adaptation for cross-corpus and cross-language speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Despite the recent advancement in speech emotion recognition (SER) within a single corpus
setting, the performance of these SER systems degrades significantly for cross-corpus and …

[HTML][HTML] Data augmentation for audio-visual emotion recognition with an efficient multimodal conditional GAN

F Ma, Y Li, S Ni, SL Huang, L Zhang - Applied Sciences, 2022 - mdpi.com
Audio-visual emotion recognition is the research of identifying human emotional states by
combining the audio modality and the visual modality simultaneously, which plays an …