[HTML][HTML] An ongoing review of speech emotion recognition

J de Lope, M Graña - Neurocomputing, 2023 - Elsevier
User emotional status recognition is becoming a key feature in advanced Human Computer
Interfaces (HCI). A key source of emotional information is the spoken expression, which may …

An ensemble 1D-CNN-LSTM-GRU model with data augmentation for speech emotion recognition

MR Ahmed, S Islam, AKMM Islam… - Expert Systems with …, 2023 - Elsevier
Precise recognition of emotion from speech signals aids in enhancing human–computer
interaction (HCI). The performance of a speech emotion recognition (SER) system depends …

Medical applications of generative adversarial network: a visualization analysis

F Zhang, L Wang, J Zhao, X Zhang - Acta Radiologica, 2023 - journals.sagepub.com
Background Deep learning (DL) is one of the latest approaches to artificial intelligence. As
an unsupervised DL method, a generative adversarial network (GAN) can be used to …

Data augmentation for audio-visual emotion recognition with an efficient multimodal conditional GAN

F Ma, Y Li, S Ni, SL Huang, L Zhang - Applied Sciences, 2022 - mdpi.com
Audio-visual emotion recognition is the research of identifying human emotional states by
combining the audio modality and the visual modality simultaneously, which plays an …

Speech emotion recognition using convolution neural networks and multi-head convolutional transformer

R Ullah, M Asif, WA Shah, F Anjam, I Ullah… - Sensors, 2023 - mdpi.com
Speech emotion recognition (SER) is a challenging task in human–computer interaction
(HCI) systems. One of the key challenges in speech emotion recognition is to extract the …

Data augmentation using generative adversarial networks for images and biomarkers in medicine and neuroscience

MS Meor Yahaya, J Teo - Frontiers in Applied Mathematics and …, 2023 - frontiersin.org
The fields of medicine and neuroscience often face challenges in obtaining a sufficient
amount of diverse data for training machine learning models. Data augmentation can …

Significance of voiced and unvoiced speech segments for the detection of common cold

P Warule, SP Mishra, S Deb - Signal, image and video processing, 2023 - Springer
This work investigates the significance of the voiced and unvoiced region for detecting
common cold from the speech signal. In literature, the entire speech signal is processed to …

Learning speech emotion representations in the quaternion domain

E Guizzo, T Weyde, S Scardapane… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
The modeling of human emotion expression in speech signals is an important, yet
challenging task. The high resource demand of speech emotion recognition models …

Quality-aware bag of modulation spectrum features for robust speech emotion recognition

SR Kshirsagar, TH Falk - IEEE Transactions on Affective …, 2022 - ieeexplore.ieee.org
Automatic speech emotion recognition (SER) has gained popularity over the last decade
and numerous Challenges have emerged. While the latest Challenges have shown that …

Anomaly-based intrusion on iot networks using aigan-a generative adversarial network

Z Liu, J Hu, Y Liu, K Roy, X Yuan, J Xu - IEEE Access, 2023 - ieeexplore.ieee.org
Adversarial attacks have threatened the credibility of machine learning models and cast
doubts over the integrity of data. The attacks have created much harm in the fields of …