Two-way feature extraction for speech emotion recognition using deep learning

A Aggarwal, A Srivastava, A Agarwal, N Chahal… - Sensors, 2022 - mdpi.com
Recognizing human emotions by machines is a complex task. Deep learning models
attempt to automate this process by rendering machines to exhibit learning capabilities …

Masked graph learning with recurrent alignment for multimodal emotion recognition in conversation

T Meng, F Zhang, Y Shou, H Shao… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Since Multimodal Emotion Recognition in Conversation (MERC) can be applied to public
opinion monitoring, intelligent dialogue robots, and other fields, it has received extensive …

Speechformer++: A hierarchical efficient framework for paralinguistic speech processing

W Chen, X Xing, X Xu, J Pang… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
Paralinguistic speech processing is important in addressing many issues, such as sentiment
and neurocognitive disorder analyses. Recently, Transformer has achieved remarkable …

[HTML][HTML] A fractional order numerical study for the influenza disease mathematical model

Z Sabir, SB Said, Q Al-Mdallal - Alexandria Engineering Journal, 2023 - Elsevier
The motive of these investigations is to present the numerical performances of the fractional
order mathematical influenza disease model (FO-MIDM) by designing the computational …

Speech emotion recognition based on parallel CNN-attention networks with multi-fold data augmentation

JL Bautista, YK Lee, HS Shin - Electronics, 2022 - mdpi.com
In this paper, an automatic speech emotion recognition (SER) task of classifying eight
different emotions was experimented using parallel based networks trained using the …

Semantic alignment network for multi-modal emotion recognition

M Hou, Z Zhang, C Liu, G Lu - IEEE Transactions on Circuits …, 2023 - ieeexplore.ieee.org
Modality alignment can maintain the consistency of semantics in multi-modal emotion
recognition tasks, ensuring that features from different modalities accurately represent the …

Mgat: Multi-granularity attention based transformers for multi-modal emotion recognition

W Fan, X Xing, B Cai, X Xu - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Multi-modal emotion recognition is crucial for human-computer interaction. Many existing
algorithms attempt to achieve multi-modal interactions through a cross-attention mechanism …

[PDF][PDF] Speech Emotion Recognition via Generation using an Attention-based Variational Recurrent Neural Network.

M Baruah, B Banerjee - INTERSPEECH, 2022 - researchgate.net
The last decade has seen an exponential rise in the number of attention-based models for
speech emotion recognition (SER). Most of these models use a spectrogram as the input …

Exploring complementary features in multi-modal speech emotion recognition

S Wang, Y Ma, Y Ding - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Speech emotion recognition (SER) is of great importance in human-computer interaction.
Recent research has demonstrated that self-supervised learned acoustic and linguistic …

ESERNet: Learning spectrogram structure relationship for effective speech emotion recognition with swin transformer in classroom discourse analysis

T Liu, M Wang, B Yang, H Liu, S Yi - Neurocomputing, 2025 - Elsevier
Speech emotion recognition (SER) has received increased attention due to its extensive
applications in many fields, especially in the analysis of teacher-student dialogue in …