MSER: Multimodal speech emotion recognition using cross-attention with deep fusion

M Khan, W Gueaieb, A El Saddik, S Kwon - Expert Systems with …, 2024 - Elsevier
In human–computer interaction (HCI) and especially speech signal processing, emotion
recognition is one of the most important and challenging tasks due to multi-modality and …

Deep learning approaches for bimodal speech emotion recognition: Advancements, challenges, and a multi-learning model

S Kakuba, A Poulose, DS Han - IEEE Access, 2023 - ieeexplore.ieee.org
Though acoustic speech emotion recognition has been studied for a while, bimodal speech
emotion recognition using both acoustic and text has gained momentum since speech …

[HTML][HTML] Addressing data scarcity in speech emotion recognition: A comprehensive review

S Kakuba, DS Han - ICT Express, 2024 - Elsevier
Speech emotion recognition (SER) is a critical field within affective computing, aiming to
detect and classify emotional states from speech signals, which vary dynamically over time …

ESERNet: Learning spectrogram structure relationship for effective speech emotion recognition with swin transformer in classroom discourse analysis

T Liu, M Wang, B Yang, H Liu, S Yi - Neurocomputing, 2025 - Elsevier
Speech emotion recognition (SER) has received increased attention due to its extensive
applications in many fields, especially in the analysis of teacher-student dialogue in …

DC-BVM: Dual-channel information fusion network based on voting mechanism

B Miao, Y Xu, J Wang, Y Zhang - Biomedical Signal Processing and Control, 2024 - Elsevier
Emotion recognition in conversations (ERC) has been challenging due to the dynamics and
complexity of emotions in conversations. Most current emotion recognition studies have …

TER-CA-WGNN: Trimodel Emotion Recognition Using Cumulative Attribute-Weighted Graph Neural Network

HFT Al-Saadawi, R Das - Applied Sciences, 2024 - mdpi.com
Affective computing is a multidisciplinary field encompassing artificial intelligence, natural
language processing, linguistics, computer science, and social sciences. This field aims to …

[HTML][HTML] MelTrans: Mel-Spectrogram Relationship-Learning for Speech Emotion Recognition via Transformers

H Li, J Li, H Liu, T Liu, Q Chen, X You - Sensors, 2024 - mdpi.com
Speech emotion recognition (SER) is not only a ubiquitous aspect of everyday
communication, but also a central focus in the field of human–computer interaction …

Joint low-rank tensor fusion and cross-modal attention for multimodal physiological signals based emotion recognition

X Wan, Y Wang, Z Wang, Y Tang… - Physiological …, 2024 - iopscience.iop.org
Objective. Physiological signals based emotion recognition is a prominent research domain
in the field of human-computer interaction. Previous studies predominantly focused on …

Holistic-Based Cross-Attention Modal Fusion Network for Video Sign Language Recognition

Q Gao, J Hu, H Mai, Z Ju - IEEE Transactions on Computational …, 2024 - ieeexplore.ieee.org
As a bridge between the deaf people and the outside, sign language primarily involves hand
movements, complemented by intricate facial and body expressions. To enhance the …

Piezoelectric Touch Sensing and Random-Forest-Based Technique for Emotion Recognition

Y Qi, W Jia, L Feng, Y Dai, C Tang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Emotion recognition, a process of automatic cognition of human emotions, has great
potential to improve the degree of social intelligence. Among various recognition methods …