Noise robust automatic speech recognition: review and analysis

M Dua, Akanksha, S Dua - International Journal of Speech Technology, 2023 - Springer
Abstract Automatic Speech Recognition (ASR) system is an emerging technology used in
various fields such as robotics, traffic controls, and healthcare, etc. The leading cause of …

Self-adaptive context and modal-interaction modeling for multimodal emotion recognition

H Yang, X Gao, J Wu, T Gan, N Ding… - Findings of the …, 2023 - aclanthology.org
The multimodal emotion recognition in conversation task aims to predict the emotion label
for a given utterance with its context and multiple modalities. Existing approaches achieve …

Hybrid multi-modal emotion recognition framework based on InceptionV3DenseNet

FM Alamgir, MS Alam - Multimedia Tools and Applications, 2023 - Springer
Emotion recognition is one of the most complex research areas as individuals express
emotional cues based on several modalities such as audio, facial expressions, and …

Multimodal affect models: An investigation of relative salience of audio and visual cues for emotion prediction

J Wu, T Dang, V Sethu, E Ambikairajah - Frontiers in Computer …, 2021 - frontiersin.org
People perceive emotions via multiple cues, predominantly speech and visual cues, and a
number of emotion recognition systems utilize both audio and visual cues. Moreover, the …

Deep learning methods for suicide prediction using audio classification

PG Jeyasheeli, C Kamaleshwar… - Journal of Positive …, 2022 - journalppw.com
Screening the suicidal ideation of people is one of the highly essential needs in this fast-
moving depressing world. We aim to design a model for finding suicidal ideation based on …

Speech Based Continuous Emotion Recognition: Modelling of Ambiguity and Temporal Dynamics

J Wu - 2024 - unsworks.unsw.edu.au
Speech emotion recognition (SER) plays a pivotal role in human-computer interaction (HCI).
The ability to perceive and understand emotion is crucial in effective human communication …