Multimodal emotion recognition using cross modal audio-video fusion with attention and deep metric learning

B Mocanu, R Tapu, T Zaharia - Image and Vision Computing, 2023 - Elsevier
In the last few years, the multi-modal emotion recognition has become an important research
issue in the affective computing community due to its wide range of applications that include …

Sentiment analysis and emotion recognition from speech using universal speech representations

BT Atmaja, A Sasou - Sensors, 2022 - mdpi.com
The study of understanding sentiment and emotion in speech is a challenging task in human
multimodal language. However, in certain cases, such as telephone calls, only audio data …

Estimating the uncertainty in emotion attributes using deep evidential regression

W Wu, C Zhang, PC Woodland - arXiv preprint arXiv:2306.06760, 2023 - arxiv.org
In automatic emotion recognition (AER), labels assigned by different human annotators to
the same utterance are often inconsistent due to the inherent complexity of emotion and the …

After: Active learning based fine-tuning framework for speech emotion recognition

D Li, Y Wang, K Funakoshi… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Speech emotion recognition (SER) has drawn increasing attention for its applications in
human-machine interaction. However, existing SER methods ignore the information gap …

[PDF][PDF] Improving Joint Speech and Emotion Recognition Using Global Style Tokens

J Kyung, JS Seong, JH Choi, YR Jeoung… - Proceedings of the …, 2023 - isca-archive.org
Automatic speech recognition (ASR) and speech emotion recognition (SER) are closely
related in that the acoustic features of speech, such as pitch, tone, and intensity, can vary …

Speech emotion: Investigating model representations, multi-task learning and knowledge distillation

V Mitra, HYS Chien, V Kowtha, JY Cheng… - arXiv preprint arXiv …, 2022 - arxiv.org
Estimating dimensional emotions, such as activation, valence and dominance, from acoustic
speech signals has been widely explored over the past few years. While accurate estimation …

Integrating emotion recognition with speech recognition and speaker diarisation for conversations

W Wu, C Zhang, PC Woodland - arXiv preprint arXiv:2308.07145, 2023 - arxiv.org
Although automatic emotion recognition (AER) has recently drawn significant research
interest, most current AER studies use manually segmented utterances, which are usually …

[PDF][PDF] 情感识别中的迁移学习问题综述.

黄兆培, 张峰源, 赵金明, 金琴 - Journal of Signal Processing, 2023 - signal.ejournal.org.cn
情感识别是实现自然人机交互的必要过程. 然而, 情感数据高昂的采集和标注成本成为了限制
情感识别研究发展的一大瓶颈. 在无标注或有限标注的场景下, 利用知识的跨领域或跨任务迁移 …

Cross-corpus Speech Emotion Recognition using Semi-supervised Domain Adaptation Network

Y Zhang, M Jia, X Cao, J Ru, X Zhang - Speech Communication, 2024 - Elsevier
Speech emotion recognition (SER) is an important topic in human-computer interactions.
When the input data in training and testing sets comes from different corpora, there is a …

Pre-trained model representations and their robustness against noise for speech emotion analysis

V Mitra, V Kowtha, HYS Chien, E Azemi… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Pre-trained model representations have demonstrated state-of-the-art performance in
speech recognition, natural language processing, and other applications. Speech models …