作者
Tana Wang, Yaqing Hou, Dongsheng Zhou, Qiang Zhang
发表日期
2021/7/18
研讨会论文
2021 International Joint Conference on Neural Networks (IJCNN)
页码范围
1-7
出版商
IEEE
简介
Emotion recognition in conversation (ERC) is a challenging task due to the complexity of emotions and dynamics in dialogues. Current studies for emotion recognition mostly focus on the modeling of a single utterance in dialogue, which neglects self and inter-speaker influence. This paper presents a contextual attention neural network based on the multimodal framework that leverages the conversational information from both target and the other speaker for utterance-level emotion detection. Specifically, we utilize recurrent neural networks based on contextual attention for modeling the transaction and dependence between speakers. Further, the feature fusion is proposed to unite the important modal information extracted from multiple modalities, including audio, text and video, hence providing more useful and comprehensive knowledge for emotion recognition. The proposed approach shows its superiority in …
引用总数
学术搜索中的文章
T Wang, Y Hou, D Zhou, Q Zhang - 2021 International Joint Conference on Neural …, 2021