Multimodal sentiment analysis: A systematic review of history, datasets, multimodal fusion methods, applications, challenges and future directions

A Gandhi, K Adhvaryu, S Poria, E Cambria, A Hussain - Information Fusion, 2023 - Elsevier
Sentiment analysis (SA) has gained much traction In the field of artificial intelligence (AI) and
natural language processing (NLP). There is growing demand to automate analysis of user …

A systematic review on affective computing: Emotion models, databases, and recent advances

Y Wang, W Song, W Tao, A Liotta, D Yang, X Li, S Gao… - Information …, 2022 - Elsevier
Affective computing conjoins the research topics of emotion recognition and sentiment
analysis, and can be realized with unimodal or multimodal data, consisting primarily of …

A review of affective computing: From unimodal analysis to multimodal fusion

S Poria, E Cambria, R Bajpai, A Hussain - Information fusion, 2017 - Elsevier
Affective computing is an emerging interdisciplinary research field bringing together
researchers and practitioners from various fields, ranging from artificial intelligence, natural …

Long dialogue emotion detection based on commonsense knowledge graph guidance

W Nie, Y Bao, Y Zhao, A Liu - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Dialogue emotion detection is always challenging due to human subjectivity and the
randomness of dialogue content. In a conversation, the emotion of each person often …

A review of human activity recognition methods

M Vrigkas, C Nikou, IA Kakadiaris - Frontiers in Robotics and AI, 2015 - frontiersin.org
Recognizing human activities from video sequences or still images is a challenging task due
to problems, such as background clutter, partial occlusion, changes in scale, viewpoint …

A review and meta-analysis of multimodal affect detection systems

SK D'mello, J Kory - ACM computing surveys (CSUR), 2015 - dl.acm.org
Affect detection is an important pattern recognition problem that has inspired researchers
from several areas. The field is in need of a systematic review due to the recent influx of …

Learning affective features with a hybrid deep model for audio–visual emotion recognition

S Zhang, S Zhang, T Huang, W Gao… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Emotion recognition is challenging due to the emotional gap between emotions and audio-
visual features. Motivated by the powerful feature learning ability of deep neural networks …

Attentive convolutional neural network based speech emotion recognition: A study on the impact of input features, signal length, and acted speech

M Neumann, NT Vu - arXiv preprint arXiv:1706.00612, 2017 - arxiv.org
Speech emotion recognition is an important and challenging task in the realm of human-
computer interaction. Prior work proposed a variety of models and feature sets for training a …

Behavioral signal processing: Deriving human behavioral informatics from speech and language

S Narayanan, PG Georgiou - Proceedings of the IEEE, 2013 - ieeexplore.ieee.org
The expression and experience of human behavior are complex and multimodal and
characterized by individual and contextual heterogeneity and variability. Speech and …

Learning in audio-visual context: A review, analysis, and new perspective

Y Wei, D Hu, Y Tian, X Li - arXiv preprint arXiv:2208.09579, 2022 - arxiv.org
Sight and hearing are two senses that play a vital role in human communication and scene
understanding. To mimic human perception ability, audio-visual learning, aimed at …