Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …

Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A Survey

H Yang, Y Zhao, Y Wu, S Wang, T Zheng… - arXiv preprint arXiv …, 2024 - arxiv.org
Compared to traditional sentiment analysis, which only considers text, multimodal sentiment
analysis needs to consider emotional signals from multimodal sources simultaneously and …

Multi-modal sarcasm detection via cross-modal graph convolutional network

B Liang, C Lou, X Li, M Yang, L Gui… - Proceedings of the …, 2022 - wrap.warwick.ac.uk
With the increasing popularity of posting multimodal messages online, many recent studies
have been carried out utilizing both textual and visual information for multi-modal sarcasm …

Revisiting disentanglement and fusion on modality and context in conversational multimodal emotion recognition

B Li, H Fei, L Liao, Y Zhao, C Teng, TS Chua… - Proceedings of the 31st …, 2023 - dl.acm.org
It has been a hot research topic to enable machines to understand human emotions in
multimodal contexts under dialogue scenarios, which is tasked with multimodal emotion …

Vision-language pre-training for multimodal aspect-based sentiment analysis

Y Ling, J Yu, R Xia - arXiv preprint arXiv:2204.07955, 2022 - arxiv.org
As an important task in sentiment analysis, Multimodal Aspect-Based Sentiment Analysis
(MABSA) has attracted increasing attention in recent years. However, previous approaches …

CLMLF: A contrastive learning and multi-layer fusion method for multimodal sentiment detection

Z Li, B Xu, C Zhu, T Zhao - arXiv preprint arXiv:2204.05515, 2022 - arxiv.org
Compared with unimodal data, multimodal data can provide more features to help the model
analyze the sentiment of data. Previous research works rarely consider token-level feature …

A comprehensive review of visual–textual sentiment analysis from social media networks

IKS Al-Tameemi, MR Feizi-Derakhshi… - … of Computational Social …, 2024 - Springer
Social media networks have become a significant aspect of people's lives, serving as a
platform for their ideas, opinions and emotions. Consequently, automated sentiment …

Tackling modality heterogeneity with multi-view calibration network for multimodal sentiment detection

Y Wei, S Yuan, R Yang, L Shen, Z Li… - Proceedings of the …, 2023 - aclanthology.org
With the popularity of social media, detecting sentiment from multimodal posts (eg image-
text pairs) has attracted substantial attention recently. Existing works mainly focus on fusing …

Interpretable multimodal sentiment classification using deep multi-view attentive network of image and text data

IKS Al-Tameemi, MR Feizi-Derakhshi… - IEEE …, 2023 - ieeexplore.ieee.org
Multimodal data can convey user emotions and feelings more effectively and interactively
than unimodal content. Thus, multimodal sentiment analysis (MSA) research has recently …

[PDF][PDF] Targeted Multimodal Sentiment Classification based on Coarse-to-Fine Grained Image-Target Matching.

J Yu, J Wang, R Xia, J Li - IJCAI, 2022 - ijcai.org
Abstract Targeted Multimodal Sentiment Classification (TMSC) aims to identify the sentiment
polarities over each target mentioned in a pair of sentence and image. Existing methods to …