Km-bart: Knowledge enhanced multimodal bart for visual commonsense generation

S Islam, H Elmekki, A Elsebai, J Bentahar… - Expert Systems with …, 2023 - Elsevier

Abstract Transformers are Deep Neural Networks (DNN) that utilize a self-attention
mechanism to capture contextual relationships within sequential data. Unlike traditional …

被引用次数：41 相关文章所有 4 个版本

[PDF] arxiv.org

Knowledge graphs meet multi-modal learning: A comprehensive survey

Z Chen, Y Zhang, Y Fang, Y Geng, L Guo… - arXiv preprint arXiv …, 2024 - arxiv.org

Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …

被引用次数：15 相关文章所有 2 个版本

[PDF] wiley.com Full View

A Fine‐Tuned BERT‐Based Transfer Learning Approach for Text Classification

R Qasim, WH Bangyal, MA Alqarni… - Journal of healthcare …, 2022 - Wiley Online Library

Text Classification problem has been thoroughly studied in information retrieval problems
and data mining tasks. It is beneficial in multiple tasks including medical diagnose health …

被引用次数：106 相关文章所有 9 个版本

[PDF] arxiv.org

Knowledge graph augmented network towards multiview representation learning for aspect-based sentiment analysis

Q Zhong, L Ding, J Liu, B Du, H Jin… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Aspect-based sentiment analysis (ABSA) is a fine-grained task of sentiment analysis. To
better comprehend long complicated sentences and obtain accurate aspect-specific …

被引用次数：74 相关文章所有 5 个版本

[PDF] arxiv.org

Vision-language pre-training for multimodal aspect-based sentiment analysis

Y Ling, J Yu, R Xia - arXiv preprint arXiv:2204.07955, 2022 - arxiv.org

As an important task in sentiment analysis, Multimodal Aspect-Based Sentiment Analysis
(MABSA) has attracted increasing attention in recent years. However, previous approaches …

被引用次数：58 相关文章所有 4 个版本

[PDF] arxiv.org

Multi-source semantic graph-based multimodal sarcasm explanation generation

L Jing, X Song, K Ouyang, M Jia, L Nie - arXiv preprint arXiv:2306.16650, 2023 - arxiv.org

Multimodal Sarcasm Explanation (MuSE) is a new yet challenging task, which aims to
generate a natural language sentence for a multimodal social post (an image as well as its …

被引用次数：9 相关文章所有 4 个版本

[PDF] arxiv.org

Summary-oriented vision modeling for multimodal abstractive summarization

Y Liang, F Meng, J Xu, J Wang, Y Chen… - arXiv preprint arXiv …, 2022 - arxiv.org

Multimodal abstractive summarization (MAS) aims to produce a concise summary given the
multimodal data (text and vision). Existing studies mainly focus on how to effectively use the …

被引用次数：17 相关文章所有 4 个版本

[PDF] arxiv.org

Unisa: Unified generative framework for sentiment analysis

Z Li, TE Lin, Y Wu, M Liu, F Tang, M Zhao… - Proceedings of the 31st …, 2023 - dl.acm.org

Sentiment analysis is a crucial task that aims to understand people's emotional states and
predict emotional categories based on multimodal information. It consists of several …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Recent advances in neural text generation: A task-agnostic survey

C Tang, F Guerin, C Lin - arXiv preprint arXiv:2203.03047, 2022 - arxiv.org

In recent years, considerable research has been dedicated to the application of neural
models in the field of natural language generation (NLG). The primary objective is to …

被引用次数：15 相关文章所有 3 个版本

[PDF] arxiv.org

Few-shot adaptation of multi-modal foundation models: A survey

F Liu, T Zhang, W Dai, W Cai, X Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org

Multi-modal (vision-language) models, such as CLIP, are replacing traditional supervised
pre-training models (eg, ImageNet-based pre-training) as the new generation of visual …

被引用次数：3 相关文章所有 4 个版本

高级搜索

QQ 群