查看文章

aclanthology.org 中的 [PDF]

Mawqif: A Multi-label Arabic Dataset for Target-specific Stance Detection

作者

Nora Alturayeif, Hamzah Luqman, Moataz Ahmed

发表日期

2022

研讨会论文

Proceedings of the The Seventh Arabic Natural Language Processing Workshop (WANLP), EMNLP 2022

页码范围

174–184

简介

Social media platforms are becoming inherent parts of people’s daily life to express opinions and stances toward topics of varying polarities. Stance detection determines the viewpoint expressed in a text toward a target. While communication on social media (eg, Twitter) takes place in more than 40 languages, the majority of stance detection research has been focused on English. Although some efforts have recently been made to develop stance detection datasets in other languages, no similar efforts seem to have considered the Arabic language. In this paper, we present Mawqif, the first Arabic dataset for target-specific stance detection, composed of 4,121 tweets annotated with stance, sentiment, and sarcasm polarities. Mawqif, as a multi-label dataset, can provide more opportunities for studying the interaction between different opinion dimensions and evaluating a multi-task model. We provide a detailed description of the dataset, present an analysis of the produced annotation, and evaluate four BERT-based models on it. Our best model achieves a macro-F1 of 78.89%, which shows that there is ample room for improvement on this challenging task. We publicly release our dataset, the annotation guidelines, and the code of the experiments.

引用总数

被引用次数：9

202320243 6

学术搜索中的文章

Mawqif: a multi-label Arabic dataset for target-specific stance detection

NS Alturayeif, HA Luqman, MAK Ahmed - Proceedings of the Seventh Arabic Natural Language …, 2022

被引用次数：9 相关文章所有 3 个版本