Lexicon annotation in sentiment analysis for dialectal Arabic: Systematic review of current trends and future directions

SM Sherif, AH Alamoodi, OS Albahri, S Garfan… - Information Processing …, 2023 - Elsevier
Due to the vast volumes of newly streamed data on the Internet and social media, the use of
sentiment analysis (SA) to extract information and analyze people's opinions has become a …

[HTML][HTML] Arabic natural language processing: An overview

I Guellil, H Saâdane, F Azouaou, B Gueni… - Journal of King Saud …, 2021 - Elsevier
Arabic is recognised as the 4th most used language of the Internet. Arabic has three main
varieties:(1) classical Arabic (CA),(2) Modern Standard Arabic (MSA),(3) Arabic Dialect (AD) …

Language model tokenizers introduce unfairness between languages

A Petrov, E La Malfa, P Torr… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent language models have shown impressive multilingual performance, even when not
explicitly trained for it. Despite this, there are concerns about the quality of their outputs …

The interplay of variant, size, and task type in Arabic pre-trained language models

G Inoue, B Alhafni, N Baimukan, H Bouamor… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we explore the effects of language variants, data sizes, and fine-tuning task
types in Arabic pre-trained language models. To do so, we build three pre-trained language …

CAMeL tools: An open source python toolkit for Arabic natural language processing

O Obeid, N Zalmout, S Khalifa, D Taji… - Proceedings of the …, 2020 - aclanthology.org
Abstract We present CAMeL Tools, a collection of open-source tools for Arabic natural
language processing in Python. CAMeL Tools currently provides utilities for pre-processing …

AraT5: Text-to-text transformers for Arabic language generation

EMB Nagoudi, AR Elmadany… - arXiv preprint arXiv …, 2021 - arxiv.org
Transfer learning with a unified Transformer framework (T5) that converts all language
problems into a text-to-text format was recently proposed as a simple and effective transfer …

NADI 2022: The third nuanced Arabic dialect identification shared task

M Abdul-Mageed, C Zhang, AR Elmadany… - arXiv preprint arXiv …, 2022 - arxiv.org
We describe findings of the third Nuanced Arabic Dialect Identification Shared Task (NADI
2022). NADI aims at advancing state of the art Arabic NLP, including on Arabic dialects. It …

The MADAR shared task on Arabic fine-grained dialect identification

H Bouamor, S Hassan, N Habash - Proceedings of the Fourth …, 2019 - aclanthology.org
In this paper, we present the results and findings of the MADAR Shared Task on Arabic Fine-
Grained Dialect Identification. This shared task was organized as part of The Fourth Arabic …

A panoramic survey of natural language processing in the Arab world

K Darwish, N Habash, M Abbas, H Al-Khalifa… - Communications of the …, 2021 - dl.acm.org
THE TERM NATURAL language refers to any system of symbolic communication (spoken,
signed, or written) that has evolved naturally in humans without intentional human planning …

Fine-grained Arabic dialect identification

M Salameh, H Bouamor, N Habash - Proceedings of the 27th …, 2018 - aclanthology.org
Previous work on the problem of Arabic Dialect Identification typically targeted coarse-
grained five dialect classes plus Standard Arabic (6-way classification). This paper presents …