[HTML][HTML] Arabic natural language processing: An overview

I Guellil, H Saâdane, F Azouaou, B Gueni… - Journal of King Saud …, 2021 - Elsevier
Arabic is recognised as the 4th most used language of the Internet. Arabic has three main
varieties:(1) classical Arabic (CA),(2) Modern Standard Arabic (MSA),(3) Arabic Dialect (AD) …

Systematic literature review of dialectal Arabic: identification and detection

A Elnagar, SM Yagi, AB Nassif, I Shahin… - IEEE …, 2021 - ieeexplore.ieee.org
It is becoming increasingly difficult to know who is working on what and how in
computational studies of Dialectal Arabic. This study comes to chart the field by conducting a …

Language resources for Maghrebi Arabic dialects' NLP: a survey

J Younes, E Souissi, H Achour, A Ferchichi - Language Resources and …, 2020 - Springer
Diglossia is one of the main characteristics of Arabic language. In Arab countries, there are
three forms of Arabic that co-exist: Classical Arabic (CA) which is mainly used in the Quran …

Tarc: Tunisian arabish corpus first complete release

E Gugliotta, M Dinarelli - arXiv preprint arXiv:2207.04796, 2022 - arxiv.org
In this paper we present the final result of a project on Tunisian Arabic encoded in Arabizi,
the Latin-based writing system for digital conversations. The project led to the creation of two …

Translation from Tunisian Dialect to Modern Standard Arabic: Exploring Finite-State Transducers and Sequence-to-Sequence Transformer Approaches

R Torjmen, K Haddar - ACM Transactions on Asian and Low-Resource …, 2024 - dl.acm.org
Translation from the mother tongue, including the Tunisian dialect, to modern standard
Arabic is a highly significant field in natural language processing due to its wide range of …

Tarc: Incrementally and semi-automatically collecting a tunisian arabish corpus

E Gugliotta, M Dinarelli - arXiv preprint arXiv:2003.09520, 2020 - arxiv.org
This article describes the constitution process of the first morpho-syntactically annotated
Tunisian Arabish Corpus (TArC). Arabish, also known as Arabizi, is a spontaneous coding of …

Challenges and Progress in Constructing Arabic Dialect Corpora and Linguistic tools: A Focus on Moroccan and Tunisian Dialects

O Nahli, E Gugliotta, N Khlif… - 2023 7th IEEE Congress …, 2023 - ieeexplore.ieee.org
Given the lack of resources for Arabic dialects, the construction of corpora, lexical resources,
and tools is a non-trivial challenge. The focus of the article is to describe our in-progress …

The SMarT classifier for Arabic fine-grained dialect identification

K Meftouh, K Abidi, S Harrat… - Proceedings of the Fourth …, 2019 - aclanthology.org
This paper describes the approach adopted by the SMarT research group to build a dialect
identification system in the framework of the Madar shared task on Arabic fine-grained …

[PDF][PDF] Towards a Unified Digital Resource for Tunisian Arabic

E Gugliotta, M Mallia, L Panascì - Proceedings of the 4th …, 2023 - aclanthology.org
This paper presents our work on linking language tools for Tunisian Arabic, focusing on a
lexicographic database and a corpus of informal written texts. This work on Tunisian Arabic …

Arabic Text Formality Modification: A Review and Future Research Directions

SI Abudalfa, FJ Abdu, MM Alowaifeer - IEEE Access, 2024 - ieeexplore.ieee.org
Formality transfer seeks to adjust text formality without altering its core meaning, which
carries substantial implications across diverse domains like machine translation, dialogue …