I Guellil, H Saâdane, F Azouaou, B Gueni… - Journal of King Saud …, 2021 - Elsevier
Arabic is recognised as the 4th most used language of the Internet. Arabic has three main varieties:(1) classical Arabic (CA),(2) Modern Standard Arabic (MSA),(3) Arabic Dialect (AD) …
Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs …
In this paper, we explore the effects of language variants, data sizes, and fine-tuning task types in Arabic pre-trained language models. To do so, we build three pre-trained language …
Abstract We present CAMeL Tools, a collection of open-source tools for Arabic natural language processing in Python. CAMeL Tools currently provides utilities for pre-processing …
Transfer learning with a unified Transformer framework (T5) that converts all language problems into a text-to-text format was recently proposed as a simple and effective transfer …
We describe findings of the third Nuanced Arabic Dialect Identification Shared Task (NADI 2022). NADI aims at advancing state of the art Arabic NLP, including on Arabic dialects. It …
In this paper, we present the results and findings of the MADAR Shared Task on Arabic Fine- Grained Dialect Identification. This shared task was organized as part of The Fourth Arabic …
THE TERM NATURAL language refers to any system of symbolic communication (spoken, signed, or written) that has evolved naturally in humans without intentional human planning …
Previous work on the problem of Arabic Dialect Identification typically targeted coarse- grained five dialect classes plus Standard Arabic (6-way classification). This paper presents …