CAMeL tools: An open source python toolkit for Arabic natural language processing

O Obeid, N Zalmout, S Khalifa, D Taji… - Proceedings of the …, 2020 - aclanthology.org
Abstract We present CAMeL Tools, a collection of open-source tools for Arabic natural
language processing in Python. CAMeL Tools currently provides utilities for pre-processing …

[PDF][PDF] Madamira: A fast, comprehensive tool for morphological analysis and disambiguation of arabic.

A Pasha, M Al-Badrashiny, MT Diab, A El Kholy… - Lrec, 2014 - academia.edu
In this paper, we present MADAMIRA, a system for morphological analysis and
disambiguation of Arabic that combines some of the best aspects of two previously …

Overview of the SPMRL 2013 shared task: A cross-framework evaluation of parsing morphologically rich languages

D Seddah, R Tsarfaty, S Kübler, M Candito… - Proceedings of the …, 2013 - hal.science
This paper reports on the first shared task on statistical parsing of morphologically rich lan-
guages (MRLs). The task features data sets from nine languages, each available both in …

[PDF][PDF] Word segmentation of informal Arabic with domain adaptation

W Monroe, S Green, CD Manning - … of the 52nd Annual Meeting of …, 2014 - aclanthology.org
Segmentation of clitics has been shown to improve accuracy on a variety of Arabic NLP
tasks. However, state-of-the-art Arabic word segmenters are either limited to formal Modern …

Don't throw those morphological analyzers away just yet: Neural morphological disambiguation for Arabic

N Zalmout, N Habash - Proceedings of the 2017 Conference on …, 2017 - aclanthology.org
This paper presents a model for Arabic morphological disambiguation based on Recurrent
Neural Networks (RNN). We train Long Short-Term Memory (LSTM) cells in several …

Morphosyntactic tagging with pre-trained language models for Arabic and its dialects

G Inoue, S Khalifa, N Habash - arXiv preprint arXiv:2110.06852, 2021 - arxiv.org
We present state-of-the-art results on morphosyntactic tagging across different varieties of
Arabic using fine-tuned pre-trained transformer language models. Our models consistently …

BERT-Based Arabic Diacritization: A state-of-the-art approach for improving text accuracy and pronunciation

R Kharsa, A Elnagar, S Yagi - Expert Systems with Applications, 2024 - Elsevier
In order to accurately represent the meaning and pronunciation of Arabic words and
sentences, the presence of diacritics plays a crucial role. Over the years, researchers have …

An Arabic morphological analyzer and generator with copious features

D Taji, S Khalifa, O Obeid, F Eryani… - Proceedings of the …, 2018 - aclanthology.org
We introduce CALIMA-Star, a very rich Arabic morphological analyzer and generator that
provides functional and form-based morphological features as well as built-in tokenization …

A unified model for Arabizi detection and transliteration using sequence-to-sequence models

A Shazal, A Usman, N Habash - … of the fifth arabic natural language …, 2020 - aclanthology.org
While online Arabic is primarily written using the Arabic script, a Roman-script variety called
Arabizi is often seen on social media. Although this representation captures the phonology …

Fine-tuning bert-based pre-trained models for arabic dependency parsing

S Al-Ghamdi, H Al-Khalifa, A Al-Salman - Applied Sciences, 2023 - mdpi.com
With the advent of pre-trained language models, many natural language processing tasks in
various languages have achieved great success. Although some research has been …