F Qarah - arXiv preprint arXiv:2408.03524, 2024 - arxiv.org
This study presents EgyBERT, an Arabic language model pretrained on 10.4 GB of Egyptian dialectal texts. We evaluated EgyBERT's performance by comparing it with five other …
This survey offers a comprehensive overview of Large Language Models (LLMs) designed for the Arabic language and its dialects. It covers key architectures, including encoder-only …
F Qarah - arXiv preprint arXiv:2405.06239, 2024 - arxiv.org
In this paper, we introduce SaudiBERT, a monodialect Arabic language model pretrained exclusively on Saudi dialectal text. To demonstrate the model's effectiveness, we compared …
A Sassi, J Tonga, S Poaty, S Steve… - 2024 International …, 2024 - ieeexplore.ieee.org
This paper presents the African Dialect Dataset for Sentiment Analysis (AfriDial), a new natural language processing dataset. This dataset is intended to aid in the classification of …
In addition to features and methods used in LI, this chapter introduces the notation devised by Jauhiainen et al. that is used throughout this book to describe LI methods. For easier …