K Darwish - arXiv preprint arXiv:1306.6755, 2013 - arxiv.org
Arabizi is Arabic text that is written using Latin characters. Arabizi is used to present both Modern Standard Arabic (MSA) or Arabic dialects. It is commonly used in informal settings …
K Darwish, W Magdy - Foundations and Trends® in …, 2014 - nowpublishers.com
In the past several years, Arabic Information Retrieval (IR) has garnered significant attention. The main research interests have focused on retrieval of formal language, mostly in the …
K Darwish - Proceedings of the 51st Annual Meeting of the …, 2013 - aclanthology.org
Some languages lack large knowledge bases and good discriminative features for Name Entity Recognition (NER) that can generalize to previously unseen named entities. One such …
In this paper, we present a new and fast state-of-the-art Arabic diacritizer that guesses the diacritics of words and then their case endings. We employ a Viterbi decoder at word-level …
Predicting the stance of social media users on a topic can be challenging, particularly for users who never express explicit stances. Earlier work has shown that using users' historical …
We present a dialectal Egyptian Arabic to English statistical machine translation system that leverages dialectal to Modern Standard Arabic (MSA) adaptation. In contrast to previous …
A Masmoudi, ME Khmekhem, M Khrouf… - ACM Transactions on …, 2019 - dl.acm.org
The evolution of information and communication technology has markedly influenced communication between correspondents. This evolution has facilitated the transmission of …
We propose a novel model to automatically extract transliteration pairs from parallel corpora. Our model is efficient, language pair independent and mines transliteration pairs in a …
Users of the WWW across the globe are increasing rapidly. According to Internet live stats there are more than 3 billion Internet users worldwide today and the number of non-English …