Homograph disambiguation through selective diacritic restoration

S Alqahtani, H Aldarmaki, M Diab - arXiv preprint arXiv:1912.04479, 2019 - arxiv.org
Lexical ambiguity, a challenging phenomenon in all natural languages, is particularly
prevalent for languages with diacritics that tend to be omitted in writing, such as Arabic …

[图书][B] Full and partial diacritic restoration: Development and impact on downstream applications

S Alqahtani - 2020 - search.proquest.com
Languages that include diacritics in speech but omit diacritics in writing to a certain degree
result in written texts that are even more ambiguous than typically expected. Not including …

[PDF][PDF] ARLEX: A large scale comprehensive lexical inventory for Modern Standard Arabic

S Alqahtani, M Diab, W Zaghouani - OSACT 3: The 3rd Workshop on …, 2018 - lrec-conf.org
This paper introduces a lexical resource, ARLEX, for Modern Standard Arabic (MSA) that
explicitly lists ambiguity at the lexical and syntactic levels for each token. Arabic orthography …

[PDF][PDF] Language Technologies for Social Media

W Zaghouani, E City - INTEGRATING ICTIN SOCIETY - infoz.ffzg.hr
We are witnessing an increased interest from stakeholders to collect and analyze in real
time the large-volume of information from social media streams using all kinds of …