P Le-Hong - Knowledge-Based Systems, 2021 - Elsevier
One of the challenging problems in text processing is diacritics generation where one needs to generate diacritic marks for non-accented text. With an ever increasing amount of informal …
I Orife - arXiv preprint arXiv:1804.00832, 2018 - arxiv.org
Yor\ub\'a is a widely spoken West African language with a writing system rich in tonal and orthographic diacritics. With very few exceptions, diacritics are omitted from electronic texts …
Z Ozer, I Ozer, O Findik - … Science and Technology, an International Journal, 2018 - Elsevier
Social media platforms such as Twitter have grown at a tremendous pace in recent years and have become an important source of data providing information countless field. This …
A Arslan - Information Processing & Management, 2016 - Elsevier
The absence of diacritics in text documents or search queries is a serious problem for Turkish information retrieval because it creates homographic ambiguity. Thus, the …
This paper presents an empirical study of two machine translation-based approaches for Vietnamese diacritic restoration problem, including phrase-based and neural-based …
CH Nga, NK Thinh, PC Chang… - 2019 IEEE international …, 2019 - ieeexplore.ieee.org
Diacritics are very important in diacritical languages, because the meaning of sentences can be changed in accordance to diacritics. Writing without diacritics makes the sentences …
TDA Dang, TTT Nguyen - Proceedings of the 34th Pacific Asia …, 2020 - aclanthology.org
Diacritic restoration plays an important role in Natural Language Processing (NLP) for many diacritical languages such as Vietnamese, Czech, Hungarian, etc. With the development of …
II Ayogu, O Abu - 2020 IEEE 2nd International Conference on …, 2021 - ieeexplore.ieee.org
The development and availability of high quality corpus for many African languages is still hampered by dearth of appropriate software tools and devices. To be able to rapidly create …