The Internet, one of the most widely used technological inventions, has allowed online social media platforms such as Twitter and Facebook to proliferate. These platforms are …
Africa has over 2000 languages; however, those languages are not well represented in the existing natural language processing ecosystem. African languages lack essential digital …
Hausa is a major Chadic language, spoken by over 100 million people in Africa. However, from a computational linguistic perspective, it is considered a low-resource language, with …
Creole languages such as Nigerian Pidgin English and Haitian Creole are under-resourced and largely ignored in the NLP literature. Creoles typically result from the fusion of a foreign …
Nigerian Pidgin is an English-derived contact language, traditionally oral, spoken by approximately 100 million people. No orthographic standard has yet …
Safeguarding Personally Identifiable Information (PII) in an increasingly interconnected world presents daunting challenges, particularly in low-resource languages like Luganda …
B Okgetheng, G Malema - Proceedings of the 2023 7th International …, 2023 - dl.acm.org
Named Entity Recognition (NER) is a fundamental task in Natural Language Processing (NLP) focused on identifying entities like individuals, organizations, and locations within text …
Hausa, a major Chadic language spoken by over 100 million people in Africa, faces a challenge in the digital age. While widely used, it is considered a low-resource language …
O Toyin, CO Akinduyite - 2024 International Conference on …, 2024 - ieeexplore.ieee.org
Part-of-speech tagging is a linguistic task that assigns the best sequence of tags to a given sequence of input words. The process falls under word sense disambiguation, which …