A survey on Urdu and Urdu like language stemmers and stemming techniques

A Jabbar, S Iqbal, MUG Khan, S Hussain - Artificial Intelligence Review, 2018 - Springer
Stemming is one of the basic steps in natural language processing applications such as
information retrieval, parts of speech tagging, syntactic parsing and machine translation, etc …

Empirical evaluation and study of text stemming algorithms

A Jabbar, S Iqbal, MI Tamimy, S Hussain… - Artificial Intelligence …, 2020 - Springer
Text stemming is one of the basic preprocessing step for Natural Language Processing
applications which is used to transform different word forms into a standard root form. For …

Pseudo-relevance feedback based query expansion using boosting algorithm

I Rasheed, H Banka, HM Khan - Artificial Intelligence Review, 2021 - Springer
Retrieving relevant documents from a large set using the original query is a formidable
challenge. A generic approach to improve the retrieval process is realized using pseudo …

An Analytical Analysis of Text Stemming Methodologies in Information Retrieval and Natural Language Processing Systems

A Jabbar, S Iqbal, MI Tamimy, A Rehman… - IEEE …, 2023 - ieeexplore.ieee.org
The exponential increase in textual unstructured digital data creates significant demand for
advanced and smart stemming systems. As a preprocessing stage, stemming is applied in …

A comparative review of Urdu stemmers: Approaches and challenges

A Jabbar, S ul Islam, S Hussain, A Akhunzada… - Computer Science …, 2019 - Elsevier
With the advent of globalization epoch, the Internet-based resources for Urdu are increasing
in depth and breadth at a higher pace than ever and thus require a mechanism for …

RUTUT: roman Urdu to Urdu translator based on character substitution rules and unicode mapping

M Shahroz, MF Mushtaq, A Mehmood, S Ullah… - IEEE …, 2020 - ieeexplore.ieee.org
Urdu language written in English alphabets for communication is known as Roman Urdu. In
pronunciation, both are the same but different in spelling and have different shapes of the …

WebKey: a graph-based method for event detection in web news

E Rasouli, S Zarifzadeh, AJ Rafsanjani - Journal of Intelligent Information …, 2020 - Springer
With rapid and vast publishing of news over the Internet, there is a surge of interest to detect
underlying hot events from online news streams. There are two main challenges in event …

[PDF][PDF] Minimalist Entity Disambiguation for Mid-Resource Languages

B Kruit - Proceedings of The Fourth Workshop on Simple and …, 2023 - aclanthology.org
For many languages and applications, even though enough data is available for training
Named Entity Disambiguation (NED) systems, few off-the-shelf models are available for use …

SYN2015: representative corpus of written Czech

M Křen, V Cvrček, T Čapka, A Čermáková, M Hnátková… - 2015 - lindat.mff.cuni.cz
3. Prepis muze být rozdelen do vıce souboru. Soubory pak musı být vzestupne ocıslovány a
název souboru pak musı koncit cıslem souboru.(Naprıklad: Rozhovor s Janem Sokolem 1 …

Building a multilevel inflection handling stemmer to improve search effectiveness for Urdu language

A Jabbar, S Iqba, A Alaulamie, M Ilahi - IEEE Access, 2024 - ieeexplore.ieee.org
Stemming is an essential step in various Natural Language Processing (NLP) applications
and is used to reduce different variants of the query words to a standard form to avoid the …