The Urdu language is used by approximately 200 million people for spoken and written communication. A large body of unstructured Urdu textual data is available worldwide. We can …
MS Husain - International Journal on Natural Language Computing …, 2012 - academia.edu
This paper presents an unsupervised approach for the development of a stemmer (for the case of the Urdu and Marathi languages). In particular, during the last few years, a wide range of …
In information retrieval (IR), documents that match the query are retrieved. Search engines usually conflate word variants into a common stem when indexing documents because …
S Khan, W Anwar, U Bajwa, X Wang - International Arab Journal of …, 2015 - ccis2k.org
Word stemming is one of the most significant factors that affect the performance of a Natural Language Processing (NLP) application such as an Information Retrieval (IR) system, part of …
M Zahedi, AG Sorkhi - Arabian Journal for Science and Engineering, 2013 - Springer
Persian text is usually associated with a wide range of important or useless features. This is the main reason why the feature extraction process is one of the difficult tasks in the field of …
K Riaz - BCS IRSG Symposium: Future Directions in Information …, 2007 - scienceopen.com
This paper explains the challenges pertaining to Urdu stemming and presents a rule-based prototype with a few rules implemented for Urdu to illustrate the intricacies. It shows that …
K Riaz - Proceedings of the 2nd PhD workshop on Information …, 2008 - dl.acm.org
This paper describes a thesis proposal to do concept search in non-English and non-European languages. Urdu is chosen as an example language because of its unique …
Persian is a challenging language in the field of NLP. Right-to-left orthography, complex morphology, complicated grammatical rules, and different forms of letters make it an …
SA Khan, W Anwar, UI Bajwa - … of the 2nd Workshop on South …, 2011 - aclanthology.org
The Urdu language poses several challenges for Natural Language Processing (NLP), largely due to its rich morphology. In this language, morphological processing becomes particularly …