A stemming algorithm for Latin text databases

R Schinke, M Greengrass, AM Robertson… - Journal of …, 1996 - emerald.com
This paper describes the design of a stemming algorithm for searching databases of Latin
text. The algorithm uses a simple longest‐match approach with some recoding but differs …

[PDF][PDF] Simple rules malay stemmer

SA Fadzli, AK Norsalehen, IA Syarilla… - … on Informatics and …, 2012 - academia.edu
Stemming is a morphological analysis that tries to associate variants of the same term with a
common root form. It is important to improve recall and precision in IR systems. Malay word …

[PDF][PDF] Arabic word stemming algorithms and retrieval effectiveness

TMT Sembok, BA Ata - Proceedings of the World Congress on …, 2013 - academia.edu
Systems (IRS) is generally about retrieving of relevant documents pertaining to information
needs. The more the system able to understand the contents of documents the more …

[PDF][PDF] Effectiveness of stemming and n-grams string similarity matching on Malay documents

TMT Sembok, ZA Bakar - … Journal of Applied Mathematics and Informatics, 2011 - Citeseer
There are two main classes of conflation algorithms, namely, string-similarity algorithms and
stemming algorithms. String-similarity matching algorithms, bi-grams and tri-grams, are used …

[PDF][PDF] Word stemming algorithms and retrieval effectiveness in Malay and Arabic documents retrieval systems

TMT Sembok - Proceeding of World Academy of Science, Engineering …, 2005 - Citeseer
Systems (IRS) is generally about understanding of information in the documents concern.
The more the system able to understand the contents of documents the more effective will be …

[PDF][PDF] A rule and template based stemming algorithm for Arabic language

T Sembok, BMA Ata, ZA Bakar - Int J Math Mod Meth Appl Sci, 2011 - academia.edu
Stemming is defined as the conflation of all variations of specific words to a single form
called the root or stem. Stemming plays a vital role in natural language processing and …

[PDF][PDF] A rule-based Arabic stemming algorithm

TMT Sembok, BMA Ata, ZA Bakar - Proceedings of the European …, 2011 - academia.edu
Stemming is used in information retrieval systems to reduce variant word forms to common
roots in order to improve retrieval effectiveness. As in other languages, there is a need for an …

[DOC][DOC] Experiments with n-gram string-similarity measure on malay texts

TMT Sembok, P Willett - Universiti Kebangsaan Malaysia, 1995 - researchgate.net
Conflation is used in information retrieval systems to reduce variant word forms to common
roots in order to improve retrieval effectiveness. Conflation algorithms are classified into two …

Experiments in Malay information retrieval

TMT Sembok, ZA Bakar… - Proceedings of the 2011 …, 2011 - ieeexplore.ieee.org
There have been very few studies on the use of conflation algorithms for indexing and
retrieval of Malay documents. The two main classes of conflation algorithms are string …

Retrieval of morphological variants in searches of Latin text databases

R Schinke, M Greengrass, AM Robertson… - Computers and the …, 1997 - Springer
This paper reports a detailed evaluation of the effectiveness of a system that has been
developed for the identification and retrieval of morphological variants in searches of Latin …