F Ahmad, M Yusoff, TMT Sembok - Journal of the American …, 1996 - Wiley Online Library
Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As in other languages, there is a need for an …
The thesaurus has become another valuable structure in any information retrieval system. It is a list of terms and concepts that provides a controlled vocabulary of words to use in document …
ZA Bakar, NA Rahman - International Conference on Asian Digital …, 2003 - Springer
Information technology has enabled information in many forms, such as text, image, or sound, to be accessed widely using search terms via a computer. Due to this type of …
TMT Sembok, BA Ata - Proceedings of the World Congress on …, 2013 - academia.edu
… Systems (IRS) is generally about retrieving relevant documents pertaining to information needs. The more the system is able to understand the contents of documents, the more …
TMT Sembok, ZA Bakar - … Journal of Applied Mathematics and Informatics, 2011 - Citeseer
There are two main classes of conflation algorithms, namely, string-similarity algorithms and stemming algorithms. String-similarity matching algorithms, bi-grams and tri-grams, are used …
A text stemmer is a useful language preprocessing tool in the fields of information retrieval, text mining, and natural language processing. It is used to map morphological …
TMT Sembok - Proceeding of World Academy of Science, Engineering …, 2005 - Citeseer
… Systems (IRS) is generally about understanding the information in the documents concerned. The more the system is able to understand the contents of documents, the more effective will be …
To make the results of information retrieval (IR) more accurate, a stemmer is needed to differentiate the words when searching for useful information. This research aims to analyze both …
A web-based visualization system is developed to visualize the similarity between root words in Malay-translated Quran documents. The visualization of terms used is based on …