Sentence-level novelty detection in English and Malay

AT Kwee, FS Tsai, W Tang - Advances in Knowledge Discovery and Data …, 2009 - Springer
Novelty detection (ND) is a process for identifying information from an incoming stream of
documents. Although there are many studies of ND on English language documents …

Stemming Malay text and its application in automatic text categorization

M Yasukawa, HT Lim, H Yokoo - IEICE transactions on information …, 2009 - search.ieice.org
In Malay language, there are no conjugations and declensions and affixes have important
grammatical functions. In Malay, the same word may function as a noun, an adjective, an …

Behavior-driven multilingual stemming

G Fliedner, S Sundaramurthy, A Subramanian… - US Patent …, 2014 - Google Patents
BACKGROUND As the amount of information available electronically increases, there is a
corresponding need to improve the way in which users are able to locate information of …

Malay documents clustering algorithm based on singular value decomposition

N Ab Samat, MAA Murad, MT Abdullah… - Journal of Theoretical …, 2005 - academia.edu
Document categorization is a widely researched area of information retrieval. A research on
Malay natural language processing has been done up to the level of retrieving documents …

Query translation architecture for Malay-English cross-language information retrieval system

NH Rais, MT Abdullah, RA Kadir - … International Symposium on …, 2010 - ieeexplore.ieee.org
This paper discusses research on query translation events in Malay-English Cross-
Language Information Retrieval (CLIR) system. We assume that by improving query …

Text simplification for Malay corpus: A Review

S Omar, JA Bakar, MM Nadzir… - … on Computer & …, 2021 - ieeexplore.ieee.org
Text Simplification (TS) is one of the directions for recent studies in NLP. The TS aims to
rewrite the complicated text into a simpler sentence, which is easier to understand by human …

Categorization of Malay documents using latent semantic indexing

N Ab Samat, MAA Murad, R Atan… - Proceedings of …, 2008 - soc.uum.edu.my
Document categorization is a widely researched area of information retrieval. A popular
approach to categorize documents is the Vector Space Model (VSM), which represents texts …

Construction Of Computational Malay Lexicon Using Affixed Words Analyser

H Hasmy, ZA Bakar, F Ahmad… - JIRKM| Journal of …, 2012 - jirkm.pecamp.org
This paper concerns an experiment on constructing computational Malay lexicon from Malay
root word. A lexicon is a repository of words and that is also known as the backbone of any …

Bayesian and Fuzzy Logic Implementation for SPAM/UCE Inline Filter

EM Tamil, WNAW Samsudin, MYI Idris… - International …, 2008 - search.proquest.com
Current growth in the use of email for communication and the corresponding rising problem
of unsolicited email, also known as 'spam', has generated a need of automatic processing of …