[PDF][PDF] Preprocessing techniques for text mining-an overview

S Vijayarani, MJ Ilamathi, M Nithya - International Journal of …, 2015 - researchgate.net
Data mining is used for finding the useful information from the large amount of data. Data
mining techniques are used to implement and solve different types of research problems …

ClubCF: A clustering-based collaborative filtering approach for big data application

R Hu, W Dou, J Liu - IEEE transactions on emerging topics in …, 2014 - ieeexplore.ieee.org
Spurred by service computing and cloud computing, an increasing number of services are
emerging on the Internet. As a result, service-relevant data become too big to be effectively …

An Analytical Analysis of Text Stemming Methodologies in Information Retrieval and Natural Language Processing Systems

A Jabbar, S Iqbal, MI Tamimy, A Rehman… - IEEE …, 2023 - ieeexplore.ieee.org
The exponential increase in textual unstructured digital data creates significant demand for
advanced and smart stemming systems. As a preprocessing stage, stemming is applied in …

Application of classification models on maintenance records through text mining approach in industrial environment

U Rahman, MU Mahbub - Journal of Quality in Maintenance …, 2023 - emerald.com
Purpose The data created from regular maintenance activities of equipment are stored as
text in industrial plants. The size of these data is increasing rapidly nowadays. Text mining …

[PDF][PDF] Big data analytics using Hadoop

B Dhyani, A Barthwal - International Journal of Computer …, 2014 - academia.edu
This paper is an effort to present the basic understanding of BIG DATA is and it's usefulness
to an organization from the performance perspective. Along-with the introduction of BIG …

[PDF][PDF] APPLICATION OF RANKING BASED ATTRIBUTE SELECTION FILTERS TO PERFORM AUTOMATED EVALUATION OF DESCRIPTIVE ANSWERS THROUGH …

CS Kumar, RJ Sree - ICTACT Journal on Soft Computing, 2014 - pdfs.semanticscholar.org
In this paper, we study the performance of various models for automated evaluation of
descriptive answers by using rank based feature selection filters for dimensionality …

[PDF][PDF] Influence of Gujarati STEmmeR in supervised learning of web page categorization

CD Patel, JM Patel - International Journal of Intelligent Systems …, 2021 - researchgate.net
With the large quantity of information offered on-line, it's equally essential to retrieve correct
information for a user query. A large amount of data is available in digital form in multiple …

LALITHA: A light weight Malayalam stemmer using suffix stripping method

U Prajitha, C Sreejith, PCR Raj - … International Conference on …, 2013 - ieeexplore.ieee.org
Stemming is the process of removing the affixes from inflections and to return the root form.
Malayalam is highly agglutinative in nature and hundreds of inflections are possible for each …

De-redundancy relative discrimination criterion-based feature selection for text data

L Jin, L Zhang - 2022 International Joint Conference on Neural …, 2022 - ieeexplore.ieee.org
High dimensionality of text data would degrade the performance of text classification for the
existence of irrelevant terms. Thus, it is necessary to perform feature selection to remove …

[PDF][PDF] A rule-based stemmer for Punjabi adjectives

H Kaur, PK Buttar - International Journal of Advanced Research in …, 2020 - academia.edu
This research work is concerned with the development of a rule-based stemmer for
stemming of adjectives in the Punjabi language. Stemming is a method of deriving the root …