Text categorization: past and present

A Dhar, H Mukherjee, NS Dash, K Roy - Artificial Intelligence Review, 2021 - Springer
Automatic text categorization is the operation of sorting out the text documents into pre-
defined text categories using some machine learning algorithms. Normally, it defines the …

Experimental evaluation of deep learning models for marathi text classification

A Kulkarni, M Mandhane, M Likhitkar… - Proceedings of the 2nd …, 2022 - Springer
The Marathi language is one of the prominent languages used in India. It is predominantly
spoken by the people of Maharashtra. Over the past decade, the usage of language on …

Text mining for Indonesian translation of the Quran: A systematic review

SJ Putra, T Mantoro… - … Conference on Computing …, 2017 - ieeexplore.ieee.org
Nowadays there is an increasing trend in computers used for learning Islamic knowledge
from Indonesian Translation of AL-Quran (ITQ). As a result, substantial knowledge is stored …

Application of tf-idf feature for categorizing documents of online bangla web text corpus

A Dhar, NS Dash, K Roy - … Engineering Informatics: Proceedings of the 6th …, 2018 - Springer
This paper explores the use of standard features as well as machine learning approaches
for categorizing Bangla text documents of online Web corpus. The TF-IDF feature with …

Classification of Bangla text documents based on inverse class frequency

A Dhar, NS Dash, K Roy - 2018 3rd International Conference …, 2018 - ieeexplore.ieee.org
With the increasing availability of the textual content on the internet, automatic text
classification or text categorization analogously becomes a prime key in solving the problem …

Categorization of Bangla web text documents based on TF-IDF-ICF text analysis scheme

A Dhar, NS Dash, K Roy - … –Digital Way: 52nd Annual Convention of the …, 2018 - Springer
With the rapid growth and huge availability of digital text data, automatic text categorization
or classification is a comparatively more effective solution in organizing and managing …

Author identification using sequential minimal optimization with rule-based decision tree on Indian literature in Marathi

KS Digamberrao, RS Prasad - Procedia computer science, 2018 - Elsevier
Authorship Identification is the task of identifying who wrote a given piece of text from a given
set of candidate authors (suspects). The increasingly large volumes of texts on the Internet …

[PDF][PDF] Classification of Gujarati documents using Naïve Bayes classifier

RM Rakholia, JR Saini - Indian Journal of Science and …, 2017 - researchgate.net
Objectives: Information overload on the web is a major problem faced by institutions and
businesses today. Sorting out some useful documents from the web which is written in …

Survey of progressive era of text summarization for indian and foreign languages using natural language processing

AD Dhawale, SB Kulkarni, V Kumbhakarna - … and Application: ICIDCA …, 2020 - Springer
The last few years of Data Science definitely show the upward trend in growth of popularity,
different industries which are effectively relating with data science & the transformation of …

Performance of classifiers in bangla text categorization

A Dhar, H Mukherjee, NS Dash… - … on Innovations in …, 2018 - ieeexplore.ieee.org
Automated text categorization or text classification has become an important text mining task
especially with the speedy development and increase of the number of on-line documents …