[HTML][HTML] A recent overview of the state-of-the-art elements of text classification

MM Mirończuk, J Protasiewicz - Expert Systems with Applications, 2018 - Elsevier
The aim of this study is to provide an overview the state-of-the-art elements of text
classification. For this purpose, we first select and investigate the primary and recent studies …

Web text categorization based on statistical merging algorithm in big data environment

R Wang, G Wang - International Journal of Ambient Computing and …, 2019 - igi-global.com
In the field of modern information technology, how to find information quickly, accurately and
comprehensively that users really needed has become the focus of research in this field. In …

A variational bayes model for count data learning and classification

AS Bakhtiari, N Bouguila - Engineering Applications of Artificial Intelligence, 2014 - Elsevier
Several machine learning and knowledge discovery approaches have been proposed for
count data modeling and classification. In particular, latent Dirichlet allocation (LDA)(Blei et …

Chi-square statistics feature selection based on term frequency and distribution for text categorization

C Jin, T Ma, R Hou, M Tang, Y Tian… - IETE journal of …, 2015 - Taylor & Francis
Text categorization (TC) becomes the key technology to find relevant and timely information
from a volume of digital documents, and feature selection techniques are proposed to …

Multiple Bayesian discriminant functions for high-dimensional massive data classification

J Zhang, S Wang, L Chen, P Gallinari - Data mining and knowledge …, 2017 - Springer
The presence of complex distributions of samples concealed in high-dimensional, massive
sample-size data challenges all of the current classification methods for data mining …

Kernel-based linear classification on categorical data

L Chen, Y Ye, G Guo, J Zhu - Soft Computing, 2016 - Springer
Kernel-based methods have been widely investigated in the soft-computing community.
However, they focus mainly on numeric data. In this paper, we propose a novel method for …

Kernel-based data transformation model for nonlinear classification of symbolic data

X Yan, L Chen, G Guo - Soft Computing, 2022 - Springer
Symbolic data are usually composed of some categorical variables used to represent
discrete entities in many real-world applications. Mining of symbolic data is more difficult …

[PDF][PDF] A new document representation based on global policy for supervised term weighting schemes in text categorization

L Jia, B Zhang - Mathematical Biosciences and Engineering, 2022 - aimspress.com
There are two main factors involved in documents classification, document representation
method and classification algorithm. In this study, we focus on document representation …

Adaptive system for handling variety in big text

S Pathak, D Rajeshwar Rao - … : Proceedings of Internet of Things for …, 2018 - Springer
Today in every corporate, banking, judicial, or medical ecosystem varieties of text are
generated like customer reviews, product manuals, white papers, system logs, and usage …

A similarity based supervised decision rule for qualitative improvement of text categorization

T Basu, CA Murthy - Fundamenta Informaticae, 2015 - content.iospress.com
The similarity based decision rule computes the similarity between a new test document and
the existing documents of the training set that belong to various categories. The new …