Comparative Multinomial Text Classification Analysis of Naïve Bayes and XGBoost with SMOTE on Imbalanced Dataset

A Chaturvedi, S Yadav, MAMH Ansari… - … Intelligence in Pattern …, 2022 - Springer
In supervised machine learning, with an imbalanced dataset, achieving better classification
in minority classes is a major challenge. In such situation, machine learning model shows …

A combination of resampling method and machine learning for text classification on imbalanced data

H Feng, T Dan, W Wang, R Gui, J Liu, Y Li - International Conference on …, 2021 - Springer
Imbalanced data will affect the accuracy of text classification, in order to solve this issue, 11
different algorithms are used to resampling the dataset. Results show that, 5 different …

Reducing the effect of imbalance in text classification using SVD and GloVe with ensemble and deep learning

T Hossain, HZ Mauni, R Rab - Computing and Informatics, 2022 - cai.sk
Due to the recent escalation in the amount of text data available and used online, text
classification has become a staple for data analysts when extracting relevant information …

A combination of resampling and ensemble method for text classification on imbalanced data

H Feng, W Qin, H Wang, Y Li, G Hu - International Conference on Big Data, 2021 - Springer
One of the major factor which can affect the accuracy of text classification is the imbalanced
dataset. In order to find the suitable method to handle this issue, six different ensemble …

[PDF][PDF] Comparative Analysis using Various Performance Metrics in Imbalanced Data for Multi-class Text Classification

S Riyanto, SS Imas, T Djatna… - International Journal of …, 2023 - pdfs.semanticscholar.org
Precision, Recall, and F1-score are metrics that are often used to evaluate model
performance. Precision and Recall are very important to consider when the data is …

Binary text classification using genetic programming with crossover-based oversampling for imbalanced datasets

M Aljero, N Dimililer - Turkish Journal of Electrical …, 2023 - journals.tubitak.gov.tr
It is well known that classifiers trained using imbalanced datasets usually have a bias toward
the majority class. In this context, classification models can present a high classification …

Synthetic-MixUp: A Simple Framework for Imbalanced Text classification

R Asyrofi, R Fauzan - 2023 IEEE 12th Global Conference on …, 2023 - ieeexplore.ieee.org
This study focuses on analyzing effective strategies to handle imbalanced data in text
classification. The study employs probabilistic Naive Bayes as a common machine learning …

An improved native bayes classifier for imbalanced text categorization based on k-means and chi-square feature selection

F Meng, L Xu - 2018 Eighth International Conference on …, 2018 - ieeexplore.ieee.org
In multiclass text classification, the performance of classifier is usually very low while the
data distribution is very uneven. The reason may be that the majority class has a great …

Fine-Tuning of a BERT-Based Uncased Model for Unbalanced Text Classification

SK Behera, R Dash - … and Communication: Proceedings of ICAC 2021, 2022 - Springer
Unbalanced datasets make it hard for text classifiers to learn well. Having limited information
in minority classes makes it difficult to classify the unbalanced texts. In this study, a BERT …

Research on the Improvement of K-Nearest Neighbor Classifier for Imbalanced Text Categorization

Y Yang, L Xu - 2018 Eighth International Conference on …, 2018 - ieeexplore.ieee.org
Some of the most widely used text classification methods, such as the K-Nearest Neighbor
(KNN) algorithm, the Native Bayes (NB) algorithm and the Support Vector Machine (SVM) …