A combination of resampling and ensemble method for text classification on imbalanced data

H Feng, W Qin, H Wang, Y Li, G Hu - International Conference on Big Data, 2021 - Springer
One of the major factor which can affect the accuracy of text classification is the imbalanced
dataset. In order to find the suitable method to handle this issue, six different ensemble …

Text generation for imbalanced text classification

S Akkaradamrongrat, P Kachamas… - … Joint Conference on …, 2019 - ieeexplore.ieee.org
The problem of imbalanced data can be frequently found in the real-world data. It leads to
the bias of classification models, that is, the models predict most samples as major classes …

Fine-Tuning of a BERT-Based Uncased Model for Unbalanced Text Classification

SK Behera, R Dash - … and Communication: Proceedings of ICAC 2021, 2022 - Springer
Unbalanced datasets make it hard for text classifiers to learn well. Having limited information
in minority classes makes it difficult to classify the unbalanced texts. In this study, a BERT …

ForesTexter: An efficient random forest algorithm for imbalanced text categorization

Q Wu, Y Ye, H Zhang, MK Ng, SS Ho - Knowledge-Based Systems, 2014 - Elsevier
In this paper, we propose a new random forest (RF) based ensemble method, F ores T exter,
to solve the imbalanced text categorization problems. RF has shown great success in many …

Optimal feature selection for imbalanced text classification

A Khurana, OP Verma - IEEE Transactions on Artificial …, 2022 - ieeexplore.ieee.org
Textual data suffers from two main problems, large number of features and class imbalance.
Many conventional approaches and their variants exist in literature to solve both these …

Reducing the effect of imbalance in text classification using SVD and GloVe with ensemble and deep learning

T Hossain, HZ Mauni, R Rab - Computing and Informatics, 2022 - cai.sk
Due to the recent escalation in the amount of text data available and used online, text
classification has become a staple for data analysts when extracting relevant information …

A Novel Classification Method Based on a Two-Phase Technique for Learning Imbalanced Text Data

DC Li, SC Chen, YS Lin, WY Hsu - Symmetry, 2022 - mdpi.com
The problem of imbalanced data has a heavy impact on the performance of learning models.
In the case of an imbalanced text dataset, minority class data are often classified to the …

Comparison of feature selection for imbalance text datasets

A Chandra - 2019 International Conference on Information …, 2019 - ieeexplore.ieee.org
The numbers of documents are increasing rapidly in a web format. Therefore, automatic
document classification is needed to help human to classify the documents. Text …

IDA: An Imbalanced Data Augmentation for Text Classification

A Siagh, FZ Laallam, O Kazar, H Salem… - … Conference on Intelligent …, 2023 - Springer
With the increasing amount of textual data generated online, an automatic system for text
classification is imperative. However, classification models face the challenge of limited and …

T5W: A Paraphrasing Approach to Oversampling for Imbalanced Text Classification

AP Patil, S Jere, R Ram… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Imbalanced datasets are datasets with one or more underrepresented classes when
compared to other classes. Such datasets pose problems during classification due to the …