Multi-co-training for document classification using various document representations: TF–IDF, LDA, and Doc2Vec

D Kim, D Seo, S Cho, P Kang - Information sciences, 2019 - Elsevier
The purpose of document classification is to assign the most appropriate label to a specified
document. The main challenges in document classification are insufficient label information …

Text classification from unlabeled documents with bootstrapping and feature projection techniques

Y Ko, J Seo - Information Processing & Management, 2009 - Elsevier
Many machine learning algorithms have been applied to text classification tasks. In the
machine learning paradigm, a general inductive process automatically builds a text classifier …

Text classification method based on self-training and LDA topic models

M Pavlinek, V Podgorelec - Expert Systems with Applications, 2017 - Elsevier
Supervised text classification methods are efficient when they can learn with reasonably
sized labeled sets. On the other hand, when only a small set of labeled documents is …

[PDF][PDF] Text classification by labeling words

B Liu, X Li, WS Lee, PS Yu - Aaai, 2004 - cdn.aaai.org
Traditionally, text classifiers are built from labeled training examples. Labeling is usually
done manually by human experts (or the users), which is a labor intensive and time …

Effective document labeling with very few seed words: A topic model approach

C Li, J Xing, A Sun, Z Ma - Proceedings of the 25th ACM international on …, 2016 - dl.acm.org
Developing text classifiers often requires a large number of labeled documents as training
examples. However, manually labeling documents is costly and time-consuming. Recently …

Visual and textual deep feature fusion for document image classification

S Bakkali, Z Ming, M Coustaty… - Proceedings of the …, 2020 - openaccess.thecvf.com
The topic of text document image classification has been explored extensively over the past
few years. Most recent approaches handled this task by jointly learning the visual features of …

SCDV: Sparse Composite Document Vectors using soft clustering over distributional representations

D Mekala, V Gupta, B Paranjape, H Karnick - arXiv preprint arXiv …, 2016 - arxiv.org
We present a feature vector formation technique for documents-Sparse Composite
Document Vector (SCDV)-which overcomes several shortcomings of the current …

A fusion model-based label embedding and self-interaction attention for text classification

Y Dong, P Liu, Z Zhu, Q Wang, Q Zhang - IEEE Access, 2019 - ieeexplore.ieee.org
Text classification is a pivotal task in NLP (Natural Language Processing), which has
received widespread attention recently. Most of the existing methods leverage the power of …

Towards unsupervised text classification leveraging experts and word embeddings

Z Haj-Yahia, A Sieg, LA Deleris - … of the 57th annual meeting of …, 2019 - aclanthology.org
Text classification aims at mapping documents into a set of predefined categories.
Supervised machine learning models have shown great success in this area but they …

Document-level text classification using single-layer multisize filters convolutional neural network

MP Akhter, Z Jiangbin, IR Naqvi, M Abdelmajeed… - IEEE …, 2020 - ieeexplore.ieee.org
The rapid growth of electronic documents are causing problems like unstructured data that
need more time and effort to search a relevant document. Text Document Classification …