Arabic text classification based on word and document embeddings

A El Mahdaouy, E Gaussier, SO El Alaoui - Proceedings of the …, 2017 - Springer
Abstract Recently, Word Embeddings have been introduced as a major breakthrough in
Natural Language Processing (NLP) to learn viable representation of linguistic items based …

Multi-class document classification using improved word embeddings

BA Rabut, AC Fajardo, RP Medina - Proceedings of the 2nd …, 2019 - dl.acm.org
In this paper, we conducted an experiment to build a classification model that combines
different techniques in most of the Natural Language Processing Tasks. We used the word …

A comparative study on word embeddings in deep learning for text classification

C Wang, P Nulty, D Lillis - … of the 4th international conference on natural …, 2020 - dl.acm.org
Word embeddings act as an important component of deep models for providing input
features in downstream language tasks, such as sequence labelling and text classification …

Vector of locally-aggregated word embeddings (VLAWE): A novel document-level representation

RT Ionescu, AM Butnaru - arXiv preprint arXiv:1902.08850, 2019 - arxiv.org
In this paper, we propose a novel representation for text documents based on aggregating
word embedding vectors into document embeddings. Our approach is inspired by the Vector …

[HTML][HTML] Contextual semantic embeddings based on fine-tuned AraBERT model for Arabic text multi-class categorization

F El-Alami, SO El Alaoui, NE Nahnahi - Journal of King Saud University …, 2022 - Elsevier
Despite that pre-trained word embedding models have advanced a wide range of natural
language processing applications, they ignore the contextual information and meaning …

[HTML][HTML] Vector representation based on a supervised codebook for Nepali documents classification

C Sitaula, A Basnet, S Aryal - PeerJ Computer Science, 2021 - peerj.com
Document representation with outlier tokens exacerbates the classification performance due
to the uncertain orientation of such tokens. Most existing document representation methods …

Word sense representation based-method for Arabic text categorization

FZ El-Alami, SO El Alaoui - 2018 9th international symposium …, 2018 - ieeexplore.ieee.org
Word embedding and document embedding representations have proved good
performance in several natural language tasks such as information retrieval, text …

Bengali word embeddings and it's application in solving document classification problem

A Ahmad, MR Amin - 2016 19th international conference on …, 2016 - ieeexplore.ieee.org
In this paper, we present Bengali word embeddings and it's application in the classification
of news documents. Word embeddings are multi-dimensional vectors that can be created by …

Enhanced word embeddings using multi-semantic representation through lexical chains

T Ruas, CHP Ferreira, W Grosky, FO de França… - Information …, 2020 - Elsevier
The relationship between words in a sentence often tells us more about the underlying
semantic content of a document than its actual words, individually. In this work, we propose …

Document embedding based supervised methods for Turkish text classification

Hİ Çelenli, ST Öztürk, G Şahin, A Gerek… - 2018 3rd International …, 2018 - ieeexplore.ieee.org
Following the recent increase in the amount of available data, Deep Learning has become
the most popular branch of Machine Learning. This trend can also be seen in Natural …