Bengali word embeddings and it's application in solving document classification problem

A Ahmad, MR Amin - 2016 19th international conference on …, 2016 - ieeexplore.ieee.org
… directly for news document classification; hence, we show that document classification can
be … Software framework for topic modelling with large corpora." In Proceedings of the LREC …

Multi-category news classification using Support Vector Machine based classifiers

P Saigal, V Khanna - SN Applied Sciences, 2020 - Springer
… text document classification. In this paper, we compare the results for LS-SVM, TWSVM and
LS-TWSVM for multi-category News classification … Unlike other corpora, 20 Newsgroups data …

Open-schema event profiling for massive news corpora

Q Yuan, X Ren, W He, C Zhang, X Geng… - Proceedings of the 27th …, 2018 - dl.acm.org
… profiles for news events from open-domain news corpora. A … Document-level ones treat
news articles independent of each other and overlook their relations. Worse still, document

Classifying Web corpora into domain and genre using automatic feature identification

S Sharoff - Proceedings of the 3rd Web as Corpus Workshop, 2007 - books.google.com
… clusters are more specific in terms of their domains, e.g. news from Iraq, contagious diseases
… small samples of 170 documents for English and for Russian classified into five classes of …

Discovering event evolution graphs from news corpora

CC Yang, X Shi, CP Wei - IEEE Transactions on Systems, Man …, 2009 - ieeexplore.ieee.org
… detecting news topics and tracking new stories for a news … news document as a query that
was made on the previous clustered documents to determine if the incoming news document

Knowledge-enhanced document embeddings for text classification

RA Sinoara, J Camacho-Collados, RG Rossi… - Knowledge-Based …, 2019 - Elsevier
… can mention e-mail classification and spam filtering, news and scientific articles organization,
… of huge corpora. In our approach, this knowledge is effortlessly transmitted to the document

A study on machine learning and deep learning methods using feature extraction for Bengali news document classification

N Humaira, H Afia, S Haque - 2021 Asian Conference on …, 2021 - ieeexplore.ieee.org
… , annotated corpora, name … to classify Bengali news articles where they showed that n-gram
length 2 or 3 are most useful. It was one of the earliest works in Bengali News Classification. …

A category classification algorithm for Indonesian and Malay news documents

J Jaafar, Z Indra, N Zamin - Jurnal Teknologi, 2016 - journals.utm.my
… the language and then classify the category for identified news documents. Furthermore, top…
online news corpora classification challenges: rapid data growth of online news documents, …

The influence of feature representation of text on the performance of document classification

S Martinčić-Ipšić, T Miličić, L Todorovski - Applied Sciences, 2019 - mdpi.com
… become an important tool for the relevant applications of news filtering, information retrieval,
… Table 1 provides an overview of the properties of the four document corpora used in the …

Chinese news text classification based on machine learning algorithm

F Miao, P Zhang, L Jin, H Wu - 2018 10th International …, 2018 - ieeexplore.ieee.org
document, and N is the total number of all documents in corpora, n is the number of documents
… in the whole corpora, and reflects the feature between the documents. This algorithm is …