Knowledge-enhanced document embeddings for text classification

RA Sinoara, J Camacho-Collados, RG Rossi… - Knowledge-Based …, 2019 - Elsevier
Accurate semantic representation models are essential in text mining applications. For a
successful application of the text mining process, the text representation adopted must keep …

Building semantic kernels for text classification using wikipedia

P Wang, C Domeniconi - Proceedings of the 14th ACM SIGKDD …, 2008 - dl.acm.org
Document classification presents difficult challenges due to the sparsity and the high
dimensionality of text data, and to the complex semantics of the natural language. The …

Text categorization by boosting automatically extracted concepts

L Cai, T Hofmann - Proceedings of the 26th annual international ACM …, 2003 - dl.acm.org
Term-based representations of documents have found wide-spread use in information
retrieval. However, one of the main shortcomings of such methods is that they largely …

Document classification with distributions of word vectors

C Xing, D Wang, X Zhang, C Liu - Signal and information …, 2014 - ieeexplore.ieee.org
The word-to-vector (W2V) technique represents words as low-dimensional continuous
vectors in such a way that semantic related words are close to each other. This produces a …

An approach to the use of word embeddings in an opinion classification task

F Enríquez, JA Troyano, T López-Solaz - Expert Systems with Applications, 2016 - Elsevier
In this paper we show how a vector-based word representation obtained via word2vec can
help to improve the results of a document classifier based on bags of words. Both models …

Bag-of-concepts: Comprehending document representation through clustering words in distributed representation

HK Kim, H Kim, S Cho - Neurocomputing, 2017 - Elsevier
Two document representation methods are mainly used in solving text mining problems.
Known for its intuitive and simple interpretability, the bag-of-words method represents a …

[PDF][PDF] Linear discriminant analysis in document classification

K Torkkola - IEEE ICDM workshop on text mining, 2001 - cse.unr.edu
Document representation using the bag-of-words approach may require bringing the
dimensionality of the representation down in order to be able to make effective use of …

[PDF][PDF] A New Method of Region Embedding for Text Classification.

C Qiao, B Huang, G Niu, D Li, D Dong, W He… - ICLR (Poster …, 2018 - research.baidu.com
To represent a text as a bag of properly identified “phrases” and use the representation for
processing the text is proved to be useful. The key question here is how to identify the …

Text classification based on multi-word with support vector machine

W Zhang, T Yoshida, X Tang - Knowledge-Based Systems, 2008 - Elsevier
One of the main themes supporting text mining is text representation, ie, looking for the
appropriate terms to transfer the documents into numerical vectors. Recently, many efforts …

Task-oriented word embedding for text classification

Q Liu, HY Huang, Y Gao, X Wei, Y Tian… - Proceedings of the 27th …, 2018 - aclanthology.org
Distributed word representation plays a pivotal role in various natural language processing
tasks. In spite of its success, most existing methods only consider contextual information …