Exploiting Turkish Wikipedia as a semantic resource for text classification

M Poyraz, MC Ganiz, S Akyokuş… - … on Innovations in …, 2012 - ieeexplore.ieee.org
Majority of the existing text classification algorithms are based on the “bag of words” (BOW)
approach, in which the documents are represented as weighted occurrence frequencies of …

Semantic enrichment of text representation with wikipedia for text classification

H Yamakawa, A Feldman - 2010 IEEE International …, 2010 - ieeexplore.ieee.org
Text classification is a widely studied topic in the area of machine learning. A number of
techniques have been developed to represent and classify text documents. Most of the …

Applying RDF ontologies to improve text classification

W Xiaoyue, B Rujiang - 2009 International Conference on …, 2009 - ieeexplore.ieee.org
Current classification methods are based on the “bag of words” (BOW)
representation, which only accounts for term frequency in the documents, and ignores …

Text2arff: Automatic feature extraction software for Turkish texts

MF Amasyali, F Davletov, AI Torayew… - 2010 IEEE 18th Signal …, 2010 - ieeexplore.ieee.org
Which features are the most important for the text classification tasks? In the automatic text
categorization area, several studies seek answers to this question. In this paper, a feature …

Using Wikipedia knowledge to improve text classification

P Wang, J Hu, HJ Zeng, Z Chen - Knowledge and Information Systems, 2009 - Springer
Text classification has been widely used to assist users with the discovery of useful
information from the Internet. However, traditional classification methods are based on the …

Improving documents classification with semantic features

B Rujiang, L Junhua - 2009 Second International Symposium …, 2009 - ieeexplore.ieee.org
Successful text classification is highly dependent on the representations used. Currently,
most approaches to text classification adopt the 'bag-of-words' document representation …

Improving text categorization with semantic knowledge in Wikipedia

X Wang, Y Jia, R Chen, H Fan… - IEICE TRANSACTIONS on …, 2013 - search.ieice.org
Text categorization, especially short text categorization, is a difficult and challenging task
since the text data is sparse and multidimensional. In traditional text classification methods …

Developing a text categorization template for Turkish news portals

C Toraman, F Can, S Koçberber - … International Symposium on …, 2011 - ieeexplore.ieee.org
In news portals, text category information is needed for news presentation. However, for
many news stories the category information is unavailable, incorrectly assigned or too …

A novel approach to document classification using wordnet

K Sarkar, R Law - arXiv preprint arXiv:1510.02755, 2015 - arxiv.org
Content based Document Classification is one of the biggest challenges in the context of
free text mining. Current algorithms on document classifications mostly rely on cluster …

Improving text classification by using encyclopedia knowledge

P Wang, J Hu, HJ Zeng, L Chen… - … conference on data …, 2007 - ieeexplore.ieee.org
The exponential growth of text documents available on the Internet has created an urgent
need for accurate, fast, and general purpose text classification algorithms. However, the" …