Using Wikipedia knowledge to improve text classification

P Wang, J Hu, HJ Zeng, Z Chen - Knowledge and Information Systems, 2009 - Springer
Text classification has been widely used to assist users with the discovery of useful
information from the Internet. However, traditional classification methods are based on the …

Building semantic kernels for text classification using wikipedia

P Wang, C Domeniconi - Proceedings of the 14th ACM SIGKDD …, 2008 - dl.acm.org
Document classification presents difficult challenges due to the sparsity and the high
dimensionality of text data, and to the complex semantics of the natural language. The …

[PDF][PDF] Overcoming the brittleness bottleneck using Wikipedia: Enhancing text categorization with encyclopedic knowledge

E Gabrilovich, S Markovitch - AAAI, 2006 - cdn.aaai.org
When humans approach the task of text categorization, they interpret the specific wording of
the document in the much larger context of their background knowledge and experience. On …

[PDF][PDF] Text Categorization with Knowledge Transfer from Heterogeneous Data Sources.

R Gupta, LA Ratinov - AAAI, 2008 - cdn.aaai.org
Multi-category classification of short dialogues is a common task performed by humans.
When assigning a question to an expert, a customer service operator tries to classify the …

[PDF][PDF] Harnessing the Expertise of 70,000 Human Editors: Knowledge-Based Feature Generation for Text Categorization.

E Gabrilovich, S Markovitch - Journal of Machine Learning Research, 2007 - jmlr.org
Most existing methods for text categorization employ induction algorithms that use the words
appearing in the training documents as features. While they perform well in many …

An efficient Wikipedia semantic matching approach to text document classification

Z Wu, H Zhu, G Li, Z Cui, H Huang, J Li, E Chen… - Information Sciences, 2017 - Elsevier
A traditional classification approach based on keyword matching represents each text
document as a set of keywords, without considering the semantic information, thereby …

Classification of short texts by deploying topical annotations

D Vitale, P Ferragina, U Scaiella - European Conference on Information …, 2012 - Springer
We propose a novel approach to the classification of short texts based on two factors: the
use of Wikipedia-based annotators that have been recently introduced to detect the main …

[PDF][PDF] Feature generation for text categorization using world knowledge

E Gabrilovich, S Markovitch - IJCAI, 2005 - researchgate.net
We enhance machine learning algorithms for text categorization with generated features
based on domain-specific and common-sense knowledge. This knowledge is represented …

Boosting for text classification with semantic features

S Bloehdorn, A Hotho - … workshop on knowledge discovery on the web, 2004 - Springer
Current text classification systems typically use term stems for representing document
content. Semantic Web technologies allow the usage of features on a higher semantic level …

[PDF][PDF] A semantic kernel to classify texts with very few training examples

R Basili, M Cammisa, A Moschitti - Informatica, 2006 - informatica.si
The access to Web distributed information often requires the detection and filtering of the
target objects (eg texts) before specialized automatic retrieval processes can be applied. In …