Text categorization by boosting automatically extracted concepts

L Cai, T Hofmann - Proceedings of the 26th annual international ACM …, 2003 - dl.acm.org
Term-based representations of documents have found wide-spread use in information
retrieval. However, one of the main shortcomings of such methods is that they largely …

Towards unsupervised text classification leveraging experts and word embeddings

Z Haj-Yahia, A Sieg, LA Deleris - … of the 57th annual meeting of …, 2019 - aclanthology.org
Text classification aims at mapping documents into a set of predefined categories.
Supervised machine learning models have shown great success in this area but they …

Boosting for text classification with semantic features

S Bloehdorn, A Hotho - … workshop on knowledge discovery on the web, 2004 - Springer
Current text classification systems typically use term stems for representing document
content. Semantic Web technologies allow the usage of features on a higher semantic level …

Complex linguistic features for text classification: A comprehensive study

A Moschitti, R Basili - European conference on information retrieval, 2004 - Springer
Previous researches on advanced representations for document retrieval have shown that
statistical state-of-the-art models are not improved by a variety of different linguistic …

Phrase-based document categorization revisited

CHA Koster, JG Beney - Proceedings of the 2nd international workshop …, 2009 - dl.acm.org
This paper takes a fresh look at an old idea in Information Retrieval: the use of linguistically
extracted phrases as terms in the automatic categorization (aka classification) of documents …

Towards language independent automated learning of text categorization models

C Apte, F Damerau, SM Weiss - SIGIR'94: Proceedings of the Seventeenth …, 1994 - Springer
We describe the results of extensive machine learning experiments on large collections of
Reuters' English and German newswires. The goal of these experiments was to …

Knowledge-enhanced document embeddings for text classification

RA Sinoara, J Camacho-Collados, RG Rossi… - Knowledge-Based …, 2019 - Elsevier
Accurate semantic representation models are essential in text mining applications. For a
successful application of the text mining process, the text representation adopted must keep …

A meta-learning approach for text categorization

W Lam, KY Lai - Proceedings of the 24th annual international ACM …, 2001 - dl.acm.org
We investigate a meta-model approach, called Meta-learning Using Document Feature
characteristics (MUDOF), for the task of automatic textual document categorization. It …

Using WordNet to complement training information in text categorization

M Rodriguez, J Hidalgo, B Agudo - Proceedings of 2nd International …, 2000 - torrossa.com
Automatic Text Categorisation (TC) is a complex and useful task for many natural
language applications, and is usually performed through the use of a set of manually …

Scalable term selection for text categorization

J Li, M Sun - Proceedings of the 2007 Joint Conference on …, 2007 - aclanthology.org
In text categorization, term selection is an important step for the sake of both categorization
accuracy and computational efficiency. Different dimensionalities are expected under …