Feature selection, perceptron learning, and a usability case study for text categorization

HT Ng, WB Goh, KL Low - Proceedings of the 20th annual international …, 1997 - dl.acm.org
In this paper, we describe an automated learning approach to text categorization based on
perceptron learning and a new feature selection metric, called correlation coefficient. Our …

A comparison of two learning algorithms for text categorization

DD Lewis, M Ringuette - Third annual symposium on document …, 1994 - researchgate.net
This paper examines the use of inductive learning to categorize natural language
documents into predefined content categories. Categorization of text is of increasing …

A re-examination of text categorization methods

Y Yang, X Liu - Proceedings of the 22nd annual international ACM …, 1999 - dl.acm.org
This paper reports a controlled study with statistical significance tests on five text
categorization methods: the Support Vector Machines (SVM), a k-Nearest Neighbor (kNN) …

An evaluation of statistical approaches to text categorization

Y Yang - Information retrieval, 1999 - Springer
This paper focuses on a comparative evaluation of a wide-range of text categorization
methods, including previously published results on the Reuters corpus and new results of …

Cluster-based text categorization: a comparison of category search strategies

M Iwayama, T Tokunaga - Proceedings of the 18th annual international …, 1995 - dl.acm.org
Text categorization can be viewed as a process of category search, in which one or more
categories for a test document are searched for by using given training documents with …

Inductive learning algorithms and representations for text categorization

S Dumais, J Platt, D Heckerman… - Proceedings of the seventh …, 1998 - dl.acm.org
Text categorization–the assignment of natural language texts to one or more predefined
categories based on their content–is an important component in many information …

Hierarchical neural networks for text categorization

ME Ruiz, P Srinivasan - Proceedings of the 22nd annual international …, 1999 - dl.acm.org
This paper presents the design and evaluation of a text categorization method based on the
Hierarchical Mixture of Experts model. This model uses a divide and conquer principle to …

Information gain and divergence-based feature selection for machine learning-based text categorization

C Lee, GG Lee - Information processing & management, 2006 - Elsevier
Most previous works of feature selection emphasized only the reduction of high
dimensionality of the feature space. But in cases where many features are highly redundant …

A comparative study on text representation schemes in text categorization

F Song, S Liu, J Yang - Pattern analysis and applications, 2005 - Springer
It is well known that the classification effectiveness of the text categorization system is not
simply a matter of learning algorithms. Text representation factors are also at work. This …

Experiments on the use of feature selection and negative evidence in automated text categorization

L Galavotti, F Sebastiani, M Simi - … , September 18–20, 2000 Proceedings 4, 2000 - Springer
We tackle two different problems of text categorization (TC), namely feature selection and
classifier induction. Feature selection (FS) refers to the activity of selecting, from the set of r …