Machine learning in automated text categorization

F Sebastiani - ACM computing surveys (CSUR), 2002 - dl.acm.org
The automated categorization (or classification) of texts into predefined categories has
witnessed a booming interest in the last 10 years, due to the increased availability of …

[PDF][PDF] Text mining methods and techniques

SV Gaikwad, A Chaugule, P Patil - International Journal of Computer …, 2014 - academia.edu
In recent years growth of digital data is increasing, knowledge discovery and data mining
have attracted great attention with coming up need for turning such data into useful …

[图书][B] The text mining handbook: advanced approaches in analyzing unstructured data

R Feldman, J Sanger - 2007 - books.google.com
Text mining is a new and exciting area of computer science research that tries to solve the
crisis of information overload by combining techniques from data mining, machine learning …

Toward optimal feature selection in naive Bayes for text categorization

B Tang, S Kay, H He - IEEE transactions on knowledge and …, 2016 - ieeexplore.ieee.org
Automated feature selection is important for text categorization to reduce feature size and to
speed up learning process of classifiers. In this paper, we present a novel and efficient …

Effective pattern discovery for text mining

N Zhong, Y Li, ST Wu - IEEE transactions on knowledge and …, 2010 - ieeexplore.ieee.org
Many data mining techniques have been proposed for mining useful patterns in text
documents. However, how to effectively use and update discovered patterns is still an open …

A machine learning approach to web page filtering using content and structure analysis

M Chau, H Chen - Decision Support Systems, 2008 - Elsevier
As the Web continues to grow, it has become increasingly difficult to search for relevant
information using traditional search engines. Topic-specific search engines provide an …

A Bayesian classification approach using class-specific features for text categorization

B Tang, H He, PM Baggenstoss… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
In this paper, we present a Bayesian classification approach for automatic text categorization
using class-specific features. Unlike conventional text categorization approaches, our …

Hierarchical text categorization using neural networks

ME Ruiz, P Srinivasan - Information retrieval, 2002 - Springer
This paper presents the design and evaluation of a text categorization method based on the
Hierarchical Mixture of Experts model. This model uses a divide and conquer principle to …

MeSH Up: effective MeSH text classification for improved document retrieval

D Trieschnigg, P Pezik, V Lee, F De Jong… - …, 2009 - academic.oup.com
Motivation: Controlled vocabularies such as the Medical Subject Headings (MeSH)
thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing …

Web page feature selection and classification using neural networks

A Selamat, S Omatu - Information sciences, 2004 - Elsevier
Automatic categorization is the only viable method to deal with the scaling problem of the
World Wide Web (WWW). In this paper, we propose a news web page classification method …