Machine learning in automated text categorization

F Sebastiani - ACM computing surveys (CSUR), 2002 - dl.acm.org
The automated categorization (or classification) of texts into predefined categories has
witnessed a booming interest in the last 10 years, due to the increased availability of …

Web page classification: Features and algorithms

X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …

[PDF][PDF] 基于机器学习的文本分类技术研究进展

苏金树, 张博锋, 徐昕[1 - 软件学报, 2006 - Citeseer
文本自动分类是信息检索与数据挖掘领域的研究热点与核心技术, 近年来得到了广泛的关注和
快速的发展. 提出了基于机器学习的文本分类技术所面临的互联网内容信息处理等复杂应用的 …

A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …

[图书][B] Data mining: concepts and techniques

J Han, J Pei, H Tong - 2022 - books.google.com
Data Mining: Concepts and Techniques, Fourth Edition introduces concepts, principles, and
methods for mining patterns, knowledge, and models from various kinds of data for diverse …

[图书][B] The text mining handbook: advanced approaches in analyzing unstructured data

R Feldman, J Sanger - 2007 - books.google.com
Text mining is a new and exciting area of computer science research that tries to solve the
crisis of information overload by combining techniques from data mining, machine learning …

Link-based classification

S Bandyopadhyay, U Maulik, LB Holder… - Advanced methods for …, 2005 - Springer
A key challenge for machine learning is the problem of mining richly structured data sets,
where the objects are linked in some way due to either an explicit or implicit relationship that …

Link mining: a survey

L Getoor, CP Diehl - Acm Sigkdd Explorations Newsletter, 2005 - dl.acm.org
Many datasets of interest today are best described as a linked collection of interrelated
objects. These may represent homogeneous networks, in which there is a single-object type …

Text categorization

F Sebastiani - Encyclopedia of database technologies and …, 2005 - igi-global.com
During the last 15 years, the production of documents in digital form has exploded, due to
the increased availability of hardware and software tools for generating digital data (eg …

PEBL: positive example based learning for web page classification using SVM

H Yu, J Han, KCC Chang - Proceedings of the eighth ACM SIGKDD …, 2002 - dl.acm.org
Web page classification is one of the essential techniques for Web mining. Specifically,
classifying Web pages of a user-interesting class is the first step of mining interesting …