[PDF][PDF] A review of machine learning algorithms for text-documents classification

A Khan, B Baharudin, LH Lee, K Khan - Journal of advances in …, 2010 - academia.edu
Journal of advances in information technology, 2010academia.edu
With the increasing availability of electronic documents and the rapid growth of the World
Wide Web, the task of automatic categorization of documents became the key method for
organizing the information and knowledge discovery. Proper classification of e-documents,
online news, blogs, e-mails and digital libraries need text mining, machine learning and
natural language processing techniques to get meaningful knowledge. The aim of this paper
is to highlight the important techniques and methodologies that are employed in text …
Abstract
With the increasing availability of electronic documents and the rapid growth of the World Wide Web, the task of automatic categorization of documents became the key method for organizing the information and knowledge discovery. Proper classification of e-documents, online news, blogs, e-mails and digital libraries need text mining, machine learning and natural language processing techniques to get meaningful knowledge. The aim of this paper is to highlight the important techniques and methodologies that are employed in text documents classification, while at the same time making awareness of some of the interesting challenges that remain to be solved, focused mainly on text representation and machine learning techniques. This paper provides a review of the theory and methods of document classification and text mining, focusing on the existing literature.
academia.edu
以上显示的是最相近的搜索结果。 查看全部搜索结果