[PDF][PDF] KNN based machine learning approach for text and document mining

V Bijalwan, V Kumar, P Kumari… - International Journal of …, 2014 - researchgate.net
International Journal of Database Theory and Application, 2014researchgate.net
Text Categorization (TC), also known as Text Classification, is the task of automatically
classifying a set of text documents into different categories from a predefined set. If a
document belongs to exactly one of the categories, it is a single-label classification task;
otherwise, it is a multi-label classification task. TC uses several tools from Information
Retrieval (IR) and Machine Learning (ML) and has received much attention in the last years
from both researchers in the academia and industry developers. In this paper, we first …
Abstract
Text Categorization (TC), also known as Text Classification, is the task of automatically classifying a set of text documents into different categories from a predefined set. If a document belongs to exactly one of the categories, it is a single-label classification task; otherwise, it is a multi-label classification task. TC uses several tools from Information Retrieval (IR) and Machine Learning (ML) and has received much attention in the last years from both researchers in the academia and industry developers. In this paper, we first categorize the documents using KNN based machine learning approach and then return the most relevant documents.
researchgate.net
以上显示的是最相近的搜索结果。 查看全部搜索结果