作者
Aurangzeb Khan, Baharum Baharudin, Khairullah Khan
发表日期
2010/3/19
研讨会论文
2010 Second International Conference on Computer Engineering and Applications
卷号
2
页码范围
398-403
出版商
IEEE
简介
Feature selection is of paramount concern in document classification process which improves the efficiency and accuracy of text classifier. Vector Space Model is used to represent the ¿Bag of Word¿ BOW of the documents with term weighting phenomena. Documents representing through this model has some limitations that is, ignoring term dependencies, structure and ordering of the terms in documents. To overcome this problem semantic base feature vector is proposed. That is used to extracts the concept of term, co-occurring and associated terms using ontology. The proposed method is applied on small documents dataset, which shows that this method outperforms then term frequency/ inverse document frequency (TF-IDF) with BOW feature selection method for text classification.
引用总数
20112012201320142015201620172018201920202021202220232231212211
学术搜索中的文章