查看文章

Efficient feature selection and domain relevance term weighting method for document classification

作者

Aurangzeb Khan, Baharum Baharudin, Khairullah Khan

发表日期

2010/3/19

研讨会论文

2010 Second International Conference on Computer Engineering and Applications

卷号

页码范围

398-403

出版商

IEEE

简介

Feature selection is of paramount concern in document classification process which improves the efficiency and accuracy of text classifier. Vector Space Model is used to represent the ¿Bag of Word¿ BOW of the documents with term weighting phenomena. Documents representing through this model has some limitations that is, ignoring term dependencies, structure and ordering of the terms in documents. To overcome this problem semantic base feature vector is proposed. That is used to extracts the concept of term, co-occurring and associated terms using ontology. The proposed method is applied on small documents dataset, which shows that this method outperforms then term frequency/ inverse document frequency (TF-IDF) with BOW feature selection method for text classification.

引用总数

被引用次数：17

20112012201320142015201620172018201920202021202220232 2 3 1 2 1 2 2 1 1

学术搜索中的文章

Efficient feature selection and domain relevance term weighting method for document classification

A Khan, B Baharudin, K Khan - 2010 Second International Conference on Computer …, 2010

被引用次数：17 相关文章所有 3 个版本