[PDF][PDF] Automatic document classification using a domain ontology

P Wijewickrema, RCG Gamage - 09 th National Conference on Library and …, 2012 - slla.lk
09 th National Conference on Library and Information Science, 2012slla.lk
Automatic classification has become an important research area due to the rapid increase of
digital information today. Evidently, manual classification of documents is a tough work due
to occurrences of vocabulary ambiguities of classification schemes as well as the language
used in the text in hand. In our study, we made an attempt to resolve this matter. This
research has developed a computer programme that can automatically classify a given text
document based on a well developed ontology. Therefore, the user gets correct options of …
Abstract
Automatic classification has become an important research area due to the rapid increase of digital information today. Evidently, manual classification of documents is a tough work due to occurrences of vocabulary ambiguities of classification schemes as well as the language used in the text in hand.
In our study, we made an attempt to resolve this matter. This research has developed a computer programme that can automatically classify a given text document based on a well developed ontology. Therefore, the user gets correct options of classification just after feeding the document to the new system. The new ontology is a domain ontology which is based on the Dewey Decimal Classification scheme and the Sears list. Data was obtained for classification accuracy for both manual and automatic methods. Moreover, the relationship between the vagueness of language in documents and the inaccuracy of classification were
slla.lk
以上显示的是最相近的搜索结果。 查看全部搜索结果