作者
AK Jumani, MH Mahar, FH Khoso, MA Memon
发表日期
2018/3/7
期刊
Sindh University Research Journal-SURJ (Science Series)
卷号
50
期号
01
页码范围
85-90
简介
Text Classification is a need of day, large text existing in the form of stories, news etc. Likewise, this system came into being along several techniques like, Support Vector Machine, Neural Networks and Decision Tree. Stories, newspapers are the page collection that belongs to text categorization. Various Sindhi newspapers are regularly published and Daily Kawish is one of them. People are facing difficulties during reading newspaper because there is no any specific option that will categorize particular news related to sports, technologies, crime, fashion and current affairs. For this purpose, a Text Categorization System (TCS) for Sindhi language is presented in this paper. Five classes are used and scanned each newspaper page inside a single class. It is too difficult to predict how many users will read newspaper simultaneously and for this, web performance is tested. Moreover, for the classification of the text from pages, precision, recall and f-measure are used to measure and achieved 67% of accuracy to classify the text from newspaper pages. It would be beneficial for those who want to save their precious time during reading newspaper.
引用总数
20192020202120222023202425111
学术搜索中的文章
AK Jumani, MH Mahar, FH Khoso, MA Memon - Sindh University Research Journal-SURJ (Science …, 2018