Authors
Abolfazl Aleahmad, Parsia Hakimian, Farzad Mahdikhani, Farhad Oroumchian
Publication date
2007/2/12
Conference
2007 9th International Symposium on Signal Processing and Its Applications
Pages
1-4
Publisher
IEEE
Description
Persian is one of the languages of the Middle East, and a significant number of Persian documents are available on the Web. However, relatively few studies on the retrieval of Persian documents appear in the literature. In this experimental study, we assessed term-based and N-gram-based vector space models and a query expansion method, namely local context analysis, using different weighting schemes on a realistic corpus containing more than 160,000 news articles. We then compared our results with previously reported work on the Persian language. Our experimental results show that, among the assessed methods, the 4-gram-based vector space model with the Lnu.ltu weighting scheme has acceptable performance, and local context analysis has the best performance reported so far for Persian text retrieval.
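The N-gram-based vector space model mentioned in the description can be illustrated with a minimal sketch. It assumes character 4-grams taken within whitespace-separated tokens and plain term-frequency cosine similarity; it does not implement the Lnu.ltu weighting or local context analysis, and the function names `char_ngrams`, `cosine`, and `rank` are hypothetical, not taken from the paper.

```python
import math
from collections import Counter


def char_ngrams(text, n=4):
    """Split text on whitespace and return overlapping character n-grams.

    Assumption of this sketch: tokens of length n or less are kept whole.
    """
    grams = []
    for token in text.split():
        if len(token) <= n:
            grams.append(token)
        else:
            grams.extend(token[i:i + n] for i in range(len(token) - n + 1))
    return grams


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counter objects)."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, docs, n=4):
    """Return (score, doc_index) pairs sorted from best to worst match."""
    q_vec = Counter(char_ngrams(query, n))
    scored = [
        (cosine(q_vec, Counter(char_ngrams(doc, n))), idx)
        for idx, doc in enumerate(docs)
    ]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    docs = ["persian text retrieval", "signal processing"]
    for score, idx in rank("retrieving persian", docs):
        print(f"{score:.3f}  {docs[idx]}")
```

Because the n-grams overlap, a query word such as "retrieving" still matches "retrieval" through shared fragments like "retr" and "riev" without any stemming, which is one commonly cited motivation for N-gram indexing in morphologically rich languages. The same functions work unchanged on Persian strings, since Python strings are Unicode.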
Total citations
[Bar chart: citations per year, 2007 to 2024; per-year counts are not recoverable from the extracted text]
Scholar articles
A Aleahmad, P Hakimian, F Mahdikhani, F Oroumchian - 2007 9th international symposium on signal processing …, 2007