Assessment of a modern farsi corpus- 学术资源搜索

文章

学术资源搜索

[PDF][PDF] Assessment of a modern farsi corpus

E Darrudi, MR Hejazi, F Oroumchian - … of the 2nd Workshop on Information …, 2004 - Citeseer

ABSTRACT The development of Language Engineering (LE) and Information Retrieval (IR)
applications requires availability of sizeable, reliable and representative corpora. This paper
describes how we have constructed a well-structured 345 MB tagged corpus of news, and
presents some beneficial statistics of this corpus based upon the characteristics of Farsi
language. It also goes into particular detail on the fitness of the frequency and rank of Farsi
words with Zipf-Mandelbrot's law. We will then present our measurement of Entropy of Farsi …

被引用次数：51 相关文章所有 3 个版本

[引用][C] Assessment of a modern Farsi corpus

F Oroumchian, E Darrudi, MR Hejazi - Proceedings of The 2nd Workshop on …, 2004

被引用次数：11 相关文章

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

[PDF][PDF] Assessment of a modern farsi corpus

[引用][C] Assessment of a modern Farsi corpus

引用