Distinct word length frequencies: distributions and symbol entropies

RD Smith - arXiv preprint arXiv:1207.2334, 2012 - arxiv.org
The distribution of frequency counts of distinct words by length in a language's vocabulary
will be analyzed using two methods. The first will look at the empirical distributions of …

Lexico-semantic effects on word naming in Persian: Does age of acquisition have an effect?

M Bakhtiar, B Weekes - Memory & Cognition, 2015 - Springer
The age of acquisition (AoA) of a word has an effect on skilled reading performance.
According to the arbitrary-mapping (AM) hypothesis, AoA effects on word naming are a …

Vafa spell-checker for detecting spelling, grammatical, and real-word errors of Persian language

H Faili, N Ehsan, M Montazery… - Digital Scholarship in …, 2016 - academic.oup.com
With advancements in industry and information technology, large volumes of electronic
documents such as newspapers, emails, weblogs, and theses are produced daily …

Farsi lexical analysis and stop word list

MR Davarpanah, M Sanji, M Aramideh - Library Hi Tech, 2009 - emerald.com
Purpose–The purpose of this article is to present an aggregated methodology for
construction of the stop word list in Farsi language and generate a generic Farsi stop word …

Improving weak queries using local cluster analysis as a preliminary framework

AH Jadidinejad, H Sadr - Indian Journal of Science and Technology, 2015 - academia.edu
In a web retrieval task, the query is usually short and the users expect to find the relevant
documents in the first several result pages. To address this issue, the possibilities of using …

Classification of Persian textual documents using learning vector quantization

MT Pilevar, H Feili, M Soltani - 2009 International Conference …, 2009 - ieeexplore.ieee.org
Classification of the text documents into a predefined set of classes is considered to be an
important task for natural language processing applications. There is usually a tradeoff …

Query expansion in information retrieval for Urdu language

I Rasheed, H Banka - 2018 Fourth International Conference on …, 2018 - ieeexplore.ieee.org
The information retrieval system needs to be upgraded constantly to meet the challenges
posed by the advanced user queries as the search system becomes more sophisticated …

Sumono: A representative modern Bengali corpus

MA Al Mumin, AAM Shoeb, MR Selim… - SUST Journal of …, 2014 - researchgate.net
The development of Language Engineering applications requires availability of
sizable, reliable and representative corpora. However, such corpora are not routinely …

CLE Urdu books n-grams

F Adeeba, Q Akram, H Khalid, S Hussain - Conference on language and …, 2014 - cle.org.pk
The paper presents the development of the first publicly available Urdu N-grams extracted
from different books. For the best representation of N-grams, a large amount of Urdu corpus is …

Introduction to a new Farsi stemmer

A Mokhtaripour, S Jahanpour - … of the 15th ACM International Conference …, 2006 - dl.acm.org
Alireza Mokhtaripour, Department of Electrical and Computer Engineering, Shahid Beheshti University …