Zipf's word frequency law in natural language: A critical review and future directions

ST Piantadosi - Psychonomic bulletin & review, 2014 - Springer
The frequency distribution of words has been a key object of study in statistical linguistics for
the past 70 years. This distribution approximately follows a simple mathematical form known …

Zipf's law for word frequencies: Word forms versus lemmas in long texts

Á Corral, G Boleda, R Ferrer-i-Cancho - PloS one, 2015 - journals.plos.org
Zipf's law is a fundamental paradigm in the statistics of written and spoken natural language
as well as in other communication systems. We raise the question of the elementary units for …

[PDF] Empirical and theoretical bases of Zipf's law

RE Wyllys - 1981 - ideals.illinois.edu
One of the most puzzling phenomena in bibliometrics - and, more broadly, in
quantitative linguistics - is Zipf's law. As one commentator, the statistician Gustav Herdan, has …

[PDF] Extension of Zipf's law to words and phrases

EI Sicilia-Garcia, J Ming, FJ Smith - COLING 2002: The 19th …, 2002 - aclanthology.org
Zipf's law states that the frequency of word tokens in a large corpus of natural language is
inversely proportional to the rank. The law is investigated for two languages English and …

Statistical models for word frequency distributions: A linguistic evaluation

H Baayen - Computers and the Humanities, 1992 - Springer
Three models for word frequency distributions, the lognormal law, the generalized inverse
Gauss-Poisson law and the extended generalized Zipf's law are compared and evaluated …

Large-scale analysis of Zipf's law in English texts

I Moreno-Sánchez, F Font-Clos, Á Corral - PloS one, 2016 - journals.plos.org
Despite being a paradigm of quantitative linguistics, Zipf's law for words suffers from three
main problems: its formulation is ambiguous, its validity has not been tested rigorously from …

[Book] Word frequency distributions

RH Baayen - 2001 - books.google.com
This book is an introduction to the statistical analysis of word frequency distributions,
intended for linguists, psycholinguists, and researchers working in the field of quantitative …

Dynamics of text generation with realistic Zipf's distribution

D Zanette, M Montemurro - Journal of Quantitative Linguistics, 2005 - Taylor & Francis
We investigate the origin of Zipf's law for words in written texts by means of a stochastic
dynamic model for text generation. The model incorporates both features related to the …

The evolution of the exponent of Zipf's law in language ontogeny

J Baixeries, B Elvevåg, R Ferrer-i-Cancho - PloS one, 2013 - journals.plos.org
It is well-known that word frequencies arrange themselves according to Zipf's law. However,
little is known about the dependency of the parameters of the law and the complexity of a …

Word length, sentence length and frequency–Zipf revisited

B Sigurd, M Eeg‐Olofsson, J Van Weijer - Studia linguistica, 2004 - Wiley Online Library
This paper examines data from English, Swedish and German in order to find a theoretical
distribution that describes the observed relation between word length and frequency. In …