Information theory as a bridge between language function and language form

R Futrell, M Hahn - Frontiers in Communication, 2022 - frontiersin.org
Formal and functional theories of language seem disparate, because formal theories answer
the question of what a language is, while functional theories answer the question of what …

The entropy of words—Learnability and expressivity across more than 1000 languages

C Bentz, D Alikaniotis, M Cysouw, R Ferrer-i-Cancho - Entropy, 2017 - mdpi.com
The choice associated with words is a fundamental property of natural languages. It lies at
the heart of quantitative linguistics, computational linguistics and language sciences more …

A standardized Project Gutenberg corpus for statistical analysis of natural language and quantitative linguistics

M Gerlach, F Font-Clos - Entropy, 2020 - mdpi.com
The use of Project Gutenberg (PG) as a text corpus has been extremely popular in statistical
analysis of language for more than 25 years. However, in contrast to other major linguistic …

The statistical trade-off between word order and word structure–Large-scale evidence for the principle of least effort

A Koplenig, P Meyer, S Wolfer, C Müller-Spitzer - PloS one, 2017 - journals.plos.org
Languages employ different strategies to transmit structural and grammatical information.
While, for example, grammatical dependency relationships in sentences are mainly …

Simplification in translated Chinese: An entropy-based approach

K Liu, Z Liu, L Lei - Lingua, 2022 - Elsevier
For a long time, translation researchers, particularly those working in corpus-based
translation studies, have held the presumption that translated texts tend to be simpler in …

Prioritizing user concerns in app reviews–A study of requests for new features, enhancements and bug fixes

S Malgaonkar, SA Licorish, BTR Savarimuthu - Information and Software …, 2022 - Elsevier
Context: App developers spend exhaustive manual efforts towards the identification and
prioritization of informative end-user reviews. Informative reviews are those that express end …

The learnability consequences of Zipfian distributions in language

O Lavi-Rotbain, I Arnon - Cognition, 2022 - Elsevier
While the languages of the world differ in many respects, they share certain commonalties,
which can provide insight on our shared cognition. Here, we explore the learnability …

[图书][B] Information theory meets power laws: Stochastic processes and language models

L Debowski - 2020 - books.google.com
Discover new theoretical connections between stochastic phenomena and the structure of
natural language with this powerful volume! Information Theory Meets Power Laws …

[图书][B] Statistical Universals of Language

K Tanaka-Ishii - 2021 - Springer
1.1 Aims For nearly hundred years, researchers have noticed how language ubiquitously
follows certain mathematical properties. These properties differ from linguistic universals that …

Languages with more speakers tend to be harder to (machine-) learn

A Koplenig, S Wolfer - Scientific Reports, 2023 - nature.com
Computational language models (LMs), most notably exemplified by the widespread
success of OpenAI's ChatGPT chatbot, show impressive performance on a wide range of …