Sifting robotic from organic text: a natural language approach for detecting automation on Twitter

EM Clark, JR Williams, CA Jones, RA Galbraith… - Journal of computational …, 2016 - Elsevier
Twitter, a popular social media outlet, has evolved into a vast source of linguistic data, rich
with opinion, sentiment, and discussion. Due to the increasing popularity of Twitter, its …

A standardized Project Gutenberg corpus for statistical analysis of natural language and quantitative linguistics

M Gerlach, F Font-Clos - Entropy, 2020 - mdpi.com
The use of Project Gutenberg (PG) as a text corpus has been extremely popular in statistical
analysis of language for more than 25 years. However, in contrast to other major linguistic …

Sentiment and structure in word co-occurrence networks on Twitter

MI Fudolig, T Alshaabi, MV Arnold, CM Danforth… - Applied Network …, 2022 - Springer
We explore the relationship between context and happiness scores in political tweets using
word co-occurrence networks, where nodes in the network are the words, and the weight of …

On the physical origin of linguistic laws and lognormality in speech

IG Torre, B Luque, L Lacasa… - Royal Society …, 2019 - royalsocietypublishing.org
Physical manifestations of linguistic units include sources of variability due to factors of
speech production which are by definition excluded from counts of linguistic symbols. In this …

Detecting social bots on facebook in an information veracity context

GC Santia, MI Mujib, JR Williams - … AAAI conference on web and social …, 2019 - ojs.aaai.org
Misleading information is nothing new, yet its impacts seem only to grow. We investigate this
phenomenon in the context of social bots. Social bots are software agents that mimic …

Zipf's law holds for phrases, not words

J Ryland Williams, PR Lessard, S Desu, EM Clark… - Scientific reports, 2015 - nature.com
With Zipf's law being originally and most famously observed for word frequency, it is
surprisingly limited in its applicability to human language, holding over no more than three to …

[HTML][HTML] Information flow estimation: a study of news on Twitter

T South, B Smart, M Roughan, L Mitchell - Online Social Networks and …, 2022 - Elsevier
News media has long been an ecosystem of creation, reproduction, and critique, where
news outlets report on current events and add commentary to ongoing stories …

Allotaxonometry and rank-turbulence divergence: A universal instrument for comparing complex systems

PS Dodds, JR Minot, MV Arnold, T Alshaabi… - EPJ Data …, 2023 - epjds.epj.org
Complex systems often comprise many kinds of components which vary over many orders of
magnitude in size: Populations of cities in countries, individual and corporate wealth in …

[HTML][HTML] Variation of Zipf's exponent in one hundred live languages: A study of the Holy Bible translations

A Mehri, M Jamaati - Physics Letters A, 2017 - Elsevier
Zipf's law, as a power-law regularity, confirms long-range correlations between the elements
in natural and artificial systems. In this article, this law is evaluated for one hundred live …

The brevity law as a scaling law, and a possible origin of Zipf's law for word frequencies

Á Corral, I Serra - Entropy, 2020 - mdpi.com
An important body of quantitative linguistics is constituted by a series of statistical laws about
language usage. Despite the importance of these linguistic laws, some of them are poorly …