A densitometric approach to web page segmentation

C Kohlschütter, W Nejdl - Proceedings of the 17th ACM conference on …, 2008 - dl.acm.org
Web Page segmentation is a crucial step for many applications in Information Retrieval,
such as text classification, de-duplication and full-text search. In this paper we describe a …

A joint text mining-rank size investigation of the rhetoric structures of the US Presidents' speeches

V Ficcadenti, R Cerqueti, M Ausloos - Expert Systems with Applications, 2019 - Elsevier
This work presents a text mining context and its use for a deep analysis of the messages
delivered by politicians. Specifically, we deal with an expert systems-based exploration of …

On a Zipf's law extension to impact factors.

II Popescu - Glottometrics, 2003 - ram-verlag.eu
The Lavalette's law is further promoted with empirical arguments from its original area of
impact factors of scientific journals. Alike its famous precursory Zipf's and Mandelbrot's rank …

Zipfian regularities in “non-point” word representations

F Şahinuç, A Koç - Information Processing & Management, 2021 - Elsevier
Being one of the most common empirical regularities, the Zipf's law for word frequencies is a
power law relation between word frequencies and frequency ranks of words. We …

Fractal geometry of texts: An initial application to the works of Shakespeare

A Eftekhari - Journal of Quantitative Linguistics, 2006 - Taylor & Francis
It has been demonstrated that there is a geometrical order in text structures. Fractal
geometry, as a modern mathematical approach and a new geometrical standpoint on …

On fractal self-similarity in language

ЛВ Бронник - Известия Волгоградского государственного …, 2009 - cyberleninka.ru
The question of similarity is raised with respect to linguistic phenomena. The relevance of
the topic stems from advances in mathematical conceptions of (self-) …

Corrections of Zipf's and Heaps' Laws Derived from Hapax Rate Models

Ł Dębowski - arXiv preprint arXiv:2307.12896, 2023 - arxiv.org
The article introduces corrections to Zipf's and Heaps' laws based on systematic models of
the hapax rate. The derivation rests on two assumptions: The first one is the standard urn …

Content Word Index (IPC) and Percentage Distribution of Legomena (DPL) in Spanish-language research articles

K Matsuda, S Sadowsky, O Sabaj - Revista signos, 2012 - SciELO Chile
Based on a review of the classic indices in lexical statistics (the Estoup-Zipf-Mandelbrot
laws), two linguistic indices are proposed that seek to contribute new data on the …

Mathematical and Linguistic Characterization of Orhan Pamuk's Nobel Works

T Arsan, SS Simsek, O Pekcan - arXiv preprint arXiv:2304.05512, 2023 - arxiv.org
In this study, Nobel Laureate Orhan Pamuk's works are chosen as examples of Turkish
literature. By counting the number of letters and words in his texts, we find it possible to study …

Disclosing Zipfian Regularities in Semantic Breadth of Words Via Multimodal Gaussian Embeddings

F Şahinuç - 2021 - search.proquest.com
Being one of the most common empirical regularities, Zipf's law for word frequencies is a
power-law relation between word frequencies and frequency ranks of words. In this thesis …