This article builds on a mathematical explanation of one the most prominent stylometric measures, Burrows's Delta (and its variants), to understand and explain its working. Starting …
M Eder - Digital Scholarship in the Humanities, 2015 - academic.oup.com
The aim of this study is to find such a minimal size of text samples for authorship attribution that would provide stable results independent of random noise. A few controlled tests for …
M Eder - Digital Scholarship in the Humanities, 2017 - academic.oup.com
The aim of this article is to discuss reliability issues of a few visual techniques used in stylometry, and to introduce a new method that enhances the explanatory power of …
With the recent progress made in network and computing technology, the ubiquity of data, and textual repositories freely available, the scientific practice evolves towards a more data …
J Rybicki, M Eder - Literary and linguistic computing, 2011 - academic.oup.com
This article examines the success of authorship attribution of Burrows's Delta in several corpora representing a variety of languages and genres. Contrary to the approaches of our …
Machine-learning stylometric distance methods based on most-frequent-word frequencies are well-accepted and successful in authorship attribution. This study investigates the results …
M Eder - Studies in Polish Linguistics, 2011 - ruj.uj.edu.pl
The present study addresses one of the theoretical problems of computer-assisted authorship attribution, namely the question which traceable features of language can betray …
J Rybicki, M Heydel - Literary and Linguistic Computing, 2013 - academic.oup.com
The study investigates to what extent traditional stylistics and non-traditional stylometry can co-operate in the study of translations in terms of translatorial style. Stylistic authorship …
M Eder - Literary and linguistic computing, 2013 - academic.oup.com
In computational stylistics, any influence of unwanted noise—eg caused by an untidily prepared corpus—might lead to biased or false results. Relying on contaminated data is …