Tracing syntactic change in the scientific genre: Two Universal Dependency-parsed diachronic corpora of scientific English and German

MP Krielke, L Talamo, M Fawzi… - Proceedings of the …, 2022 - aclanthology.org
We present two comparable diachronic corpora of scientific English and German from the
Late Modern Period (17th c.–19th c.) annotated with Universal Dependencies. We describe …

Linguistic complexity in scientific writing: A large-scale diachronic study from 1821 to 1920

G Wang, H Wang, X Sun, N Wang, L Wang - Scientometrics, 2023 - Springer
This study intends to describe the diachronic changes of linguistic complexity (i.e., overall,
morphological, and syntactic complexity) in scientific writing based on Kolmogorov …

Modeling Diachronic Change in English Scientific Writing over 300+ Years with Transformer-based Language Model Surprisal

J Steuer, MP Krielke, S Fischer… - Proceedings of the …, 2024 - aclanthology.org
This study presents an analysis of diachronic linguistic changes in English scientific writing,
utilizing surprisal from transformer-based language models. Unlike traditional n-gram …

Engaging with bad (meta) data in historical corpus linguistics

T Vartiainen, T Säily - Challenges in Corpus Linguistics: Rethinking …, 2024 - degruyter.com
In this chapter, we discuss some common pitfalls related to historical data and its use in
linguistic analysis. We argue that the “philologist's dilemma”, as originally proposed by …

Initialisms in scientific writing in the 19th and early 20th centuries

K Menzel - Zeitschrift für Wortbildung/Journal of Word …, 2024 - journals.linguistik.de
This paper focusses on the role of initialisms in scientific English articles in the Royal Society
Corpus (Fischer et al. 2020; Kermes et al. 2016). The development of scientific initialisms is …

The diffusion of scientific terms–tracing individuals' influence in the history of science for English

Y Bizzoni, S Degaetano-Ortlieb… - Proceedings of the 5th …, 2021 - aclanthology.org
Tracing the influence of individuals or groups in social networks is an increasingly popular
task in sociolinguistic studies. While methods to determine someone's influence in short-term …

Fractality of informativity in 300 years of English scientific writing

Y Bizzoni, S Degaetano-Ortlieb - Proceedings of the 7th Joint …, 2023 - aclanthology.org
Scientific writing is assumed to have become more informationally dense over time
(Halliday, 1988; Biber and Gray, 2016). By means of fractal analysis, we study whether over …

Medical discourse in Late Modern English: Insights from a multidisciplinary corpus of scientific journal articles

K Menzel - Corpus pragmatic studies on the history of medical …, 2022 - jbe-platform.com
This chapter demonstrates how the Royal Society Corpus, a richly annotated corpus of
around 48,000 English scientific journal articles covering more than 330 years, can be used …

Optimizing scientific communication: the role of relative clauses as markers of complexity in English and German scientific writing between 1650 and 1900

MP Krielke - 2023 - publikationen.sulb.uni-saarland.de
The aim of this thesis is to show that both scientific English and German have become
increasingly optimized for scientific communication from 1650 to 1900 by adapting the …

Managing Fine-grained Metadata for Text Bases in Extremely Low Resource Languages: the Cases of Two Regional Languages of France

M Vergez-Couret, D Bernhard, M Nauge, M Bras… - SIGUL 2024, 2024 - hal.science
Metadata are key components of language resources and facilitate their exploitation and
re-use. Their creation is a labour intensive process and requires a modeling step, which …