A large and evolving cognate database

K Batsuren, G Bella, F Giunchiglia - Language Resources and Evaluation, 2022 - Springer
We present CogNet, a large-scale, automatically-built database of sense-tagged cognates—
words of common origin and meaning across languages. CogNet is continuously evolving …

Using lexical language models to detect borrowings in monolingual wordlists

JE Miller, T Tresoldi, R Zariquiey, CA Beltrán Castañón… - PLOS ONE, 2020 - journals.plos.org
Lexical borrowing, the transfer of words from one language to another, is one of the most
frequent processes in language evolution. In order to detect borrowings, linguists make use …

Automated identification of borrowings in multilingual wordlists

JM List, R Forkel - Open Research Europe, 2021 - ncbi.nlm.nih.gov
Although lexical borrowing is an important aspect of language evolution, there have been
few attempts to automate the identification of borrowings in lexical datasets. Moreover, none …

Frequent violation of the sonority sequencing principle in hundreds of languages: how often and by which sequences?

R Yin, J van de Weijer, ER Round - Linguistic Typology, 2023 - degruyter.com
The Sonority Sequencing Principle (SSP) is a fundamental governing principle of
syllable structure; however, its details remain contested. This study aims to clarify the …

Statistical bias control in typology

M Guzmán Naranjo, L Becker - Linguistic Typology, 2022 - degruyter.com
In this paper, we propose two new statistical controls for genealogical and areal bias in
typological samples. Our test case being the effect of VO-order effect on affix position …

Estimating areal effects in typology: A case study of African phoneme inventories

M Guzmán Naranjo, M Mertner - Linguistic Typology, 2023 - degruyter.com
In this paper, we combine several statistical techniques (multivariate probit models,
Gaussian processes, and phylogenetic regression) into a new approach for exploring the …

First steps towards the detection of contact layers in Bangime: a multi-disciplinary, computer-assisted approach

A Hantgan, H Babiker, JM List - Open Research Europe, 2022 - ncbi.nlm.nih.gov
Bangime is a language isolate, which has not been proven to be genealogically related to
any other language family, spoken in Central-Eastern Mali. Its speakers, the Bangande …

Detecting lexical borrowings from dominant languages in multilingual wordlists

JE Miller, JM List - arXiv preprint arXiv:2302.00189, 2023 - arxiv.org
Language contact is a pervasive phenomenon reflected in the borrowing of words from
donor to recipient languages. Most computational approaches to borrowing detection treat …

Computational approaches to historical language comparison

JM List - 2022 - hcommons.org
The chapter discusses recently developed computational techniques providing concrete
help in addressing various tasks in historical language comparison, focusing specifically on …

Neural borrowing detection with monolingual lexical models

J Miller, E Pariasca, CB Castañon - Proceedings of the Student …, 2021 - aclanthology.org
Identification of lexical borrowings, transfer of words between languages, is an essential
practice of historical linguistics and a vital tool in analysis of language contact and cultural …