Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis

S Kulick, N Ryant, B Santorini - arXiv preprint arXiv:2112.08532, 2021 - arxiv.org
We present the first parsing results on the Penn-Helsinki Parsed Corpus of Early Modern
English (PPCEME), a 1.9 million word treebank that is an important resource for research in …

Parsing Early Modern English for Linguistic Search

S Kulick, N Ryant - arXiv preprint arXiv:2002.10546, 2020 - arxiv.org
We investigate the question of whether advances in NLP over the last few years make it
possible to vastly increase the size of data usable for research in historical syntax. This …

Learning Computational Models of Non-Standard Language

M Ryskina - 2022 - lti.cs.cmu.edu
Nonstandard language such as novel words or creative spellings of existing ones often
occurs in natural text corpora, posing significant challenges for natural language processing …