Comparison of various approaches to tagging for the inflectional Slovak language

L Benko, D Munkova, M Pappová, M Munk - PeerJ Computer Science, 2024 - peerj.com
Morphological tagging provides essential insights into grammar, structure, and the mutual
relationships of words within the sentence. Tagging text in a highly inflectional language …

A corpus of German Reddit exchanges (GeRedE)

A Blombach, N Dykes, P Heinrich… - Proceedings of the …, 2020 - aclanthology.org
GeRedE is a 270 million token German CMC corpus containing approximately 380,000
submissions and 6,800,000 comments posted on Reddit between 2010 and 2018. Reddit is …

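Processamento linguístico de narrativas produzidas por crianças lusodescendentes e proposta de interface de pesquisa [Linguistic processing of narratives produced by children of Portuguese descent and a proposed search interface]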

JV Antunes - 2024 - search.proquest.com
This project will combine two important areas of study in linguistics and the digital
humanities, namely bilingualism and the analysis and processing of corpora. To this end, …

Lexical Variation of the Albanian Language used in computer-mediated communication and the challenge for processing

B Kabashi - of the 11th Conference on computer-mediated …, 2024 - shs.hal.science
In addition to the standard variant of a language, a lot is also spoken and written in non-
standard variants. The processing of data that is available in a non-standard variant is …