We present here the largest publicly available corpus of Romanian. Its written component contains 1,257,752,812 tokens, distributed, in an unbalanced way, in several language …
This paper reports on recent developments within the European Reference Corpus EuReCo, an open initiative that aims at providing and using virtual and dynamically …
V Păiș, D Tufiș - Proceedings of the Romanian Academy, series …, 2018 - researchgate.net
We investigate the usability of the CoRoLa corpus for generating high quality vector representations of words for Romanian language. Different model parameters are tested …
B Trawiński, M Kupietz - Deutsch in Europa. Sprachpolitisch …, 2021 - degruyter.com
Der Beitrag beschreibt die Motivation und Ziele des Europäischen Referenzkorpus EuReCo, einer offenen Initiative, die darauf abzielt, dynamisch definierbare virtuelle vergleichbare …
M Kupietz, N Diewald… - Proceedings of the LREC …, 2018 - ids-pub.bsz-bw.de
How to get the computation near the data: Improving data accessibility to, and reusability of analysis functions in corpus query platforms Page 27 How to Get the Computation Near the …
The user interfaces for corpus analysis platforms must provide a high degree of accessibility for ordinary users and at the same time provide the possibility to answer complex research …
M Kupietz, H Lüngen, N Diewald - Korpora in der germanistischen …, 2022 - degruyter.com
Das Deutsche Referenzkorpus DeReKo dient als eine empirische Grundlage für die germanistische Linguistik. In diesem Beitrag geben wir einen Überblick über Grundlagen …
DRuKoLA, the accompanying project in the making of the Corpus of Romanian Language, is a cooperation between German and Romanian computer scientists, corpus linguists and …
This section concerns applications of comparable corpora beyond pure machine translation. It has been argued [,] that downstream applications such as cross-lingual document …