The new IDS corpus analysis platform: Challenges and prospects

P Bański, PM Fischer, E Frick, E Ketzan… - Proceedings of the …, 2012 - ids-pub.bsz-bw.de
The present article describes the first stage of the KorAP project, launched recently at the
Institut für Deutsche Sprache (IDS) in Mannheim, Germany. The aim of this project is to …

[图书][B] Multilayer corpus studies

A Zeldes - 2018 - taylorfrancis.com
This volume explores the opportunities afforded by the construction and evaluation of
multilayer corpora, an emerging methodology within corpus linguistics that brings about …

KorAP Architecture―Diving in the Deep Sea of Corpus Data

N Diewald, M Hanl, E Margaretha… - Proceedings of the …, 2016 - aclanthology.org
KorAP is a corpus search and analysis platform, developed at the Institute for the German
Language (IDS). It supports very large corpora with multiple annotation layers, multiple …

Linguistic variation in the Austrian Media Corpus. Dealing with the challenges of large amounts of data

R Jutta, M Karlheinz, Ď Matej - Procedia-Social and Behavioral Sciences, 2013 - Elsevier
The paper at hand deals with a new corpus of digital texts, the Austrian Media Corpus, which
is being created at the Institute for Corpus Linguistics and Text Technology of the Austrian …

Corpus query lingua franca (CQLF)

P Bański, E Frick, A Witt - … of the Tenth International Conference on …, 2016 - aclanthology.org
The present paper describes Corpus Query Lingua Franca (ISO CQLF), a specification
designed at ISO Technical Committee 37 Subcommittee 4 “Language resource …

GYANI: an indexing infrastructure for knowledge-centric tasks

D Gupta, K Berberich - Proceedings of the 27th ACM International …, 2018 - dl.acm.org
In this work, we describe GYANI (gyan stands for knowledge in Hindi), an indexing
infrastructure for search and analysis of large semantically annotated document collections …

Reflections and a proposal for a query and reporting language for richly annotated multiparallel corpora

S Clematide, G Gintare, A Utka, M Volk - Linköping Electronic …, 2015 - zora.uzh.ch
Large and open multiparallel corpora are a valuable resource for contrastive corpus
linguists if the data is annotated and stored in a way that allows precise and flexible ad hoc …

Corpus Query Lingua Franca part II: Ontology

S Evert, O Harlamov, P Heinrich… - Proceedings of the …, 2020 - aclanthology.org
The present paper outlines the projected second part of the Corpus Query Lingua Franca
(CQLF) family of standards: CQLF Ontology, which is currently in the process of …

Evaluating DBMS-based access strategies to very large multi-layer corpora

R Schneider - Proceedings of the LREC-12 Workshop on …, 2012 - ids-pub.bsz-bw.de
Linguistic query systems are special purpose IR applications. As text sizes, annotation
layers, and metadata schemes of language corpora grow rapidly, performing complex …

Korpusanalyseplattform der nächsten Generation

M Kupietz, E Frick - Grundlagen einer sprachwissenschaftlichen …, 2013 - ids-pub.bsz-bw.de
Für den Zugriff auf die IDS-Korpora wurde Anfang der 1990er Jahre am IDS das
Korpusrecherche-und-analysesystem COSMAS (Corpus Search, Management and Analysis …