ANNIS3: A new architecture for generic corpus query and visualization

T Krause, A Zeldes - Digital Scholarship in the Humanities, 2016 - academic.oup.com
This article is concerned with the data structures, properties of query languages, and
visualization facilities required for the generic representation of richly annotated …

ANNIS: A search tool for multi-layer annotated corpora

A Zeldes, A Lüdeling, J Ritz, C Chiarcos - 2009 - edoc.hu-berlin.de
ANNIS (see Dipper & Götze 2005; Chiarcos et al. 2008) is a flexible web-based corpus
architecture for search and visualization of multi-layer linguistic corpora. By multi-layer we …

[PDF][PDF] Recent developments in the National Corpus of Polish

A Przepiórkowski, RL Górski, M Łazinski… - NLP, Corpus Linguistics …, 2010 - korpus.sk
The aim of the paper is to present recent–as of July 2009–developments in the construction
of the National Corpus of Polish. The main developments are: 1) the design of text encoding …

Digital humanities and natural language processing:“je t'aime... moi non plus”

B McGillivray, T Poibeau, P Ruiz - Digital Humanities Quarterly, 2020 - hal.science
In spite of the increasingly large textual datasets humanities researchers are confronted
with, and the need for automatic tools to extract information from them, we observe a lack of …

[HTML][HTML] Why TEI stand-off annotation doesn't quite work

P Bański - Balisage: The markup conference, 2010 - balisage.net
The present submission focuses on the concept of stand-off annotation as it is implemented
in the current version of the TEI Guidelines. We look at the motivation for choosing the stand …

Multi-layered semantic annotation and the formalisation of annotation schemas for the investigation of modality in a Latin corpus

H Bermúdez-Sabel, F Dell'Oro, P Marongiu - Language Resources and …, 2024 - Springer
This paper stems from the project A World of Possibilities. Modal pathways over an extra-
long period of time: the diachrony of modality in the Latin language (WoPoss) which involves …

SYN2020: a new corpus of Czech with an innovated annotation

T Jelínek, J Křivan, V Petkevič, H Skoumalová… - Text, Speech, and …, 2021 - Springer
The paper introduces the SYN2020 corpus, a newly released representative corpus of
written Czech following the tradition of the Czech National Corpus SYN series. The design of …

[PDF][PDF] Riznica: the Croatian language corpus

D Ćavar, D Brozović Rončević - Prace filologiczne, 2012 - riznica.ihjj.hr
The Croatian Language Corpus (CLC) is currently a spin-off product of sub-projects of the
research program Riznica (Croatian Language Repository). Initially the Ministry of Science …

Izgradnja referentnog korpusa savremenog srpskog jezika

MV Utvić - 2013 - search.proquest.com
Izgradnja referentnog korpusa savremenog srpskog jezika Page 1 UNIVERZITET U
BEOGRADU FILOLOŠKI FAKULTET Miloš V. Utvic IZGRADNJA REFERENTNOG …

Designing annotation schemes: From model to representation

N Ide, C Chiarcos, M Stede, S Cassidy - Handbook of linguistic annotation, 2017 - Springer
The physical formats used to represent linguistic data and its annotations have evolved over
the past four decades, accommodating different needs and perspectives as well as …