[PDF][PDF] Recent developments in the National Corpus of Polish

A Przepiórkowski, RL Górski, M Łazinski… - NLP, Corpus Linguistics …, 2010 - korpus.sk
The aim of the paper is to present recent–as of July 2009–developments in the construction
of the National Corpus of Polish. The main developments are: 1) the design of text encoding …

The TEI and current standards for structuring linguistic data. An overview

M Stührenberg - Journal of the text encoding initiative, 2012 - journals.openedition.org
The TEI has served for many years as a mature annotation format for corpora of different
types, including linguistically annotated data. Although it is based on the consensus of a …

[PDF][PDF] National corpus of polish

A Przepiórkowski, M Bańko, RL Górski… - Proceedings of the 5th …, 2011 - academia.edu
The paper presents the main results of the National Corpus of Polish project, which took
place from December 2007 to June 2011, including: the sizes of the main corpus and …

Multi-layered semantic annotation and the formalisation of annotation schemas for the investigation of modality in a Latin corpus

H Bermúdez-Sabel, F Dell'Oro, P Marongiu - Language Resources and …, 2024 - Springer
This paper stems from the project A World of Possibilities. Modal pathways over an extra-
long period of time: the diachrony of modality in the Latin language (WoPoss) which involves …

[PDF][PDF] Towards the Annotation of Named Entities in the National Corpus of Polish.

A Savary, J Waszczuk, A Przepiórkowski - LREC, 2010 - academia.edu
We present the named entity annotation task within the on-going project of the National
Corpus of Polish. To the best of our knowledge, this is the first attempt at a large-scale …

[PDF][PDF] The Polish Sejm Corpus.

M Ogrodniczuk - LREC, 2012 - lrec-conf.org
This document presents the first edition of the Polish Sejm Corpus–a new specialized
resource containing transcribed, automatically annotated utterances of the Members of …

[PDF][PDF] The design of syntactic annotation levels in the National Corpus of Polish

K Głowinska, A Przepiórkowski - Proceedings of LREC 2010, 2010 - lexitron.nectec.or.th
This paper presents the procedure of the syntactic annotation of the National Corpus of
Polish. Syntactic annotation consists here of shallow parsing and manual post-editing of the …

Tools and methodologies for annotating syntax and named entities in the National Corpus of Polish

J Waszczuk, K Glowińska, A Savary… - Proceedings of the …, 2010 - ieeexplore.ieee.org
The on-going project aiming at the creation of the National Corpus of Polish assumes
several levels of linguistic annotation. We present the technical environment and …

[PDF][PDF] TEI P5 as an XML Standard for Treebank Encoding

A Przepiórkowski - Proceedings of the Eighth International Workshop …, 2009 - ufal.mff.cuni.cz
The aim of the paper is to show that a subset of Text Encoding Initiative Guidelines is a
reasonable choice as a standard for stand-off XML encoding of syntactically annotated …

Representation and Processing of Composition, Variation and Approximation in Language Resources and Tools

A Savary - 2014 - hal.science
In my habilitation dissertation, meant to validate my capacity of and maturity for directing
research activities, I present a panorama of several topics in computational linguistics …