[BOOK][B] Coreference: annotation, resolution and evaluation in Polish

M Ogrodniczuk, K Glowinska, M Kopec, A Savary… - 2014 - books.google.com
'Coreference' presents specificities of reference, anaphora and coreference in Polish,
establishes an identity-of-reference annotation model and presents the methodology used to create …

[PDF][PDF] The Polish Sejm Corpus.

M Ogrodniczuk - LREC, 2012 - lrec-conf.org
This document presents the first edition of the Polish Sejm Corpus, a new specialized
resource containing transcribed, automatically annotated utterances of the Members of …

Exploring machine learning algorithms and protein language models strategies to develop enzyme classification systems

D Fernández, Á Olivera-Nappa… - … Work-Conference on …, 2023 - Springer
Discovering functionalities for unknown enzymes has been one of the most common
bioinformatics tasks. Functional annotation methods based on phylogenetic properties have …

[PDF][PDF] Web Service integration platform for Polish linguistic resources.

M Ogrodniczuk, M Lenart - LREC, 2012 - researchgate.net
This paper presents a robust linguistic Web service framework for Polish, combining several
mature offline linguistic tools in a common online platform. The toolset comprises paragraph …

[PDF][PDF] TEI P5 as an XML Standard for Treebank Encoding

A Przepiórkowski - Proceedings of the Eighth International Workshop …, 2009 - ufal.mff.cuni.cz
The aim of the paper is to show that a subset of Text Encoding Initiative Guidelines is a
reasonable choice as a standard for stand-off XML encoding of syntactically annotated …

[PDF][PDF] Towards morphologically annotated corpus of hospital discharge reports in Polish

M Marciniak, A Mykowiecka - Proceedings of BioNLP 2011 …, 2011 - aclanthology.org
The paper discusses problems in annotating a corpus containing Polish clinical data with
low-level linguistic information. We propose an approach to tokenization and automatic …

Analysing utterances in Polish parliament to predict speaker's background

P Przybyła, P Teisseyre - Journal of quantitative linguistics, 2014 - Taylor & Francis
In this study we use transcripts of the Sejm (Polish parliament) to predict speaker's
background: gender, education, party affiliation and birth year. We create learning cases …

Which XML standards for multilevel corpus annotation?

A Przepiórkowski, P Bański - … for Computer Science and Linguistics: 4th …, 2011 - Springer
The paper attempts to answer the question: Which XML standard(s) should be used for
multilevel corpus annotation? Various more or less specific standards and best practices are …

Language Documentation and Standards in Digital Humanities: TEI and the documentation of Mixtepec-Mixtec

J Bowers - 2020 - theses.hal.science
This dissertation concerns a language documentation project covering the Mixtepec-Mixtec
variety of Mixtec (ISO 639-3: mix). Mixtepec-Mixtec is an Oto-Manguean language spoken by roughly …

[PDF][PDF] Foreign language examination corpus for L2-learning studies

P Bański, R Gozdawa-Gołębiowski - Workshop Programme, 2010 - Citeseer
We describe the structure and the features of the Foreign Language Examination Corpus, a
University of Warsaw project, launched on the initiative of the University Council for the …