XML text interchange format in the National Corpus of Polish

M Ogrodniczuk, K Glowinska, M Kopec, A Savary… - 2014 - books.google.com

'Coreference'presents specificities of reference, anaphora and coreference in Polish,
establish identity-of-reference annotation model and present methodology used to create …

被引用次数：60 相关文章所有 4 个版本

[PDF] lrec-conf.org

[PDF][PDF] The Polish Sejm Corpus.

M Ogrodniczuk - LREC, 2012 - lrec-conf.org

This document presents the first edition of the Polish Sejm Corpus–a new specialized
resource containing transcribed, automatically annotated utterances of the Members of …

被引用次数：33 相关文章所有 6 个版本

Exploring machine learning algorithms and protein language models strategies to develop enzyme classification systems

D Fernández, Á Olivera-Nappa… - … Work-Conference on …, 2023 - Springer

Discovering functionalities for unknown enzymes has been one of the most common
bioinformatics tasks. Functional annotation methods based on phylogenetic properties have …

被引用次数：5 相关文章所有 2 个版本

[PDF] researchgate.net

[PDF][PDF] Web Service integration platform for Polish linguistic resources.

M Ogrodniczuk, M Lenart - LREC, 2012 - researchgate.net

This paper presents a robust linguistic Web service framework for Polish, combining several
mature offline linguistic tools in a common online platform. The toolset comprise paragraph …

被引用次数：19 相关文章所有 8 个版本

[PDF] cuni.cz

[PDF][PDF] TEI P5 as an XML Standard for Treebank Encoding

A Przepiórkowski - Proceedings of the Eighth International Workshop …, 2009 - ufal.mff.cuni.cz

The aim of the paper is to show that a subset of Text Encoding Initiative Guidelines is a
reasonable choice as a standard for stand-off XML encoding of syntactically annotated …

被引用次数：26 相关文章所有 11 个版本

[PDF] aclanthology.org

[PDF][PDF] Towards morphologically annotated corpus of hospital discharge reports in Polish

M Marciniak, A Mykowiecka - Proceedings of BioNLP 2011 …, 2011 - aclanthology.org

The paper discuses problems in annotating a corpus containing Polish clinical data with low
level linguistic information. We propose an approach to tokenization and automatic …

被引用次数：18 相关文章所有 12 个版本

[PDF] researchgate.net

Analysing utterances in polish parliament to predict speaker's background

P Przybyła, P Teisseyre - Journal of quantitative linguistics, 2014 - Taylor & Francis

In this study we use transcripts of the Sejm (Polish parliament) to predict speaker's
background: gender, education, party affiliation and birth year. We create learning cases …

被引用次数：11 相关文章所有 3 个版本

[PDF] researchgate.net

Which XML standards for multilevel corpus annotation?

A Przepiórkowski, P Bański - … for Computer Science and Linguistics: 4th …, 2011 - Springer

The paper attempts to answer the question: Which XML standard (s) should be used for
multilevel corpus annotation? Various more or less specific standards and best practices are …

被引用次数：20 相关文章所有 11 个版本

[PDF] hal.science

Language Documentation and Standards in Digital Humanities: TEI and the documentation of Mixtepec-Mixtec

J Bowers - 2020 - theses.hal.science

This dissertation concerns a language documentation project covering the Mixtepec-Mixtec
variety of Mixtec (ISO 639-3: mix). Mixtepec-Mixtec is an Oto-Manguean spoken by roughly …

被引用次数：7 相关文章所有 9 个版本

[PDF] psu.edu

[PDF][PDF] Foreign language examination corpus for l2-learning studies

P Bański, R Gozdawa-Gołębiowski - Workshop Programme, 2010 - Citeseer

We describe the structure and the features of the Foreign Language Examination Corpus, a
University of Warsaw project, launched on the initiative of the University Council for the …

被引用次数：7 相关文章所有 6 个版本

高级搜索

QQ 群