A reconfigurable stochastic tagger for languages with complex tag structure

YT Cao, H Daumé III - arXiv preprint arXiv:1910.13913, 2019 - arxiv.org

Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systemic biases in coreference …

Cited by 159 · Related articles · All 7 versions

[PDF] aclanthology.org

Toward gender-inclusive coreference resolution: An analysis of gender and bias throughout the machine learning lifecycle

YT Cao, H Daumé III - Computational Linguistics, 2021 - aclanthology.org

Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systematic biases in coreference …

Cited by 62 · Related articles · All 4 versions

[PDF] academia.edu

[PDF] The IPI PAN Corpus

A Przepiórkowski - … version. Institute of Computer Science, Polish …, 2004 - academia.edu

This publication is an outcome of a project financed chiefly by the State Committee for
Scientific Research (Komitet Badań Naukowych; KBN; project number 7T11C04320) carried …

Cited by 220 · Related articles · All 3 versions

[PDF] mit.edu

Generalizing cross-document event coreference resolution across multiple corpora

M Bugert, N Reimers, I Gurevych - Computational Linguistics, 2021 - direct.mit.edu

Cross-document event coreference resolution (CDCR) is an NLP task in which mentions of
events need to be identified and clustered throughout a collection of documents. CDCR …

Cited by 17 · Related articles · All 8 versions

[PDF] aclanthology.org

[PDF] The Unbearable Lightness of Tagging* A Case Study in Morphosyntactic Tagging of Polish

A Przepiórkowski, M Woliński - … Corpora (LINC-03) at EACL 2003, 2003 - aclanthology.org

The article takes a step back and examines the notion of part of speech (POS), arguing that
POS tagsets should be constructed more carefully and, in effect, should be light in at least …

Cited by 56 · Related articles · All 4 versions

[PDF] lrec-conf.org

[PDF] A Search Tool for Corpora with Positional Tagsets and Ambiguities.

A Przepiórkowski, Z Krynicki, L Debowski, M Wolinski… - LREC, 2004 - lrec-conf.org

This article describes POLIQARP, a corpus indexing and query tool, which understands
positional tagsets and which does not assume that word forms are annotated with unique …

Cited by 38 · Related articles · All 7 versions

[PDF] psu.edu

Trigram morphosyntactic tagger for Polish

Ł Dębowski - Intelligent Information Processing and Web Mining …, 2004 - Springer

We introduce an implementation of a plain trigram part-of-speech tagger which appears to
work well on Polish texts. At this moment the tagger achieves 9.4% error rate, which makes it …

Cited by 45 · Related articles · All 4 versions

[PDF] ipipan.waw.pl

[PDF] The potential of the IPI PAN Corpus

A Przepiórkowski - Poznan Studies in Contemporary Linguistics, 2006 - nlp.ipipan.waw.pl

The aim of this article is to present the IPI PAN Corpus (cf. http://korpus.pl/), a large
morphosyntactically annotated XML encoded corpus of Polish developed at the Institute of …

Cited by 27 · Related articles

[PDF] ipipan.waw.pl

[PDF] The IPI PAN Corpus in numbers

A Przepiórkowski - Proceedings of the 2nd Language & …, 2005 - nlp.ipipan.waw.pl

The aim of this article is to present the IPI PAN Corpus (cf. http://korpus.pl/), a large
morphosyntactically annotated XML encoded corpus of Polish developed at the Institute of …

Cited by 22 · Related articles · All 3 versions

[PDF] amu.edu.pl

[PDF] MorphoDiTa-based tagger adapted to the Polish language technology

M Piasecki, W Walentynowicz - … as a Challenge for Computer Science …, 2017 - ltc.amu.edu.pl

We present a new morpho-syntactic tagger for Polish called MorphoDiTa-pl, which is based
on the adaptation of the MorphoDiTa tagger developed originally for the Czech language …

Cited by 6 · Related articles
