Toward gender-inclusive coreference resolution

YT Cao, H Daumé III - arXiv preprint arXiv:1910.13913, 2019 - arxiv.org
Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systemic biases in coreference …

Toward gender-inclusive coreference resolution: An analysis of gender and bias throughout the machine learning lifecycle

YT Cao, H Daumé III - Computational Linguistics, 2021 - aclanthology.org
Correctly resolving textual mentions of people fundamentally entails making inferences
about those people. Such inferences raise the risk of systematic biases in coreference …

[PDF][PDF] The IPI PAN Corpus

A Przepiórkowski - … version. Institute of Computer Science, Polish …, 2004 - academia.edu
This publication is an outcome of a project financed chiefly by the State Committee for
Scientific Research (Komitet Badań Naukowych; KBN; project number 7T11C04320) carried …

Generalizing cross-document event coreference resolution across multiple corpora

M Bugert, N Reimers, I Gurevych - Computational Linguistics, 2021 - direct.mit.edu
Cross-document event coreference resolution (CDCR) is an NLP task in which mentions of
events need to be identified and clustered throughout a collection of documents. CDCR …

[PDF][PDF] The Unberable Lightness of Tagging* A Case Study in Morphosyntactic Tagging of Polish

A Przepiórkowski, M Woliński - … Corpora (LINC-03) at EACL 2003, 2003 - aclanthology.org
The article takes a step back and examines the notion of part of speech (POS), arguing that
POS tagsets should be constructed more carefully and, in effect, should be light in at least …

[PDF][PDF] A Search Tool for Corpora with Positional Tagsets and Ambiguities.

A Przepiórkowski, Z Krynicki, L Debowski, M Wolinski… - LREC, 2004 - lrec-conf.org
This article describes POLIQARP, a corpus indexing and query tool, which understands
positional tagsets and which does not assume that word forms are annotated with unique …

Trigram morphosyntactic tagger for Polish

Ł Dębowski - Intelligent Information Processing and Web Mining …, 2004 - Springer
We introduce an implementation of a plain trigram part-of-speech tagger which appears to
work well on Polish texts. At this moment the tagger achieves 9.4% error rate, which makes it …

[PDF][PDF] The potential of the IPI PAN Corpus

A Przepiórkowski - Poznan Studies in Contemporary Linguistics, 2006 - nlp.ipipan.waw.pl
The aim of this article is to present the IPI PAN Corpus (cf. http://korpus. pl/), a large
morphosyntactically annotated XML encoded corpus of Polish developed at the Institute of …

[PDF][PDF] The IPI PAN Corpus in numbers

A Przepiórkowski - Proceedings of the 2nd Language & …, 2005 - nlp.ipipan.waw.pl
The aim of this article is to present the IPI PAN Corpus (cf. http://korpus. pl/), a large
morphosyntactically annotated XML encoded corpus of Polish developed at the Institute of …

[PDF][PDF] MorphoDiTa-based tagger adapted to the Polish language technology

M Piasecki, W Walentynowicz - … as a Challenge for Computer Science …, 2017 - ltc.amu.edu.pl
We present a new morpho-syntactic tagger for Polish called MorphoIXTa-pl, which is based
on the adaptation of the MorphoIiTa tagger developed originally for the Czech language …