The SIGMORPHON 2022 shared task on morpheme segmentation

K Batsuren, G Bella, A Arora, V Martinović… - arXiv preprint arXiv …, 2022 - arxiv.org
The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to
decompose a word into a sequence of morphemes and covered most types of morphology …

The SIGMORPHON 2020 shared task on multilingual grapheme-to-phoneme conversion

K Gorman, LFE Ashby, A Goyzueta… - Proceedings of the …, 2020 - aclanthology.org
We describe the design and findings of the SIGMORPHON 2020 shared task on multilingual
grapheme-to-phoneme conversion. Participants were asked to submit systems which take in …

Exact hard monotonic attention for character-level transduction

S Wu, R Cotterell - arXiv preprint arXiv:1905.06319, 2019 - arxiv.org
Many common character-level, string-to-string transduction tasks, e.g. grapheme-to-phoneme
conversion and morphological inflection, consist almost exclusively of monotonic …

Towards realistic practices in low-resource natural language processing: The development set

K Kann, K Cho, SR Bowman - arXiv preprint arXiv:1909.01522, 2019 - arxiv.org
Development sets are impractical to obtain for real low-resource languages, since using all
available data for training is often more effective. However, development sets are widely …

IGT2P: From interlinear glossed texts to paradigms

S Moeller, L Liu, C Yang… - Proceedings of the …, 2020 - aclanthology.org
An intermediate step in the linguistic analysis of an under-documented language is to find
and organize inflected forms that are attested in natural speech. From this data, linguists …

Can a transformer pass the wug test? Tuning copying bias in neural morphological inflection models

L Liu, M Hulden - arXiv preprint arXiv:2104.06483, 2021 - arxiv.org
Deep learning sequence models have been successfully applied to the task of
morphological inflection. The results of the SIGMORPHON shared tasks in the past several …

Morphological Processing of Low-Resource Languages: Where We Are and What's Next

A Wiemerslage, M Silfverberg, C Yang… - arXiv preprint arXiv …, 2022 - arxiv.org
Automatic morphological processing can aid downstream natural language processing
applications, especially for low-resource languages, and assist language documentation …

Unsupervised morphological paradigm completion

H Jin, L Cai, Y Peng, C Xia, AD McCarthy… - arXiv preprint arXiv …, 2020 - arxiv.org
We propose the task of unsupervised morphological paradigm completion. Given only raw
text and a lemma list, the task consists of generating the morphological paradigms, i.e., all …

Tackling the low-resource challenge for canonical segmentation

M Mager, Ö Çetinoğlu, K Kann - arXiv preprint arXiv:2010.02804, 2020 - arxiv.org
Canonical morphological segmentation consists of dividing words into their standardized
morphemes. Here, we are interested in approaches for the task when training data is limited …

Results of the second SIGMORPHON shared task on multilingual grapheme-to-phoneme conversion

LFE Ashby, TM Bartley, S Clematide… - Proceedings of the …, 2021 - aclanthology.org
Grapheme-to-phoneme conversion is an important component in many speech
technologies, but until recently there were no multilingual benchmarks for this task. The …