K Gorman, LFE Ashby, A Goyzueta… - Proceedings of the …, 2020 - aclanthology.org
We describe the design and findings of the SIGMORPHON 2020 shared task on multilingual grapheme-to-phoneme conversion. Participants were asked to submit systems which take in …
S Wu, R Cotterell - arXiv preprint arXiv:1905.06319, 2019 - arxiv.org
Many common character-level, string-to-string transduction tasks, e.g., grapheme-to-phoneme conversion and morphological inflection, consist almost exclusively of monotonic …
Development sets are impractical to obtain for real low-resource languages, since using all available data for training is often more effective. However, development sets are widely …
S Moeller, L Liu, C Yang… - Proceedings of the …, 2020 - aclanthology.org
An intermediate step in the linguistic analysis of an under-documented language is to find and organize inflected forms that are attested in natural speech. From this data, linguists …
L Liu, M Hulden - arXiv preprint arXiv:2104.06483, 2021 - arxiv.org
Deep learning sequence models have been successfully applied to the task of morphological inflection. The results of the SIGMORPHON shared tasks in the past several …
Automatic morphological processing can aid downstream natural language processing applications, especially for low-resource languages, and assist language documentation …
H Jin, L Cai, Y Peng, C Xia, AD McCarthy… - arXiv preprint arXiv …, 2020 - arxiv.org
We propose the task of unsupervised morphological paradigm completion. Given only raw text and a lemma list, the task consists of generating the morphological paradigms, i.e., all …
Canonical morphological segmentation consists of dividing words into their standardized morphemes. Here, we are interested in approaches for the task when training data is limited …
Grapheme-to-phoneme conversion is an important component in many speech technologies, but until recently there were no multilingual benchmarks for this task. The …