Multiword expression processing: A survey

M Constant, G Eryiğit, J Monti, L Van Der Plas… - Computational …, 2017 - direct.mit.edu
Multiword expressions (MWEs) are a class of linguistic forms spanning conventional word
boundaries that are both idiosyncratic and pervasive across different languages. The …

Improved transition-based parsing by modeling characters instead of words with LSTMs

M Ballesteros, C Dyer, NA Smith - arXiv preprint arXiv:1508.00657, 2015 - arxiv.org
We present extensions to a continuous-state dependency parsing method that makes it
applicable to morphologically rich languages. Starting with a high-performance transition …

Massive choice, ample tasks (MaChAmp): A toolkit for multi-task learning in NLP

R Van Der Goot, A Üstün, A Ramponi, I Sharaf… - arXiv preprint arXiv …, 2020 - arxiv.org
Transfer learning, particularly approaches that combine multi-task learning with pre-trained
contextualized embeddings and fine-tuning, have advanced the field of Natural Language …

Overview of the SPMRL 2013 shared task: A cross-framework evaluation of parsing morphologically rich languages

D Seddah, R Tsarfaty, S Kübler, M Candito… - Proceedings of the …, 2013 - hal.science
This paper reports on the first shared task on statistical parsing of morphologically rich lan-
guages (MRLs). The task features data sets from nine languages, each available both in …

[PDF][PDF] magyarlanc: A tool for morphological and dependency parsing of hungarian

J Zsibrita, V Vincze, R Farkas - Proceedings of the International …, 2013 - aclanthology.org
Hungarian is the stereotype of morphologically rich and free word order languages. Here,
we introduce magyarlanc, a natural language toolkit developed for the linguistic …

Building the essential resources for Finnish: the Turku Dependency Treebank

K Haverinen, J Nyblom, T Viljanen, V Laippala… - Language Resources …, 2014 - Springer
In this paper, we present the final version of a publicly available treebank of Finnish, the
Turku Dependency Treebank. The treebank contains 204,399 tokens (15,126 sentences) …

[PDF][PDF] Introducing the SPMRL 2014 shared task on parsing morphologically-rich languages

D Seddah, S Kübler, R Tsarfaty - … of the First Joint Workshop on …, 2014 - aclanthology.org
This first joint meeting on Statistical Parsing of Morphologically Rich Languages and
Syntactic Analysis of Non-Canonical English (SPMRL-SANCL) featured a shared task on …

MorphyNet: a large multilingual database of derivational and inflectional morphology

K Batsuren, G Bella, F Giunchiglia - Proceedings of the 18th …, 2021 - aclanthology.org
Large-scale morphological databases provide essential input to a wide range of NLP
applications. Inflectional data is of particular importance for morphologically rich …

What's in an embedding? Analyzing word embeddings through multilingual evaluation

A Köhn - 2015 - edoc.sub.uni-hamburg.de
In the last two years, there has been a surge of word embedding algorithms and research on
them. However, evaluation has mostly been carried out on a narrow set of tasks, mainly …

[PDF][PDF] E-magyar--A Digital Language Processing System

T Váradi, E Simon, B Sass, I Mittelholcz, A Novák… - 2018 - real.mtak.hu
Abstract e-magyar is a new toolset for the analysis of Hungarian texts. It was produced as a
collaborative effort of the Hungarian language technology community integrating the best …