SimAlign: High quality word alignments without parallel training data using static and contextualized embeddings

MJ Sabet, P Dufter, F Yvon, H Schütze - arXiv preprint arXiv:2004.08728, 2020 - arxiv.org
Word alignments are useful for tasks like statistical and neural machine translation (NMT)
and cross-lingual annotation projection. Statistical word aligners perform well, as do …

Modeling language variation and universals: A survey on typological linguistics for natural language processing

EM Ponti, H O'Horan, Y Berzak, I Vulić… - Computational …, 2019 - direct.mit.edu
Linguistic typology aims to capture structural and semantic variation across the world's
languages. A large-scale typology could provide excellent guidance for multilingual Natural …

Learning language representations for typology prediction

C Malaviya, G Neubig, P Littell - arXiv preprint arXiv:1707.09569, 2017 - arxiv.org
One central mystery of neural NLP is what neural models "know" about their subject matter.
When a neural machine translation system learns to translate from one language to another …

Survey on the use of typological information in natural language processing

H O'Horan, Y Berzak, I Vulić, R Reichart… - arXiv preprint arXiv …, 2016 - arxiv.org
In recent years linguistic typology, which classifies the world's languages according to their
functional and structural properties, has been widely used to support multilingual NLP. While …

Automating gloss generation in interlinear glossed text

A McMillan-Major - Society for Computation in …, 2020 - openpublishing.library.umass.edu
Interlinear Glossed Text (IGT) is a rich data type produced by linguists for the
purposes of presenting an analysis of a language's semantic and grammatical properties. I …

Linguistic typology in natural language processing

EM Bender - Linguistic Typology, 2016 - degruyter.com
This paper explores the ways in which the field of natural language processing (NLP) can
and does benefit from work in linguistic typology. I describe the recent increase in interest in …

PMI-Align: Word alignment with point-wise mutual information without requiring parallel training data

F Azadi, H Faili, MJ Dousti - Findings of the Association for …, 2023 - aclanthology.org
Word alignment has many applications including cross-lingual annotation projection,
bilingual lexicon extraction, and the evaluation or analysis of translation outputs. Recent …

Graph neural networks for multiparallel word alignment

A Imani, LK Şenel, MJ Sabet, F Yvon… - arXiv preprint arXiv …, 2022 - arxiv.org
After a period of decrease, interest in word alignments is increasing again for their
usefulness in domains such as typological research, cross-lingual annotation projection …

Learning grammar specifications from IGT: A case study of Chintang

EM Bender, J Crowgey, MW Goodman… - Proceedings of the 2014 …, 2014 - aclanthology.org
We present a case study of the methodology of using information extracted from interlinear
glossed text (IGT) to create actual working HPSG grammar fragments using the Grammar …

Towards creating precision grammars from interlinear glossed text: Inferring large-scale typological properties

EM Bender, MW Goodman, J Crowgey… - Proceedings of the 7th …, 2013 - aclanthology.org
We propose to bring together two kinds of linguistic resources—interlinear glossed text (IGT)
and a language-independent precision grammar resource—to automatically create …