Anchor-based bilingual word embeddings for low-resource languages

HH Nigatu, AL Tonja, B Rosman, T Solorio… - arXiv preprint arXiv …, 2024 - arxiv.org

The disparity in the languages commonly studied in Natural Language Processing (NLP) is
typically reflected by referring to languages as low vs high-resourced. However, there is …

被引用次数：3 相关文章所有 3 个版本

[PDF] aclanthology.org

DM-BLI: Dynamic multiple subspaces alignment for unsupervised bilingual lexicon induction

L Hu, Y Xu - Proceedings of the 62nd Annual Meeting of the …, 2024 - aclanthology.org

Unsupervised bilingual lexicon induction (BLI) task aims to find word translations between
languages and has achieved great success in similar language pairs. However, related …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Isovec: Controlling the relative isomorphism of word embedding spaces

K Marchisio, N Verma, K Duh, P Koehn - arXiv preprint arXiv:2210.05098, 2022 - arxiv.org

The ability to extract high-quality translation dictionaries from monolingual word embedding
spaces depends critically on the geometric similarity of the spaces--their degree of" …

被引用次数：8 相关文章所有 4 个版本

[PDF] arxiv.org

Graph-based multilingual label propagation for low-resource part-of-speech tagging

A Imani, S Severini, MJ Sabet, F Yvon… - arXiv preprint arXiv …, 2022 - arxiv.org

Part-of-Speech (POS) tagging is an important component of the NLP pipeline, but many low-
resource languages lack labeled data for training. An established method for training a POS …

被引用次数：8 相关文章所有 24 个版本

[PDF] aclanthology.org

Do not neglect related languages: The case of low-resource Occitan cross-lingual word embeddings

L Woller, V Hangya, A Fraser - … of the 1st Workshop on Multilingual …, 2021 - aclanthology.org

Cross-lingual word embeddings (CLWEs) have proven indispensable for various natural
language processing tasks, eg, bilingual lexicon induction (BLI). However, the lack of data …

被引用次数：9 相关文章所有 4 个版本

[PDF] aclanthology.org

How to encode arbitrarily complex morphology in word embeddings, no corpus needed

L Schwartz, C Haley, F Tyers - … of the first workshop on NLP …, 2022 - aclanthology.org

In this paper, we present a straightforward technique for constructing interpretable word
embeddings from morphologically analyzed examples (such as interlinear glosses) for all of …

被引用次数：5 相关文章所有 3 个版本

[PDF] ed.ac.uk

Improving translation of out of vocabulary words using bilingual lexicon induction in low-resource machine translation

J Waldendorf, A Birch, B Haddow… - … Biennial Conference of …, 2022 - research.ed.ac.uk

Dictionary-based data augmentation techniques have been used in the field of domain
adaptation to learn words that do not appear in the parallel training data of a machine …

被引用次数：6 相关文章所有 4 个版本

[PDF] arxiv.org

被引用次数：1 相关文章所有 2 个版本

高级搜索

QQ 群