SemEval-2022 task 2: Multilingual idiomaticity detection and sentence embedding

HT Madabushi, E Gow-Smith, M Garcia… - arXiv preprint arXiv …, 2022 - arxiv.org
This paper presents the shared task on Multilingual Idiomaticity Detection and Sentence
Embedding, which consists of two subtasks:(a) a binary classification task aimed at …

Probing for idiomaticity in vector space models

M Garcia, TK Vieira, C Scarton… - Proceedings of the …, 2021 - eprints.whiterose.ac.uk
Contextualised word representation models have been successfully used for capturing
different word usages and they may be an attractive alternative for representing idiomaticity …

AStitchInLanguageModels: Dataset and methods for the exploration of idiomaticity in pre-trained language models

HT Madabushi, E Gow-Smith, C Scarton… - arXiv preprint arXiv …, 2021 - arxiv.org
Despite their success in a variety of NLP tasks, pre-trained language models, due to their
heavy reliance on compositionality, fail in effectively capturing the meanings of multiword …

Semantics of Multiword Expressions in Transformer-Based Models: A Survey

F Miletić, SS Walde - … of the Association for Computational Linguistics, 2024 - direct.mit.edu
Multiword expressions (MWEs) are composed of multiple words and exhibit variable
degrees of compositionality. As such, their meanings are notoriously difficult to model, and it …

Compositionality and Sentence Meaning: Comparing Semantic Parsing and Transformers on a Challenging Sentence Similarity Dataset

J Fodor, S De Deyne, S Suzuki - Computational Linguistics, 2024 - direct.mit.edu
One of the major outstanding questions in computational semantics is how humans integrate
the meaning of individual words into a sentence in a way that enables understanding of …

Assessing idiomaticity representations in vector models with a noun compound dataset labeled at type and token levels

M Garcia, T Kramer Vieira, C Scarton… - Proceedings of ACL …, 2021 - eprints.whiterose.ac.uk
Accurate assessment of the ability of embedding models to capture idiomaticity may require
evaluation at token rather than type level, to account for degrees of idiomaticity and possible …

A systematic search for compound semantics in pretrained BERT architectures

F Miletić, SS im Walde - Proceedings of the 17th Conference of the …, 2023 - aclanthology.org
To date, transformer-based models such as BERT have been less successful in predicting
compositionality of noun compounds than static word embeddings. This is likely related to a …

Are representations built from the ground up? an empirical examination of local composition in language models

E Liu, G Neubig - arXiv preprint arXiv:2210.03575, 2022 - arxiv.org
Compositionality, the phenomenon where the meaning of a phrase can be derived from its
constituent parts, is a hallmark of human language. At the same time, many phrases are non …

How well do embedding models capture non-compositionality? a view from multiword expressions

N Nandakumar, T Baldwin, B Salehi - Proceedings of the 3rd …, 2019 - aclanthology.org
In this paper, we apply various embedding methods on multiword expressions to study how
well they capture the nuances of non-compositional data. Our results from a pool of word …

A comparison of statistical association measures for identifying dependency-based collocations in various languages.

M Garcia, MG Salido, MA Ramos - Proceedings of the joint …, 2019 - aclanthology.org
This paper presents an exploration of different statistical association measures to
automatically identify collocations from corpora in English, Portuguese, and Spanish. To …