XL-WSD: An extra-large and cross-lingual evaluation framework for word sense disambiguation

T Pasini, A Raganato, R Navigli - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
Transformer-based architectures brought a breeze of change to Word Sense
Disambiguation (WSD), improving models' performances by a large margin. The fast …

[PDF][PDF] Natural language processing methods for language modeling

DM Nemeskey - 2020 - hlt.bme.hu
The field of natural language processing (NLP) is contemporaneous with computers.
Machine translation systems were developed as early as the 1950s, and the widespread …

DanNet: the challenge of compiling a wordnet for Danish by reusing a monolingual dictionary

BS Pedersen, S Nimb, J Asmussen… - Language resources …, 2009 - Springer
This paper is a contribution to the discussion on compiling computational lexical resources
from conventional dictionaries. It describes the theoretical as well as practical problems that …

XL-WiC: A multilingual benchmark for evaluating semantic contextualization

A Raganato, T Pasini… - Proceedings of the …, 2020 - aclanthology.org
The ability to correctly model distinct meanings of a word is crucial for the effectiveness of
semantic representation techniques. However, most existing evaluation benchmarks for …

Wikipedia entities as rendezvous across languages: Grounding multilingual language models by predicting Wikipedia hyperlinks

I Calixto, A Raganato, T Pasini - … of the 2021 Conference of the …, 2021 - aclanthology.org
Masked language models have quickly become the de facto standard when processing text.
Recently, several approaches have been proposed to further enrich word representations …

[PDF][PDF] Light verb constructions in the SzegedParalellFX English-Hungarian parallel corpus

V Vincze - 2012 - lrec.elra.info
In this paper, we describe the first English–Hungarian parallel corpus annotated for light
verb constructions, which contains 14,261 sentence alignment units. Annotation principles …

Building a production-ready multi-label classifier for legal documents with digital-twin-distiller

GM Csányi, R Vági, D Nagy, I Üveges, JP Vadász… - Applied Sciences, 2022 - mdpi.com
One of the most time-consuming parts of an attorney's job is finding similar legal cases.
Categorization of legal documents by their subject matter can significantly increase the …

[PDF][PDF] Headedness, again

M Polinsky - 2012 - dash.harvard.edu
Headedness is an intriguing feature of language design. On the one hand, headedness
manifests itself very clearly; preposed relative clauses are visibly different from postposed …

[图书][B] Semi-compositional noun+ verb constructions: Theoretical questions and computational linguistic analyses

V Vincze - 2013 - academia.edu
In this thesis, a subtype of multiword expressions, namely, semi-compositional constructions
will be analyzed from the perspectives of theoretical and computational linguistics. Multiword …

[PDF][PDF] The Role of Parallel Corpora in Bilingual Lexicography.

E Héja - LREC, 2010 - lexitron.nectec.or.th
This paper describes an approach based on word alignment on parallel corpora, which aims
at facilitating the lexicographic work of dictionary building. Although this method has been …