Watset: Automatic induction of synsets from a graph of synonyms

D Ustalov, A Panchenko, C Biemann - arXiv preprint arXiv:1704.07157, 2017 - arxiv.org
This paper presents a new graph-based approach that induces synsets using synonymy
dictionaries and word embeddings. First, we build a weighted graph of synonyms extracted …

Rudsi: graph-based word sense induction dataset for russian

A Aksenova, E Gavrishina, E Rykov… - arXiv preprint arXiv …, 2022 - arxiv.org
We present RuDSI, a new benchmark for word sense induction (WSI) in Russian. The
dataset was created using manual annotation and semi-automatic clustering of Word Usage …

RUSSE'2020: Findings of the First Taxonomy Enrichment Task for the Russian language

I Nikishina, V Logacheva, A Panchenko… - arXiv preprint arXiv …, 2020 - arxiv.org
This paper describes the results of the first shared task on taxonomy enrichment for the
Russian language. The participants were asked to extend an existing taxonomy with …

Synset expansion on translation graph for automatic wordnet construction

G Ercan, F Haziyev - Information Processing & Management, 2019 - Elsevier
Research on clustering algorithms in synonymy graphs of a single language yields
promising results, however, this idea is not yet explored in a multilingual setting …

Taxonomy enrichment with text and graph vector representations

I Nikishina, M Tikhomirov, V Logacheva… - Semantic …, 2022 - content.iospress.com
Abstract Knowledge graphs such as DBpedia, Freebase or Wikidata always contain a
taxonomic backbone that allows the arrangement and structuring of various concepts in …

Studying taxonomy enrichment on diachronic wordnet versions

I Nikishina, A Panchenko, V Logacheva… - arXiv preprint arXiv …, 2020 - arxiv.org
Ontologies, taxonomies, and thesauri are used in many NLP tasks. However, most studies
are focused on the creation of these lexical resources rather than the maintenance of the …

Sense-annotated corpus for Russian

A Kirillovich, N Loukachevitch, M Kulaev… - Proceedings of the …, 2022 - aclanthology.org
We present a sense-annotated corpus for Russian. The resource was obtained my manually
annotating texts from the OpenCorpora corpus, an open corpus for the Russian language …

Multiword expressions in Russian thesauri RuThes and RuWordnet

N Loukachevitch, G Lashevich - 2016 IEEE Artificial Intelligence …, 2016 - ieeexplore.ieee.org
We present the types or multiword expressions included into the thesaurus or Russian
language RuThes. Maoy of these expressions may look like compositiomd expressions but …

ReaderBench: Multilevel analysis of Russian text characteristics

D Corlatescu, Ș Ruseti, M Dascalu - Russian Journal of Linguistics, 2022 - journals.rudn.ru
This paper introduces an adaptation of the open source ReaderBench framework that now
supports Russian multilevel analyses of text characteristics, while integrating both textual …

An unsupervised word sense disambiguation system for under-resourced languages

D Ustalov, D Teslenko, A Panchenko… - arXiv preprint arXiv …, 2018 - arxiv.org
In this paper, we present Watasense, an unsupervised system for word sense
disambiguation. Given a sentence, the system chooses the most relevant sense of each …