Rule-based automatic multi-word term extraction and lemmatization

R Stanković, C Krstev, I Obradović, B Lazić, A Trtovac - LREC, 2016 - hal.science
In this paper we present a rule-based method for multi-word term extraction that relies on
extensive lexical resources in the form of electronic dictionaries and finite-state transducers …

От коллокаций к конструкциям

ЛМ Пивоварова, ЕВ Ягунова - Acta linguistica Petropolitana. Труды …, 2014 - cyberleninka.ru
В статье обсуждаются направления исследования неоднослов-ных единиц,
выделяемых статистическим образом; статистические характеристики, описывающие …

Extending lexical association measures for collocation extraction

S Petrović, J Šnajder, BD Bašić - Computer Speech & Language, 2010 - Elsevier
Collocations are linguistic phenomena that occur when two or more words appear together
more often than by chance and whose meaning often cannot be inferred from the meanings …

Comparison of collocation extraction measures for document indexing

S Petrovic, J Snajder, B Dalbelo-Basic… - 28th International …, 2006 - ieeexplore.ieee.org
Automatic extraction of collocations from a corpus is a well-known problem in the field of
natural language processing. It is typically carried out by employing some kind of a statistical …

Semi-automatic extraction of multiword terms from domain-specific corpora

V Pajić, S Vujičić Stanković, R Stanković… - The Electronic …, 2018 - emerald.com
Purpose A hybrid approach is presented, which combines linguistic and statistical
information to semi-automatically extract multiword term candidates from texts …

Multi-word discourse markers and their corpus-driven identification: The case of MWDM extraction from the reference corpus of spoken Slovene

K Dobrovoljc - International journal of corpus linguistics, 2017 - jbe-platform.com
With expanding evidence on the formulaic nature of human communication, there is a
growing need to extend discourse marker research to functionally analogue multi-word …

Automatic extraction of arabic multi-word terms

K Al Khatib, A Badarneh - Proceedings of the International …, 2010 - ieeexplore.ieee.org
Whereas a wide range of methods has been conducted to English multi-word terms (MWTs)
extraction, relatively few studied have been applied to Arabic MWTs extraction. In this paper …

A study of association measures and their combination for Arabic MWT extraction

AE Mahdaouy, SEL Ouatik, E Gaussier - arXiv preprint arXiv:1409.3005, 2014 - arxiv.org
Automatic Multi-Word Term (MWT) extraction is a very important issue to many applications,
such as information retrieval, question answering, and text categorization. Although many …

[PDF][PDF] * MWELex–MWE Lexica of Croatian, Slovene and Serbian Extracted from Parsed Corpora

N Ljubešić, K Dobrovoljc, D Fišer - Informatica, 2015 - informatica.si
V prispevku predstavimo vecjezicni leksikon* MWELex, ki vsebuje hrvaške, slovenske in
srbske vecbesedne zveze, ki smo jih izlušcili iz skladenjsko oznacenih korpusov. Leksikon …

[PDF][PDF] Automatic detection of stable grammatical features in n-grams

M Kopotev, L Pivovarova… - Workshop on …, 2013 - researchportal.helsinki.fi
This paper presents an algorithm that allows the user to issue a query pattern, collects multi-
word expressions (MWEs) that match the pattern, and then ranks them in a uniform fashion …