Automated phrase mining from massive text corpora

J Shang, J Liu, M Jiang, X Ren… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
As one of the fundamental tasks in text analysis, phrase mining aims at extracting quality
phrases from a text corpus and has various downstream applications including information …

Mining quality phrases from massive text corpora

J Liu, J Shang, C Wang, X Ren, J Han - Proceedings of the 2015 ACM …, 2015 - dl.acm.org
Text data are ubiquitous and play an essential role in big data applications. However, text
data are mostly unstructured. Transforming unstructured text into structured units (eg …

Termeval 2020: Shared task on automatic term extraction using the annotated corpora for term extraction research (acter) dataset

A Rigouts Terryn, V Hoste, P Drouin… - 6th International …, 2020 - biblio.ugent.be
The TermEval 2020 shared task provided a platform for researchers to work on automatic
term extraction (ATE) with the same dataset: the Annotated Corpora for Term Extraction …

Multiword expression aware neural machine translation

A Zaninello, A Birch - … of the Twelfth Language Resources and …, 2020 - aclanthology.org
Abstract Multiword Expressions (MWEs) are a frequently occurring phenomenon found in all
natural languages that is of great importance to linguistic theory, natural language …

Tint 2.0: an All-inclusive Suite for NLP in Italian

A Palmero Aprosio, G Moretti - … of the Fifth Italian Conference on …, 2018 - cris.fbk.eu
In this we paper present Tint 2.0, an open-source, fast and extendable Natural Language
Processing suite for Italian based on Stanford CoreNLP. The new release includes some …

A generic and open framework for multiword expressions treatment: from acquisition to applications

C Ramisch - 2012 - hal.science
The treatment of multiword expressions (MWEs), like take off, bus stop and big deal, is a
challenge for NLP applications. This kind of linguistic construction is not only arbitrary but …

[PDF][PDF] A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words.

N Hartmann, L Avanço, PP Balage Filho, MS Duran… - LREC, 2014 - pedrobalage.com
Web 2.0 has allowed a never imagined communication boom. With the widespread use of
computational and mobile devices, anyone, in practically any language, may post comments …

[PDF][PDF] Identification and treatment of multiword expressions applied to information retrieval

O Acosta, A Villavicencio, V Moreira - Proceedings of the …, 2011 - aclanthology.org
The extensive use of Multiword Expressions (MWE) in natural language texts prompts more
detailed studies that aim for a more adequate treatment of these expressions. A MWE …

[PDF][PDF] jmwe: A java toolkit for detecting multi-word expressions

N Kulkarni, M Finlayson - … : from Parsing and Generation to the …, 2011 - aclanthology.org
Abstract jMWE is a Java library for implementing and testing algorithms that detect Multi-
Word Expression (MWE) tokens in text. It provides (1) a detector API, including …

NLP for The Greek Language: A Longer Survey

K Papantoniou, Y Tzitzikas - arXiv preprint arXiv:2408.10962, 2024 - arxiv.org
English language is in the spotlight of the Natural Language Processing (NLP) community
with other languages, like Greek, lagging behind in terms of offered methods, tools and …