Understandable big data: a survey

CK Emani, N Cullot, C Nicolle - Computer science review, 2015 - Elsevier
This survey presents the concept of Big Data. Firstly, a definition and the features of Big Data
are given. Secondly, the different steps for Big Data data processing and the main problems …

GATECloud. net: a platform for large-scale, open-source text processing on the cloud

V Tablan, I Roberts… - … Transactions of the …, 2013 - royalsocietypublishing.org
Cloud computing is increasingly being regarded as a key enabler of the 'democratization of
science', because on-demand, highly scalable cloud computing facilities enable researchers …

Bringing semantics into historical archives with computer-aided rich metadata generation

D Colla, A Goy, M Leontino, D Magro… - Journal on Computing and …, 2022 - dl.acm.org
This article relies on the idea that a semantically rich metadata layer is required in order to
provide an effective, intelligent, and engaging access to historical archives. However …

An accuracy-enhanced light stemmer for arabic text

SR El-Beltagy, A Rafea - ACM Transactions on Speech and Language …, 2010 - dl.acm.org
Stemming is a key step in most text mining and information retrieval applications. Information
extraction, semantic annotation, as well as ontology learning are but a few examples where …

Linguistic information extraction for job ads (SIRE project)

R Loth, D Battistelli, FR Chaumartin… - … and Fusion of …, 2010 - shs.hal.science
As a text, each job advertisement expresses rich information about the occupation at hand,
such as competence needs (ie required degrees, field knowledge, task expertise or …

FOCIH: Form-based ontology creation and information harvesting

C Tao, DW Embley, SW Liddle - … , Gramado, Brazil, November 9-12, 2009 …, 2009 - Springer
Creating an ontology and populating it with data are both labor-intensive tasks requiring a
high degree of expertise. Thus, scaling ontology creation and population to the size of the …

[PDF][PDF] Knowledge graph construction to facilitate chemical compound hazard assessment in the toxin project

G Vrijens - 2023 - cris.vub.be
This master thesis presents a method for integrating multiple data sources from the field of
toxicology into a knowledge graph and linking it with the TOXIN knowledge graph to …

[PDF][PDF] An evaluation of annotation tools for biomedical texts.

KT Belloze, DISB Monteiro, TF Lima, FP Silva Jr… - ONTOBRAS-MOST, 2012 - Citeseer
Biomedical texts are a rich information source that cannot be ignored. There are several text
annotation tools that may be used to extract useful information from these texts. However …

Scalable semantic annotation of text using lexical and web resources

E Zavitsanos, G Tsatsaronis, I Varlamis… - … : Theories, Models and …, 2010 - Springer
In this paper we are dealing with the task of adding domain-specific semantic tags to a
document, based solely on the domain ontology and generic lexical and Web resources. In …

Multimodal Legal Information Retrieval

KJ Adebayo - 2018 - amsdottorato.unibo.it
The goal of this thesis is to present a multifaceted way of inducing semantic representation
from legal documents as well as accessing information in a precise and timely manner. The …