Towards corpora creation from social web in Brazilian Portuguese to support public security analyses and decisions

VDH de Carvalho, APCS Costa - Library Hi Tech, 2024 - emerald.com
Purpose This article presents two Brazilian Portuguese corpora collected from different
media concerning public security issues in a specific location. The primary motivation is …

[PDF][PDF] Quotation extraction for portuguese

WPD Fernandes, E Motta… - Proceedings of the 8th …, 2011 - aclanthology.org
Quotation extraction consists of identifying quotations and their authors. In this work, we
present a Quotation Extraction system for Portuguese that is based on Entropy Guided …

[图书][B] Entropy guided transformation learning

CN Dos Santos, RL Milidiú, CN dos Santos, RL Milidiú - 2012 - Springer
This chapter details the entropy guided transformation learning algorithm [8, 23]. ETL is an
effective way to overcome the transformation based learning bottleneck: the construction of …

A machine learning approach to Portuguese clause identification

ER Fernandes, CN dos Santos, RL Milidiú - International Conference on …, 2010 - Springer
In this work, we apply and evaluate a machine-learning-based system to Portuguese clause
identification. To the best of our knowledge, this is the first machine-learning-based …

Training state-of-the-art Portuguese POS taggers without handcrafted features

CN dos Santos, B Zadrozny - … Conference, PROPOR 2014, São Carlos/SP …, 2014 - Springer
Abstract Part-of-speech (POS) tagging for morphologically rich languages normally requires
the use of handcrafted features that encapsulate clues about the language's morphology. In …

Developing a Persian chunker using a hybrid approach

S Kiani, T Akhavan, M Shamsfard - … on Computer Science and …, 2009 - ieeexplore.ieee.org
Text segmentation is the process of recognizing boundaries of text constituents, such as
sentences, phrases and words. This paper focuses on phrase segmentation also known as …

A token classification approach to dependency parsing

RL Milidiu, CEM Crestana… - 2009 Seventh Brazilian …, 2009 - ieeexplore.ieee.org
The Dependency-based syntactic parsing task consists in identifying a head word for each
word in an input sentence. Hence, its output is a rooted tree where the nodes are the words …

[PDF][PDF] Portuguese language processing service

ER Fernandes, RL Milidiu, CN Santos - 2009 - ambuehler.ethz.ch
ABSTRACT Current Natural Language Processing tools provide shallow semantics for
textual data. These kind of knowledge could be used in the Semantic Web. In this paper, we …

Reconhecimento de entidades mencionadas em português utilizando aprendizado de máquina

WS Carvalho - 2012 - teses.usp.br
O Reconhecimento de Entidades Mencionadas (REM) é uma subtarefa da extração de
informações e tem como objetivo localizar e classificar elementos do texto em categorias …

Clause identification using entropy guided transformation learning

ER Fernandes, BA Pires… - … in Information and …, 2009 - ieeexplore.ieee.org
Entropy Guided Transformation Learning (ETL) is a machine learning strategy that extends
Transformation Based Learning by providing automatic template generation. In this work, we …