T Kiss, J Strunk - Computational linguistics, 2006 - direct.mit.edu
In this article, we present a language-independent, unsupervised approach to sentence boundary detection. It is based on the assumption that a large number of ambiguities in the …
Corpus-based research has seen broad development in the Brazilian context over the last decade. Its relevance and pertinence are evident in the domains of Linguistics …
SSL Piao, F Bianchi, C Dayrell… - Proceedings of the …, 2015 - aclanthology.org
This paper reports on our research to generate multilingual semantic lexical resources and develop multilingual semantic annotation software, which assigns each word in running text …
We report on solutions we adopted for the specific issues that arise when developing new automatic taggers for Portuguese, solutions whose design is general enough, we believe, to …
CN Silla Jr, CAA Kaestner - … Conference on Intelligent Text Processing and …, 2004 - Springer
In this paper we present a study comparing the performance of different systems found in the literature that perform the task of automatic text segmentation into sentences for English …
In this paper we discuss the five requirements for building large publicly available corpora that guided the construction of the Lácio-Web corpora and their environments: 1) a …
R López, TAS Pardo - … Linguistics and Intelligent Text Processing: 16th …, 2015 - Springer
Sentence Boundary Detection (SBD) is a very important prerequisite for proper sentence analysis in different Natural Language Processing tasks. During the last years …
The large amount of text produced every day on the Web has made it one of the main sources for obtaining linguistic corpora, which are further analyzed with Natural Language Processing …
In one of its definitions, Terminology represents the set of principles and methods adopted in the process of managing and creating terminological products, such as glossaries …