Linguateca: um centro de recursos distribuído para o processamento computacional da língua...

JA Wagner Filho, R Wilkens, M Idiart… - Proceedings of the …, 2018 - aclanthology.org

In this work, we present the construction process of a large Web corpus for Brazilian
Portuguese, aiming to achieve a size comparable to the state of the art in other languages …

Cited by 159 · 8 versions

Indexing portuguese nlp resources with pt-pump-up

R Almeida, R Campos, A Jorge, S Nunes - arXiv preprint arXiv …, 2024 - arxiv.org

The recent advances in natural language processing (NLP) are linked to training processes
that require vast amounts of corpora. Access to this data is commonly not a trivial process …

Cited by 2 · 4 versions

Caminhos percorridos no mapa da portuguesificação: A Linguateca em perspectiva

D Santos - Linguamática, 2009 - linguamatica.com

Cited by 33 · 9 versions

Intensification in portuguese: A cross-dialectal study of muito and bem

C Lívio, C Howe - Hispania, 2020 - muse.jhu.edu

Intensifiers have been the focus of a number of studies over the past decade, with
considerable interest in their meaning and variability. Several scholars have discussed the …

Cited by 6 · 4 versions

Investigating lexical NP-chunking with universal dependencies for portuguese

AT Souza, EES Ruiz - Anais do 19º Encontro Nacional de …, 2022 - repositorio.usp.br

The task of shallow parsing consists of retrieving a limited amount of syntactic information
from sentences written in natural language. This work aims to identify and extract a particular …

Cited by 1 · 2 versions

Compiling and processing historical and contemporary portuguese corpora

M Zampieri - arXiv preprint arXiv:1710.00803, 2017 - arxiv.org

This technical report describes the framework used for processing three large Portuguese
corpora. Two corpora contain texts from newspapers, one published in Brazil and the other …

Cited by 5 · 3 versions

SUPeRB: sistema uniformizado de pesquisa de referências bibliográficas

LM Cabral - 2006 - repositorio-aberto.up.pt

Scientific publications are an important element of scientific research in any
domain. On the one hand, they are representative of the state of the art in that domain; …

Cited by 8 · 3 versions

Medindo o precipício semântico

N Cardoso - Linguamática, 2012 - linguamatica.com

This article describes my participation in the Págico joint evaluation and details the
strategy followed for that participation, which used a system for retrieving …

Cited by 5 · 5 versions

A supervised machine learning method for word sense disambiguation of Portuguese nouns

M Zampieri - Bulletin de Linguistique Appliquée et Générale (BULAG), 2010 - string.l2f.inesc-id.pt

ABSTRACT Word Sense Disambiguation (WSD) is vital in many Natural Language
Processing (NLP) applications. This work aims to explore supervised machine learning …

Cited by 5 · 7 versions

Representação em XML da Floresta Sintáctica

R Vilela, AM Simões, E Bick… - In José Carlos …, 2005 - comum.rcaap.pt

Floresta Sintáctica is a linguistic resource maintained and freely distributed to the public
by Linguateca. Given the need to reach a larger community of linguists and …

Cited by 6 · 11 versions
