Ulysses Tesemõ: a new large corpus for Brazilian legal and governmental domain

FA Siqueira, D Vitório, E Souza, JAP Santos… - Language Resources …, 2024 - Springer
The increasing use of artificial intelligence methods in the legal field has sparked interest in
applying Natural Language Processing techniques to handle legal tasks and reduce the …

Segmenting Brazilian legislative text using weak supervision and active learning

FA Siqueira, D Pressato, FSF Pereira… - Artificial Intelligence and …, 2024 - Springer
Legislative houses all over the world are adopting tools based on artificial intelligence to
support their work. The incorporation of these tools can improve the analysis of the text of the …

Expanding UlyssesNER-Br named entity recognition corpus with informal user-generated text

R Costa, HO Albuquerque, G Silvestre… - EPIA Conference on …, 2022 - Springer
Named Entity Recognition (NER) is a challenging Natural Language Processing
task for a language as rich as Portuguese. When applied in a scenario appropriate to …

On the assessment of deep learning models for named entity recognition of Brazilian legal documents

HO Albuquerque, E Souza, ALI Oliveira… - EPIA Conference on …, 2023 - Springer
A large amount of legal and legislative documents are generated every year with highly
specialized content and significant repercussions on society. Besides technical, the …

RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese

EAS Garcia, NFF Silva, F Siqueira… - Proceedings of the …, 2024 - aclanthology.org
This work investigates the application of Natural Language Processing (NLP) in the legal
context for the Portuguese language, emphasizing the importance of adapting pre-trained …

Ulysses-RFSQ: A novel method to improve legal information retrieval based on relevance feedback

D Vitório, E Souza, L Martins, NFF da Silva… - Brazilian Conference on …, 2022 - Springer
Obtaining relevant legal documents fast, from very large datasets, is essential for the proper
functioning of justice and legislative institutions. Nevertheless, legacy systems currently …

CachacaNER: a dataset for named entity recognition in texts about the cachaça beverage

P Silva, A Franco, T Santos, M Brito… - Language Resources and …, 2023 - Springer
Named Entity Recognition (NER) is the task of identifying and classifying tokens in
texts corresponding to a set of pre-defined categories, such as names of people …

UlyssesNERQ: Expanding Queries from Brazilian Portuguese Legislative Documents through Named Entity Recognition

H Albuquerque, E Souza, T Silva… - Proceedings of the …, 2024 - aclanthology.org
This study presents UlyssesNERQ, a system designed to improve Information Retrieval for
Brazilian Portuguese legislative documents. It uses Named Entity Recognition in Query …

A Named Entity Recognition Approach for Portuguese Legislative Texts Using Self-Learning

RO Nunes, DG Balreira, AS Spritzer… - Proceedings of the …, 2024 - aclanthology.org
Even if technology has made legislative documents more accessible, they are often written
in jargon that makes them hard to understand for ordinary citizens, researchers, journalists …

Reconhecimento de Entidades Nomeadas e Vazamento de Dados em Textos Legislativos

RO Nunes, AS Spritzer, CMDS Freitas… - Linguamática, 2024 - linguamatica.com
This work addresses data leakage in the training of Named Entity Recognition (NER)
models on legislative texts in Brazilian Portuguese, resulting from …