Assessing linguistic generalisation in language models: a dataset for Brazilian Portuguese

R Wilkens, L Zilio, A Villavicencio - Language Resources and Evaluation, 2024 - Springer
Much recent effort has been devoted to creating large-scale language models. Nowadays,
the most prominent approaches are based on deep neural networks, such as BERT …

Evaluating semantic similarity methods to build semantic predictability norms of reading data

S Leal, E Casanova, G Paetzold, S Aluísio - International Conference on …, 2021 - Springer
Predictability corpora built via Cloze task generally accompany eye-tracking data for the
study of processing costs of linguistic structures in tasks of reading for comprehension. Two …

Reconhecimento do vocabulário de jornais populares brasileiros por um dicionário computacional de acesso livre

MJB Finatto, OA Vale, É Laporte - … Revista de Linguística (São José do …, 2019 - SciELO Brasil
We report an experiment that checks the identification of a universe of words from
popular written Portuguese by two versions of a computational dictionary of Portuguese …

Recognizing the vocabulary of Brazilian popular newspapers with a free-access computational dictionary

MJB Finatto, OA Vale, É Laporte - … Revista de Linguística (São José do …, 2019 - SciELO Brasil
ABSTRACT: We report an experiment to check the identification of a set of words in
popular written Portuguese with two versions of a computational dictionary of Brazilian …