作者
Hugo Gonçalo Oliveira, Ricardo Rodrigues, Bruno Ferreira, Purificação Silvano, Sara Carvalho
发表日期
2024/3
研讨会论文
Proceedings of the 16th International Conference on Computational Processing of Portuguese
页码范围
207-217
简介
This paper presents BATS-PT, the manual translation of the lexicographic portion of the Bigger Analogy Test Set (BATS) to European Portuguese. BATS-PT covers ten types of lexicosemantic analogies and can be used for assessing word embeddings and language models. Following this, the dataset is showcased while assessing two pretrained language models for Portuguese, BERTimbau and Albertina, in two tasks: analogy solving and relation completion, both in zero- and few-shot mask-prediction approaches. Experiments reveal different performance across relations and, in both tasks, the best overall performance was achieved with BERTimbau, in a five-shot scenario. We further discuss the limitations of the reported experiments and directions towards future improvements in these tasks
学术搜索中的文章
HG Oliveira, R Rodrigues, B Ferreira, MP Silvano… - Proceedings of the 16th International Conference on …, 2024