On the data requirements of probing

Ju Tianjie, Liu Gongshen, Zhang Zhuosheng, Zhang Ru - Chinese Journal of Computers, 2024 - cjc.ict.ac.cn

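Abstract: With the widespread application of large-scale pre-trained models, many areas of natural language processing (such as text classification and machine translation)
have made considerable progress. However, limited by the "black-box" nature of pre-trained models, their internal decision patterns and the knowledge they encode …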

Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models

A Waldis, Y Perlitz, L Choshen, Y Hou… - Transactions of the …, 2024 - direct.mit.edu

We introduce Holmes, a new benchmark designed to assess language models' (LMs')
linguistic competence—their unconscious understanding of linguistic phenomena …

Predicting fine-tuning performance with probing

Z Zhu, S Shahtalebi, F Rudzicz - arXiv preprint arXiv:2210.07352, 2022 - arxiv.org

Large NLP models have recently shown impressive performance in language
understanding tasks, typically evaluated by their fine-tuned performance. Alternatively …

Cited by 10 · Related articles · All 4 versions

A State-Vector Framework for Dataset Effects

E Sahak, Z Zhu, F Rudzicz - arXiv preprint arXiv:2310.10955, 2023 - arxiv.org

The impressive success of recent deep neural network (DNN)-based systems is significantly
influenced by the high-quality datasets used in training. However, the effects of the datasets …

Cited by 1 · Related articles · All 4 versions

Less than Necessary or More than Sufficient: Validating Probing Dataset Size

E Orlov, O Serikov - International Conference on Analysis of Images …, 2023 - Springer

The vast body of research is dedicated to interpreting language models, particularly probing
them for linguistic properties. As in many other NLP fields, probing works tend to reuse …

Methods and Applications for Probing Deep Neural Networks

Z Zhu - 2024 - search.proquest.com

In recent years, the impressive abilities shown by deep neural network (DNN)-based
systems have led to the curiosity towards the intrinsic mechanisms. The query towards these …
