A Survey of Probing-Based Interpretability Methods in Natural Language Processing

Ju Tianjie, Liu Gongshen, Zhang Zhuosheng, Zhang Ru - Chinese Journal of Computers (计算机学报), 2024 - cjc.ict.ac.cn
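Abstract: With the widespread adoption of large-scale pre-trained models, many areas of natural
language processing (such as text classification and machine translation) have made considerable
progress. However, owing to the "black-box" nature of pre-trained models, their internal decision patterns and the knowledge they encode …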

Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models

A Waldis, Y Perlitz, L Choshen, Y Hou… - Transactions of the …, 2024 - direct.mit.edu
We introduce Holmes, a new benchmark designed to assess language models' (LMs')
linguistic competence—their unconscious understanding of linguistic phenomena …

Predicting fine-tuning performance with probing

Z Zhu, S Shahtalebi, F Rudzicz - arXiv preprint arXiv:2210.07352, 2022 - arxiv.org
Large NLP models have recently shown impressive performance in language
understanding tasks, typically evaluated by their fine-tuned performance. Alternatively …

A State-Vector Framework for Dataset Effects

E Sahak, Z Zhu, F Rudzicz - arXiv preprint arXiv:2310.10955, 2023 - arxiv.org
The impressive success of recent deep neural network (DNN)-based systems is significantly
influenced by the high-quality datasets used in training. However, the effects of the datasets …

Less than Necessary or More than Sufficient: Validating Probing Dataset Size

E Orlov, O Serikov - International Conference on Analysis of Images …, 2023 - Springer
The vast body of research is dedicated to interpreting language models, particularly probing
them for linguistic properties. As in many other NLP fields, probing works tend to reuse …

Methods and Applications for Probing Deep Neural Networks

Z Zhu - 2024 - search.proquest.com
In recent years, the impressive abilities shown by deep neural network (DNN)-based
systems have led to the curiosity towards the intrinsic mechanisms. The query towards these …