Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in Cognitive …, 2024 - cell.com
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

Language model behavior: A comprehensive survey

TA Chang, BK Bergen - Computational Linguistics, 2024 - direct.mit.edu
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …

Pre-trained models: Past, present and future

X Han, Z Zhang, N Ding, Y Gu, X Liu, Y Huo, J Qiu… - AI Open, 2021 - Elsevier
Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved
great success and become a milestone in the field of artificial intelligence (AI). Owing to …

A primer in BERTology: What we know about how BERT works

A Rogers, O Kovaleva, A Rumshisky - Transactions of the Association …, 2021 - direct.mit.edu
Transformer-based models have pushed the state of the art in many areas of NLP, but our
understanding of what is behind their success is still limited. This paper is the first survey of …

COGS: A compositional generalization challenge based on semantic interpretation

N Kim, T Linzen - Proceedings of the 2020 conference on …, 2020 - aclanthology.org
Natural language is characterized by compositionality: the meaning of a complex expression
is constructed from the meanings of its constituent parts. To facilitate the evaluation of the …

Experience grounds language

Y Bisk, A Holtzman, J Thomason, J Andreas… - arXiv preprint arXiv …, 2020 - arxiv.org
Language understanding research is held back by a failure to relate language to the
physical world it describes and to the social interactions it facilitates. Despite the incredible …

What artificial neural networks can tell us about human language acquisition

A Warstadt, SR Bowman - Algebraic structures in natural …, 2022 - taylorfrancis.com
Rapid progress in machine learning for natural language processing has the potential to
transform debates about how humans learn language. However, the learning environments …

A systematic assessment of syntactic generalization in neural language models

J Hu, J Gauthier, P Qian, E Wilcox, RP Levy - arXiv preprint arXiv …, 2020 - arxiv.org
While state-of-the-art neural network models continue to achieve lower perplexity scores on
language modeling benchmarks, it remains unknown whether optimizing for broad …

Syntactic structure from deep learning

T Linzen, M Baroni - Annual Review of Linguistics, 2021 - annualreviews.org
Modern deep neural networks achieve impressive performance in engineering applications
that require extensive linguistic skills, such as machine translation. This success has …

How can we accelerate progress towards human-like linguistic generalization?

T Linzen - arXiv preprint arXiv:2005.00955, 2020 - arxiv.org
This position paper describes and critiques the Pretraining-Agnostic Identically Distributed
(PAID) evaluation paradigm, which has become a central tool for measuring progress in …