Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in Cognitive …, 2024 - cell.com
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

Language model behavior: A comprehensive survey

TA Chang, BK Bergen - Computational Linguistics, 2024 - direct.mit.edu
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …

Driving and suppressing the human language network using large language models

G Tuckute, A Sathe, S Srikant, M Taliaferro… - Nature Human …, 2024 - nature.com
Transformer models such as GPT generate human-like language and are predictive of
human brain responses to language. Here, using functional-MRI-measured brain responses …

The breakthrough of large language models release for medical applications: 1-year timeline and perspectives

M Cascella, F Semeraro, J Montomoli, V Bellini… - Journal of Medical …, 2024 - Springer
Within the domain of Natural Language Processing (NLP), Large Language Models (LLMs)
represent sophisticated models engineered to comprehend, generate, and manipulate text …

Understanding natural language understanding systems

A Lenci - Sistemi intelligenti, 2023 - rivisteweb.it
The development of machines that “talk like us”, also known as Natural Language
Understanding (NLU) systems, is the Holy Grail of Artificial Intelligence (AI), since language …

A better way to do masked language model scoring

C Kauf, A Ivanova - arXiv preprint arXiv:2305.10588, 2023 - arxiv.org
Estimating the log-likelihood of a given sentence under an autoregressive language model
is straightforward: one can simply apply the chain rule and sum the log-likelihood values for …

When language models fall in love: Animacy processing in transformer language models

M Hanna, Y Belinkov, S Pezzelle - arXiv preprint arXiv:2310.15004, 2023 - arxiv.org
Animacy, whether an entity is alive and sentient, is fundamental to cognitive processing,
impacting areas such as memory, vision, and language. However, animacy is not always …

Comparing Plausibility Estimates in Base and Instruction-Tuned Large Language Models

C Kauf, E Chersoni, A Lenci, E Fedorenko… - arXiv preprint arXiv …, 2024 - arxiv.org
Instruction-tuned LLMs can respond to explicit queries formulated as prompts, which greatly
facilitates interaction with human users. However, prompt-based approaches might not …

We Understand Elliptical Sentences, and Language Models should Too: A New Dataset for Studying Ellipsis and its Interaction with Thematic Fit

D Testa, E Chersoni, A Lenci - … of the 61st Annual Meeting of the …, 2023 - aclanthology.org
Ellipsis is a linguistic phenomenon characterized by the omission of one or more sentence
elements. Solving such a linguistic construction is not a trivial issue in natural language …

An LLM-Based Inventory Construction Framework of Urban Ground Collapse Events with Spatiotemporal Locations

Y Hao, J Qi, X Ma, S Wu, R Liu, X Zhang - ISPRS International Journal of …, 2024 - mdpi.com
Historical news media reports serve as a vital data source for understanding the risk of
urban ground collapse (UGC) events. At present, the application of large language models …