Language model quality correlates with psychometric predictive power in multiple languages

EG Wilcox, CI Meister, R Cotterell… - Proceedings of the …, 2023 - research-collection.ethz.ch
Surprisal theory (Hale, 2001; Levy, 2008) posits that a word's reading time is proportional to
its surprisal (i.e., to its negative log probability given the preceding context). Since we are …
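
For reference, the surprisal invoked here has the standard form (notation ours, not quoted from the paper):

    s(w_t) = -\log p(w_t \mid w_{<t})

with reading time assumed proportional to s(w_t) under surprisal theory.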

Predictability in Language Comprehension: Prospects and Problems for Surprisal

A Staub - Annual Review of Linguistics, 2024 - annualreviews.org
Surprisal theory proposes that a word's predictability influences processing difficulty
because each word requires the comprehender to update a probability distribution over …
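
The belief-update view mentioned in the snippet is standardly grounded in Levy's (2008) identity: a word's surprisal equals the KL divergence between the comprehender's distributions over interpretations after and before that word (a known result restated here, not quoted from the review):

    s(w_t) = D_{\mathrm{KL}}\big(p(\cdot \mid w_{\le t}) \,\|\, p(\cdot \mid w_{<t})\big)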

Large-scale benchmark yields no evidence that language model surprisal explains syntactic disambiguation difficulty

KJ Huang, S Arehalli, M Kugemoto, C Muxica… - Journal of Memory and …, 2024 - Elsevier
Prediction has been proposed as an overarching principle that explains human information
processing in language and beyond. To what degree can processing difficulty in …

The linearity of the effect of surprisal on reading times across languages

W Xu, J Chon, T Liu, R Futrell - Findings of the Association for …, 2023 - aclanthology.org
In psycholinguistics, surprisal theory posits that the amount of online processing effort
expended by a human comprehender per word positively correlates with the surprisal of that …
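
The linearity at issue is whether the linking function from surprisal to reading time is affine; in the standard schematic form (an illustrative regression, not the paper's exact model):

    \mathrm{RT}(w_t) = \alpha + \beta \, s(w_t) + \varepsilon, \qquad s(w_t) = -\log_2 p(w_t \mid w_{<t})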

Word frequency and predictability dissociate in naturalistic reading

C Shain - Open Mind, 2024 - direct.mit.edu
Many studies of human language processing have shown that readers slow down at less
frequent or less predictable words, but there is debate about whether frequency and …
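
The frequency/predictability dissociation is typically tested by entering both as predictors of the same reading times and asking whether each explains unique variance; schematically (illustrative notation, not Shain's exact model):

    \mathrm{RT}(w_t) = \alpha + \beta_{\mathrm{freq}} \,(-\log p(w_t)) + \beta_{\mathrm{surp}} \,(-\log p(w_t \mid w_{<t})) + \varepsilon

where the first term is unigram (frequency) information and the second is contextual surprisal.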

Frequency Explains the Inverse Correlation of Large Language Models' Size, Training Data Amount, and Surprisal's Fit to Reading Times

BD Oh, S Yue, W Schuler - arXiv preprint arXiv:2402.02255, 2024 - arxiv.org
Recent studies have shown that as Transformer-based language models become larger and
are trained on very large amounts of data, the fit of their surprisal estimates to naturalistic …

Psychometric predictive power of large language models

T Kuribayashi, Y Oseki, T Baldwin - arXiv preprint arXiv:2311.07484, 2023 - arxiv.org
Next-word probabilities from language models have been shown to successfully simulate
human reading behavior. Building on this, we show that, interestingly, instruction-tuned …
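
Psychometric predictive power is usually measured by extracting per-word surprisals from a language model and testing how much they improve a reading-time regression. A minimal sketch of the surprisal-extraction step, assuming the Hugging Face transformers library and GPT-2 (illustrative only, not the paper's code):

    import math
    import torch
    from transformers import AutoTokenizer, AutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    model.eval()

    def token_surprisals(text: str):
        # Surprisal (in bits) of each token given its preceding context.
        ids = tokenizer(text, return_tensors="pt").input_ids
        with torch.no_grad():
            logits = model(ids).logits
        # Position t's logits predict token t+1, so align logits[:-1] with ids[1:].
        logprobs = torch.log_softmax(logits[0, :-1], dim=-1)
        targets = ids[0, 1:]
        nats = -logprobs[torch.arange(targets.size(0)), targets]
        bits = nats / math.log(2.0)
        return list(zip(tokenizer.convert_ids_to_tokens(targets.tolist()), bits.tolist()))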

Leading Whitespaces of Language Models' Subword Vocabularies Pose a Confound for Calculating Word Probabilities

BD Oh, W Schuler - arXiv preprint arXiv:2406.10851, 2024 - arxiv.org
Word-by-word conditional probabilities from Transformer-based language models are
increasingly being used to evaluate their predictions over minimal pairs or to model the …
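
The confound: BPE vocabularies like GPT-2's attach each word's leading space to its first subword (the "Ġ" marker), so the probability of the whitespace is folded into the following word when subword surprisals are summed by the chain rule. A sketch of that standard aggregation, consuming the output of a token-level surprisal function like the one sketched above (illustrative, not the paper's code):

    def word_surprisals(token_surprisals):
        # token_surprisals: list of (subword, surprisal) pairs, GPT-2 style,
        # where a leading "Ġ" marks the whitespace before a new word.
        # Chain rule: a word's surprisal is the sum of its subwords' surprisals.
        # Note the whitespace probability is charged to the *next* word,
        # which is the confound this paper identifies.
        words, current, total = [], "", 0.0
        for tok, s in token_surprisals:
            if tok.startswith("Ġ") and current:
                words.append((current, total))
                current, total = "", 0.0
            current += tok.lstrip("Ġ")
            total += s
        if current:
            words.append((current, total))
        return words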

Word length and frequency effects on text reading are highly similar in 12 alphabetic languages

V Kuperman, S Schroeder, D Gnetov - Journal of Memory and Language, 2024 - Elsevier
Reading research robustly finds that shorter and more frequent words are recognized faster
and skipped more often than longer and less frequent words. An empirical question that has …

Temperature-Scaling Surprisal Estimates Improve Fit to Human Reading Times – But Does It Do So for the "Right Reasons"?

T Liu, I Škrjanec, V Demberg - ICLR 2024 Workshop on …, 2024 - openreview.net
A wide body of evidence shows that human language processing difficulty is predicted by
the information-theoretic measure surprisal, a word's negative log probability in context …
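
Temperature scaling divides the model's logits z by a temperature T > 0 before the softmax, flattening (T > 1) or sharpening (T < 1) the predictive distribution; the rescaled surprisal is then (standard definition, notation not from the paper):

    p_T(w \mid c) = \frac{\exp(z_w / T)}{\sum_{w'} \exp(z_{w'} / T)}, \qquad s_T(w) = -\log p_T(w \mid c)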