Towards understanding how transformer perform multi-step reasoning with matching operation

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

Towards understanding how transformer perform multi-step reasoning with matching operation

在引用文章中搜索

[PDF] arxiv.org

A mechanistic interpretation of syllogistic reasoning in auto-regressive language models

G Kim, M Valentino, A Freitas - arXiv preprint arXiv:2408.08590, 2024 - arxiv.org

Recent studies on logical reasoning in auto-regressive Language Models (LMs) have
sparked a debate on whether such models can learn systematic reasoning principles during …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

BK Chen, T Hu, H Jin, HK Lee, K Kawaguchi - arXiv preprint arXiv …, 2024 - arxiv.org

In-Context Learning (ICL) has been a powerful emergent property of large language models
that has attracted increasing attention in recent years. In contrast to regular gradient-based …

Transformers Provably Solve Parity Efficiently with Chain of Thought

J Kim, T Suzuki - arXiv preprint arXiv:2410.08633, 2024 - arxiv.org

This work provides the first theoretical analysis of training transformers to solve complex
problems by recursively generating intermediate states, analogous to fine-tuning for chain-of …

高级搜索

QQ 群

Towards understanding how transformer perform multi-step reasoning with matching operation

A mechanistic interpretation of syllogistic reasoning in auto-regressive language models

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

Transformers Provably Solve Parity Efficiently with Chain of Thought

引用