Implicit representations of meaning in neural language models

J Kaddour, J Harris, M Mozes, H Bradley… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

被引用次数：451 相关文章所有 3 个版本

[PDF] arxiv.org

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

被引用次数：206 相关文章所有 3 个版本

[PDF] github.io

The rise and potential of large language model based agents: A survey

Z Xi, W Chen, X Guo, W He, Y Ding, B Hong… - arXiv preprint arXiv …, 2023 - arxiv.org

For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing
the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are …

被引用次数：672 相关文章所有 4 个版本

[PDF] arxiv.org

Towards reasoning in large language models: A survey

J Huang, KCC Chang - arXiv preprint arXiv:2212.10403, 2022 - arxiv.org

Reasoning is a fundamental aspect of human intelligence that plays a crucial role in
activities such as problem solving, decision making, and critical thinking. In recent years …

被引用次数：629 相关文章所有 6 个版本

[PDF] acm.org Full View

Talking about large language models

M Shanahan - Communications of the ACM, 2024 - dl.acm.org

Talking about Large Language Models Page 1 key insights ˽ As LLMs become more powerful,
it becomes increasingly tempting to describe LLM-based dialog agents in human-like terms …

被引用次数：369 相关文章所有 5 个版本

[PDF] pnas.org Full View

The debate over understanding in AI's large language models

M Mitchell, DC Krakauer - Proceedings of the National …, 2023 - National Acad Sciences

We survey a current, heated debate in the artificial intelligence (AI) research community on
whether large pretrained language models can be said to understand language—and the …

被引用次数：267 相关文章所有 9 个版本

[PDF] openreview.net

Large language models still can't plan (a benchmark for LLMs on planning and reasoning about change)

K Valmeekam, A Olmo, S Sreedharan… - … Models for Decision …, 2022 - openreview.net

Recent advances in large language models (LLMs) have transformed the field of natural
language processing (NLP). From GPT-3 to PaLM, the state-of-the-art performance on …

被引用次数：321 相关文章所有 2 个版本

[PDF] mlr.press

Language models as zero-shot planners: Extracting actionable knowledge for embodied agents

W Huang, P Abbeel, D Pathak… - … conference on machine …, 2022 - proceedings.mlr.press

Can world knowledge learned by large language models (LLMs) be used to act in
interactive environments? In this paper, we investigate the possibility of grounding high-level …

被引用次数：1032 相关文章所有 5 个版本

[PDF] arxiv.org

Program synthesis with large language models

J Austin, A Odena, M Nye, M Bosma… - arXiv preprint arXiv …, 2021 - arxiv.org

This paper explores the limits of the current generation of large language models for
program synthesis in general purpose programming languages. We evaluate a collection of …

被引用次数：1327 相关文章所有 3 个版本

[PDF] arxiv.org

Show your work: Scratchpads for intermediate computation with language models

M Nye, AJ Andreassen, G Gur-Ari… - arXiv preprint arXiv …, 2021 - arxiv.org

Large pre-trained language models perform remarkably well on tasks that can be done" in
one pass", such as generating realistic text or synthesizing computer programs. However …

被引用次数：570 相关文章所有 5 个版本

高级搜索

QQ 群