Gpt is becoming a turing machine: Here are some ways to program it

S Hao, Y Gu, H Ma, JJ Hong, Z Wang, DZ Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

Large language models (LLMs) have shown remarkable reasoning capabilities, especially
when prompted to generate intermediate reasoning steps (eg, Chain-of-Thought, CoT) …

被引用次数：217 相关文章所有 7 个版本

[PDF] neurips.cc

Evaluating cognitive maps and planning in large language models with CogEval

I Momennejad, H Hasanbeig… - Advances in …, 2024 - proceedings.neurips.cc

Recently an influx of studies claims emergent cognitive abilities in large language models
(LLMs). Yet, most rely on anecdotes, overlook contamination of training sets, or lack …

被引用次数：29 相关文章所有 6 个版本

[PDF] arxiv.org

Promptagent: Strategic planning with language models enables expert-level prompt optimization

X Wang, C Li, Z Wang, F Bai, H Luo, J Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Highly effective, task-specific prompts are often heavily engineered by experts to integrate
detailed instructions and domain insights based on a deep understanding of both instincts of …

被引用次数：29 相关文章所有 3 个版本

[PDF] arxiv.org

Code simulation challenges for large language models

E La Malfa, C Weinhuber, O Torre, F Lin… - arXiv preprint arXiv …, 2024 - arxiv.org

We investigate the extent to which Large Language Models (LLMs) can simulate the
execution of computer code and algorithms. We begin by looking straight line programs, and …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

CoRE: LLM as Interpreter for Natural Language Programming, Pseudo-Code Programming, and Flow Programming of AI Agents

S Xu, Z Li, K Mei, Y Zhang - arXiv preprint arXiv:2405.06907, 2024 - arxiv.org

Since their inception, programming languages have trended towards greater readability and
lower barriers for programmers. Following this trend, natural language can be a promising …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

AutoGRAMS: Autonomous Graphical Agent Modeling Software

B Krause, L Chen, E Kahembwe - arXiv preprint arXiv:2407.10049, 2024 - arxiv.org

We introduce the AutoGRAMS framework for programming multi-step interactions with
language models. AutoGRAMS represents AI agents as a graph, where each node can …

被引用次数：1 相关文章所有 2 个版本

Comparing the performance of GPT-3 with BERT for decision requirements modeling

A Goossens, J De Smedt, J Vanthienen - International Conference on …, 2023 - Springer

Operational decisions such as loan or subsidy allocation are taken with high frequency and
require a consistent decision quality which decision models can ensure. Decision models …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

被引用次数：3 相关文章

高级搜索

QQ 群