Reasoning with language model is planning with world model

S Hao, Y Gu, H Ma, JJ Hong, Z Wang, DZ Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have shown remarkable reasoning capabilities, especially
when prompted to generate intermediate reasoning steps (eg, Chain-of-Thought, CoT) …

Evaluating cognitive maps and planning in large language models with CogEval

I Momennejad, H Hasanbeig… - Advances in …, 2024 - proceedings.neurips.cc
Recently an influx of studies claims emergent cognitive abilities in large language models
(LLMs). Yet, most rely on anecdotes, overlook contamination of training sets, or lack …

Promptagent: Strategic planning with language models enables expert-level prompt optimization

X Wang, C Li, Z Wang, F Bai, H Luo, J Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Highly effective, task-specific prompts are often heavily engineered by experts to integrate
detailed instructions and domain insights based on a deep understanding of both instincts of …

Code simulation challenges for large language models

E La Malfa, C Weinhuber, O Torre, F Lin… - arXiv preprint arXiv …, 2024 - arxiv.org
We investigate the extent to which Large Language Models (LLMs) can simulate the
execution of computer code and algorithms. We begin by looking straight line programs, and …

CoRE: LLM as Interpreter for Natural Language Programming, Pseudo-Code Programming, and Flow Programming of AI Agents

S Xu, Z Li, K Mei, Y Zhang - arXiv preprint arXiv:2405.06907, 2024 - arxiv.org
Since their inception, programming languages have trended towards greater readability and
lower barriers for programmers. Following this trend, natural language can be a promising …

AutoGRAMS: Autonomous Graphical Agent Modeling Software

B Krause, L Chen, E Kahembwe - arXiv preprint arXiv:2407.10049, 2024 - arxiv.org
We introduce the AutoGRAMS framework for programming multi-step interactions with
language models. AutoGRAMS represents AI agents as a graph, where each node can …

Comparing the performance of GPT-3 with BERT for decision requirements modeling

A Goossens, J De Smedt, J Vanthienen - International Conference on …, 2023 - Springer
Operational decisions such as loan or subsidy allocation are taken with high frequency and
require a consistent decision quality which decision models can ensure. Decision models …

Executing Natural Language-Described Algorithms with Large Language Models: An Investigation

X Zheng, Q Zhu, H Lin, Y Lu, X Han, L Sun - arXiv preprint arXiv …, 2024 - arxiv.org
Executing computer programs described in natural language has long been a pursuit of
computer science. With the advent of enhanced natural language understanding capabilities …

AutoFlow: Automated Workflow Generation for Large Language Model Agents

Z Li, S Xu, K Mei, W Hua, B Rama, O Raheja… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in Large Language Models (LLMs) have shown significant progress
in understanding complex natural language. One important application of LLM is LLM-based …

Integrating GPT-Technologies with Decision Models for Explainability

A Goossens, J Vanthienen - World Conference on Explainable Artificial …, 2023 - Springer
The ability to provide clear and transparent explanations for the outcome of a decision is
critical for gaining user trust and acceptance, particularly in areas such as healthcare …