Smartplay: A benchmark for llms as intelligent agents

Y Wu, X Tang, TM Mitchell, Y Li - arXiv preprint arXiv:2310.01557, 2023 - arxiv.org
Recent large language models (LLMs) have demonstrated great potential toward intelligent
agents and next-gen automation, but there currently lacks a systematic benchmark for …

SmartPlay: A Benchmark for LLMs as Intelligent Agents

Y Wu, X Tang, T Mitchell, Y Li - The Twelfth International Conference on … - openreview.net
Recent large language models (LLMs) have demonstrated great potential toward intelligent
agents and next-gen automation, but there currently lacks a systematic benchmark for …

SmartPlay: A Benchmark for LLMs as Intelligent Agents

Y Wu, X Tang, TM Mitchell, Y Li - arXiv e-prints, 2023 - ui.adsabs.harvard.edu
Recent large language models (LLMs) have demonstrated great potential toward intelligent
agents and next-gen automation, but there currently lacks a systematic benchmark for …

SmartPlay: A Benchmark for LLMs as Intelligent Agents

Y Wu, X Tang, T Mitchell, Y Li - Second Agent Learning in Open … - openreview.net
Recent large language models (LLMs) have demonstrated great potential toward intelligent
agents and next-gen automation, but there currently lacks a systematic benchmark for …