Despite the success in specific tasks and scenarios, existing foundation agents, empowered by large models (LMs) and advanced tools, still cannot generalize to different scenarios …
C Rawles, S Clinckemaillie, Y Chang, J Waltz… - arXiv preprint arXiv …, 2024 - arxiv.org
Autonomous agents that execute human tasks by controlling computers can enhance human productivity and application accessibility. Yet, progress in this field will be driven by …
M Wornow, A Narayan, K Opsahl-Ong… - arXiv preprint arXiv …, 2024 - arxiv.org
Automating enterprise workflows could unlock $4 trillion/year in productivity gains. Despite being of interest to the data management community for decades, the ultimate vision of end …
Leveraging multiple large language model (LLM) agents has shown to be a promising approach for tackling complex tasks, while the effective design of multiple agents for a …
Y Wang, Z Wu, J Yao, J Su - arXiv preprint arXiv:2402.10178, 2024 - arxiv.org
The emergence of Large Language Models (LLMs) like ChatGPT has inspired the development of LLM-based agents capable of addressing complex, real-world tasks …
L Zheng, Z Huang, Z Xue, X Wang, B An… - arXiv preprint arXiv …, 2024 - arxiv.org
Creating autonomous virtual agents capable of using arbitrary software on any digital device remains a major challenge for artificial intelligence. Two key obstacles hinder progress …
One of the primary driving forces contributing to the superior performance of Large Language Models (LLMs) is the extensive availability of human-annotated natural language …
A Liao, N Tomlin, D Klein - arXiv preprint arXiv:2406.18872, 2024 - arxiv.org
Game-playing agents like AlphaGo have achieved superhuman performance through self- play, which is theoretically guaranteed to yield optimal policies in competitive games …
M Wornow, A Narayan, B Viggiano, IS Khare… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing ML benchmarks lack the depth and diversity of annotations needed for evaluating models on business process management (BPM) tasks. BPM is the practice of documenting …