Taskweaver: A code-first agent framework

Z Wu, C Han, Z Ding, Z Weng, Z Liu, S Yao… - arXiv preprint arXiv …, 2024 - arxiv.org

Autonomous interaction with the computer has been a longstanding challenge with great
potential, and the recent proliferation of large language models (LLMs) has markedly …

被引用次数：14 相关文章所有 3 个版本

[PDF] arxiv.org

When llms meet cybersecurity: A systematic literature review

J Zhang, H Bu, H Wen, Y Chen, L Li, H Zhu - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid advancements in large language models (LLMs) have opened new avenues
across various fields, including cybersecurity, which faces an ever-evolving threat landscape …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Executable code actions elicit better llm agents

X Wang, Y Chen, L Yuan, Y Zhang, Y Li… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Model (LLM) agents, capable of performing a broad range of actions, such
as invoking tools and controlling robots, show great potential in tackling real-world …

被引用次数：15 相关文章所有 4 个版本

[PDF] arxiv.org

Ufo: A ui-focused agent for windows os interaction

C Zhang, L Li, S He, X Zhang, B Qiao, S Qin… - arXiv preprint arXiv …, 2024 - arxiv.org

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to
applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

Infiagent-dabench: Evaluating agents on data analysis tasks

X Hu, Z Zhao, S Wei, Z Chai, G Wang, X Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

In this paper, we introduce" InfiAgent-DABench", the first benchmark specifically designed to
evaluate LLM-based agents in data analysis tasks. This benchmark contains DAEval, a …

被引用次数：8 相关文章所有 4 个版本

[PDF] arxiv.org

AllHands: Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models

C Zhang, Z Ma, Y Wu, S He, S Qin, M Ma, X Qin… - arXiv preprint arXiv …, 2024 - arxiv.org

Verbatim feedback constitutes a valuable repository of user experiences, opinions, and
requirements essential for software development. Effectively and efficiently extracting …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies

B Weng - arXiv preprint arXiv:2404.09022, 2024 - arxiv.org

With the surge of ChatGPT, the use of large models has significantly increased, rapidly rising
to prominence across the industry and sweeping across the internet. This article is a …

Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments

S Cheng, Z Zhuang, Y Xu, F Yang, C Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Models (LLMs) have shown potential in reasoning over structured
environments, eg, knowledge graph and table. Such tasks typically require multi-hop …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot

Z Song, Y Li, M Fang, Z Chen, Z Shi… - arXiv preprint arXiv …, 2024 - arxiv.org

Autonomous virtual agents are often limited by their singular mode of interaction with real-
world environments, restricting their versatility. To address this, we propose the Multi-Modal …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

H Yang, B Zhang, N Wang, C Guo, X Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

As financial institutions and professionals increasingly incorporate Large Language Models
(LLMs) into their workflows, substantial barriers, including proprietary data and specialized …

高级搜索

QQ 群