Os-copilot: Towards generalist computer agents with self-improvement

Z Wu, C Han, Z Ding, Z Weng, Z Liu, S Yao… - arXiv preprint arXiv …, 2024 - arxiv.org
Autonomous interaction with the computer has been a longstanding challenge with great
potential, and the recent proliferation of large language models (LLMs) has markedly …

When llms meet cybersecurity: A systematic literature review

J Zhang, H Bu, H Wen, Y Chen, L Li, H Zhu - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid advancements in large language models (LLMs) have opened new avenues
across various fields, including cybersecurity, which faces an ever-evolving threat landscape …

Executable code actions elicit better llm agents

X Wang, Y Chen, L Yuan, Y Zhang, Y Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Model (LLM) agents, capable of performing a broad range of actions, such
as invoking tools and controlling robots, show great potential in tackling real-world …

Ufo: A ui-focused agent for windows os interaction

C Zhang, L Li, S He, X Zhang, B Qiao, S Qin… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to
applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a …

Infiagent-dabench: Evaluating agents on data analysis tasks

X Hu, Z Zhao, S Wei, Z Chai, G Wang, X Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce" InfiAgent-DABench", the first benchmark specifically designed to
evaluate LLM-based agents in data analysis tasks. This benchmark contains DAEval, a …

AllHands: Ask Me Anything on Large-scale Verbatim Feedback via Large Language Models

C Zhang, Z Ma, Y Wu, S He, S Qin, M Ma, X Qin… - arXiv preprint arXiv …, 2024 - arxiv.org
Verbatim feedback constitutes a valuable repository of user experiences, opinions, and
requirements essential for software development. Effectively and efficiently extracting …

Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies

B Weng - arXiv preprint arXiv:2404.09022, 2024 - arxiv.org
With the surge of ChatGPT, the use of large models has significantly increased, rapidly rising
to prominence across the industry and sweeping across the internet. This article is a …

Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments

S Cheng, Z Zhuang, Y Xu, F Yang, C Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have shown potential in reasoning over structured
environments, eg, knowledge graph and table. Such tasks typically require multi-hop …

MMAC-Copilot: Multi-modal Agent Collaboration Operating System Copilot

Z Song, Y Li, M Fang, Z Chen, Z Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
Autonomous virtual agents are often limited by their singular mode of interaction with real-
world environments, restricting their versatility. To address this, we propose the Multi-Modal …

FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

H Yang, B Zhang, N Wang, C Guo, X Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
As financial institutions and professionals increasingly incorporate Large Language Models
(LLMs) into their workflows, substantial barriers, including proprietary data and specialized …