From llms to llm-based agents for software engineering: A survey of current, challenges and future

H Jin, L Huang, H Cai, J Yan, B Li, H Chen - arXiv preprint arXiv …, 2024 - arxiv.org
With the rise of large language models (LLMs), researchers are increasingly exploring their
applications in var ious vertical domains, such as software engineering. LLMs have …

System for systematic literature review using multiple ai agents: Concept and an empirical evaluation

AM Sami, Z Rasheed, KK Kemell, M Waseem… - arXiv preprint arXiv …, 2024 - arxiv.org
Systematic Literature Reviews (SLRs) have become the foundation of evidence-based
studies, enabling researchers to identify, classify, and combine existing studies based on …

Large language model-based agents for software engineering: A survey

J Liu, K Wang, Y Chen, X Peng, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI
agents, ie, LLM-based agents. Compared to standalone LLMs, LLM-based agents …

Ldb: A large language model debugger via verifying runtime execution step-by-step

L Zhong, Z Wang, J Shang - arXiv preprint arXiv:2402.16906, 2024 - arxiv.org
Large language models (LLMs) are leading significant progress in code generation. Beyond
one-pass code generation, recent works further integrate unit tests and program verifiers into …

Large language model evaluation via multi ai agents: Preliminary results

Z Rasheed, M Waseem, K Systä… - arXiv preprint arXiv …, 2024 - arxiv.org
As Large Language Models (LLMs) have become integral to both research and daily
operations, rigorous evaluation is crucial. This assessment is important not only for …

Dreamfactory: Pioneering multi-scene long video generation with a multi-agent framework

Z Xie, D Tang, D Tan, J Klein, TF Bissyand… - arXiv preprint arXiv …, 2024 - arxiv.org
Current video generation models excel at creating short, realistic clips, but struggle with
longer, multi-scene videos. We introduce\texttt {DreamFactory}, an LLM-based framework …

FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models

H Yang, B Zhang, N Wang, C Guo, X Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
As financial institutions and professionals increasingly incorporate Large Language Models
(LLMs) into their workflows, substantial barriers, including proprietary data and specialized …

SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text

R Ghosh, T Yao, L Chen, S Hasan, T Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Model (LLM) integrations into applications like Microsoft365 suite and
Google Workspace for creating/processing documents, emails, presentations, etc. has led to …

Benchmarking the Communication Competence of Code Generation for LLMs and LLM Agent

JJW Wu, FH Fard - arXiv preprint arXiv:2406.00215, 2024 - arxiv.org
Large language models (LLMs) have significantly improved their ability to perform tasks in
the field of code generation. However, there is still a gap between LLMs being capable …

Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes

L Lemner, L Wahlgren, G Gay, N Mohammadiha… - arXiv preprint arXiv …, 2024 - arxiv.org
Much of the cost and effort required during the software testing process is invested in
performing test maintenance-the addition, removal, or modification of test cases to keep the …