Tool learning with foundation models

Y Qin, S Hu, Y Lin, W Chen, N Ding, G Cui… - ACM Computing …, 2024 - dl.acm.org
Humans possess an extraordinary ability to create and utilize tools. With the advent of
foundation models, artificial intelligence systems have the potential to be equally adept in …

Agentic AI: Autonomous Intelligence for Complex Goals–A Comprehensive Survey

DB Acharya, K Kuppan, B Divya - IEEE Access, 2025 - ieeexplore.ieee.org
Agentic AI, an emerging paradigm in artificial intelligence, refers to autonomous systems
designed to pursue complex goals with minimal human intervention. Unlike traditional AI …

Unifying large language models and knowledge graphs: A roadmap

S Pan, L Luo, Y Wang, C Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Large language models (LLMs), such as ChatGPT and GPT4, are making new waves in the
field of natural language processing and artificial intelligence, due to their emergent ability …

ReST-MCTS*: LLM self-training via process reward guided tree search

D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent methodologies in LLM self-training mostly rely on LLM generating responses and
filtering those with correct output answers as training data. This approach often yields a low …

Agent Lumos: Unified and modular training for open-source language agents

D Yin, F Brahman, A Ravichander… - Proceedings of the …, 2024 - aclanthology.org
Closed-source agents suffer from several issues such as a lack of affordability, transparency,
and reproducibility, particularly on complex interactive tasks. This motivates the …

VisualWebArena: Evaluating multimodal agents on realistic visual web tasks

JY Koh, R Lo, L Jang, V Duvvur, MC Lim… - arXiv preprint arXiv …, 2024 - arxiv.org
Autonomous agents capable of planning, reasoning, and executing actions on the web offer
a promising avenue for automating computer tasks. However, the majority of existing …

Lumos: Learning agents with unified data, modular design, and open-source LLMs

D Yin, F Brahman, A Ravichander… - ICLR 2024 Workshop …, 2023 - openreview.net
We introduce Lumos, a novel framework for training language agents that employs a unified
data format and a modular architecture based on open-source large language models …

ChatGPT's one-year anniversary: Are open-source large language models catching up?

H Chen, F Jiao, X Li, C Qin, M Ravaut, R Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Upon its release in late 2022, ChatGPT has brought a seismic shift in the entire landscape of
AI, both in research and commerce. Through instruction-tuning a large language model …

APIGen: Automated pipeline for generating verifiable and diverse function-calling datasets

Z Liu, T Hoang, J Zhang, M Zhu, T Lan… - arXiv preprint arXiv …, 2024 - arxiv.org
The advancement of function-calling agent models requires diverse, reliable, and high-quality
datasets. This paper presents APIGen, an automated data generation pipeline …

Understanding the planning of LLM agents: A survey

X Huang, W Liu, X Chen, X Wang, H Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
As Large Language Models (LLMs) have shown significant intelligence, the progress to
leverage LLMs as planning modules of autonomous agents has attracted more attention …