Agentinstruct: Toward generative teaching with agentic flows

P Wang, W Lu, C Lu, R Zhou, M Li… - Big Data Mining and …, 2025 - ieeexplore.ieee.org

The advent of Large Language Models (LLMs) has sparked considerable interest in the
medical image domain, as they can generalize to multiple tasks and offer outstanding …

[PDF] arxiv.org

Toolace: Winning the points of llm function calling

W Liu, X Huang, X Zeng, X Hao, S Yu, D Li… - arXiv preprint arXiv …, 2024 - arxiv.org

Function calling significantly extends the application boundary of large language models,
where high-quality and diverse training data is critical for unlocking this capability. However …

被引用次数：9 相关文章

[PDF] arxiv.org

Towards secure tuning: Mitigating security risks arising from benign instruction fine-tuning

Y Du, S Zhao, J Cao, M Ma, D Zhao, F Fan… - arXiv preprint arXiv …, 2024 - arxiv.org

Instruction Fine-Tuning (IFT) has become an essential method for adapting base Large
Language Models (LLMs) into variants for professional and private use. However …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation

I Ziegler, A Köksal, D Elliott, H Schütze - arXiv preprint arXiv:2409.02098, 2024 - arxiv.org

Building high-quality datasets for specialized tasks is a time-consuming and resource-
intensive process that often requires specialized domain knowledge. We propose Corpus …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Reinforcement Learning for Long-Horizon Interactive LLM Agents

K Chen, M Cusumano-Towner, B Huval… - arXiv preprint arXiv …, 2025 - arxiv.org

Interactive digital agents (IDAs) leverage APIs of stateful digital environments to perform
tasks in response to user requests. While IDAs powered by instruction-tuned large language …

[PDF] arxiv.org

Automated test generation to evaluate tool-augmented LLMs as conversational AI agents

S Arcadinho, D Aparício, M Almeida - arXiv preprint arXiv:2409.15934, 2024 - arxiv.org

Tool-augmented LLMs are a promising approach to create AI agents that can have realistic
conversations, follow procedures, and call appropriate functions. However, evaluating them …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Multi-Agent Collaboration Mechanisms: A Survey of LLMs

KT Tran, D Dao, MD Nguyen, QV Pham… - arXiv preprint arXiv …, 2025 - arxiv.org

With recent advances in Large Language Models (LLMs), Agentic AI has become
phenomenal in real-world applications, moving toward multiple LLM-based agents to …

[PDF] arxiv.org

Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications

Y Wu, Z Yang, Y Shen, M Backes, Y Zhang - arXiv preprint arXiv …, 2025 - arxiv.org

Large language models (LLMs) have facilitated the generation of high-quality, cost-effective
synthetic data for developing downstream models and conducting statistical analyses in …

[PDF] arxiv.org

MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning

P Yang, B Dong - arXiv preprint arXiv:2501.01834, 2025 - arxiv.org

Image captioning is a critical task at the intersection of computer vision and natural language
processing, with wide-ranging applications across various domains. For complex tasks such …

[PDF] arxiv.org

BENCHAGENTS: Automated Benchmark Creation with Agent Interaction

N Butt, V Chandrasekaran, N Joshi, B Nushi… - arXiv preprint arXiv …, 2024 - arxiv.org

Evaluations are limited by benchmark availability. As models evolve, there is a need to
create benchmarks that can measure progress on new generative capabilities. However …

高级搜索

QQ 群