Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends

P Wang, W Lu, C Lu, R Zhou, M Li… - Big Data Mining and …, 2025 - ieeexplore.ieee.org
The advent of Large Language Models (LLMs) has sparked considerable interest in the
medical image domain, as they can generalize to multiple tasks and offer outstanding …

Toolace: Winning the points of llm function calling

W Liu, X Huang, X Zeng, X Hao, S Yu, D Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Function calling significantly extends the application boundary of large language models,
where high-quality and diverse training data is critical for unlocking this capability. However …

Towards secure tuning: Mitigating security risks arising from benign instruction fine-tuning

Y Du, S Zhao, J Cao, M Ma, D Zhao, F Fan… - arXiv preprint arXiv …, 2024 - arxiv.org
Instruction Fine-Tuning (IFT) has become an essential method for adapting base Large
Language Models (LLMs) into variants for professional and private use. However …

CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation

I Ziegler, A Köksal, D Elliott, H Schütze - arXiv preprint arXiv:2409.02098, 2024 - arxiv.org
Building high-quality datasets for specialized tasks is a time-consuming and resource-
intensive process that often requires specialized domain knowledge. We propose Corpus …

Reinforcement Learning for Long-Horizon Interactive LLM Agents

K Chen, M Cusumano-Towner, B Huval… - arXiv preprint arXiv …, 2025 - arxiv.org
Interactive digital agents (IDAs) leverage APIs of stateful digital environments to perform
tasks in response to user requests. While IDAs powered by instruction-tuned large language …

Automated test generation to evaluate tool-augmented LLMs as conversational AI agents

S Arcadinho, D Aparício, M Almeida - arXiv preprint arXiv:2409.15934, 2024 - arxiv.org
Tool-augmented LLMs are a promising approach to create AI agents that can have realistic
conversations, follow procedures, and call appropriate functions. However, evaluating them …

Multi-Agent Collaboration Mechanisms: A Survey of LLMs

KT Tran, D Dao, MD Nguyen, QV Pham… - arXiv preprint arXiv …, 2025 - arxiv.org
With recent advances in Large Language Models (LLMs), Agentic AI has become
phenomenal in real-world applications, moving toward multiple LLM-based agents to …

Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications

Y Wu, Z Yang, Y Shen, M Backes, Y Zhang - arXiv preprint arXiv …, 2025 - arxiv.org
Large language models (LLMs) have facilitated the generation of high-quality, cost-effective
synthetic data for developing downstream models and conducting statistical analyses in …

MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning

P Yang, B Dong - arXiv preprint arXiv:2501.01834, 2025 - arxiv.org
Image captioning is a critical task at the intersection of computer vision and natural language
processing, with wide-ranging applications across various domains. For complex tasks such …

BENCHAGENTS: Automated Benchmark Creation with Agent Interaction

N Butt, V Chandrasekaran, N Joshi, B Nushi… - arXiv preprint arXiv …, 2024 - arxiv.org
Evaluations are limited by benchmark availability. As models evolve, there is a need to
create benchmarks that can measure progress on new generative capabilities. However …