Large legal fictions: Profiling legal hallucinations in large language models

M Dahl, V Magesh, M Suzgun… - Journal of Legal Analysis, 2024 - academic.oup.com
Do large language models (LLMs) know the law? LLMs are increasingly being used to
augment legal practice, education, and research, yet their revolutionary potential is …

AI-Driven review systems: evaluating LLMs in scalable and bias-aware academic reviews

K Tyser, B Segev, G Longhitano, XY Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Automatic reviewing helps handle a large volume of papers, provides early feedback and
quality control, reduces bias, and allows the analysis of trends. We evaluate the alignment of …

Meta-prompting for automating zero-shot visual recognition with llms

MJ Mirza, L Karlinsky, W Lin, S Doveh… - … on Computer Vision, 2025 - Springer
Abstract Prompt ensembling of Large Language Model (LLM) generated category-specific
prompts has emerged as an effective method to enhance zero-shot recognition ability of …

Agentscope: A flexible yet robust multi-agent platform

D Gao, Z Li, X Pan, W Kuang, Z Ma, B Qian… - arXiv preprint arXiv …, 2024 - arxiv.org
With the rapid advancement of Large Language Models (LLMs), significant progress has
been made in multi-agent applications. However, the complexities in coordinating agents' …

A dynamic LLM-powered agent network for task-oriented agent collaboration

Z Liu, Y Zhang, P Li, Y Liu, D Yang - First Conference on Language …, 2024 - openreview.net
Recent studies show that collaborating multiple large language model (LLM) powered
agents is a promising way for task solving. However, current approaches are constrained by …

Agentstore: Scalable integration of heterogeneous agents as specialized generalist computer assistant

C Jia, M Luo, Z Dang, Q Sun, F Xu, J Hu, T Xie… - arXiv preprint arXiv …, 2024 - arxiv.org
Digital agents capable of automating complex computer tasks have attracted considerable
attention due to their immense potential to enhance human-computer interaction. However …

Instance-adaptive zero-shot chain-of-thought prompting

X Yuan, C Shen, S Yan, X Zhang, L Xie… - arXiv preprint arXiv …, 2024 - arxiv.org
Zero-shot Chain-of-Thought (CoT) prompting emerges as a simple and effective strategy for
enhancing the performance of large language models (LLMs) in real-world reasoning tasks …

Oscar: Operating system control via state-aware reasoning and re-planning

X Wang, B Liu - arXiv preprint arXiv:2410.18963, 2024 - arxiv.org
Large language models (LLMs) and large multimodal models (LMMs) have shown great
potential in automating complex tasks like web browsing and gaming. However, their ability …

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

L Yang, Z Yu, T Zhang, S Cao, M Xu, W Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Buffer of Thoughts (BoT), a novel and versatile thought-augmented reasoning
approach for enhancing accuracy, efficiency and robustness of large language models …

Automatic Instruction Evolving for Large Language Models

W Zeng, C Xu, Y Zhao, JG Lou, W Chen - arXiv preprint arXiv:2406.00770, 2024 - arxiv.org
Fine-tuning large pre-trained language models with Evol-Instruct has achieved encouraging
results across a wide range of tasks. However, designing effective evolving methods for …