Toolkengpt: Augmenting frozen language models with massive tools via tool embeddings

S Hao, T Liu, Z Wang, Z Hu - Advances in neural …, 2024 - proceedings.neurips.cc
Integrating large language models (LLMs) with various tools has led to increased attention
in the field. Existing approaches either involve fine-tuning the LLM, which is both …

Prompting is programming: A query language for large language models

L Beurer-Kellner, M Fischer, M Vechev - Proceedings of the ACM on …, 2023 - dl.acm.org
Large language models have demonstrated outstanding performance on a wide range of
tasks such as question answering and code generation. On a high level, given an input, a …

The life cycle of knowledge in big language models: A survey

B Cao, H Lin, X Han, L Sun - Machine Intelligence Research, 2024 - Springer
Abstract Knowledge plays a critical role in artificial intelligence. Recently, the extensive
success of pre-trained language models (PLMs) has raised significant attention about how …

Promptagent: Strategic planning with language models enables expert-level prompt optimization

X Wang, C Li, Z Wang, F Bai, H Luo, J Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Highly effective, task-specific prompts are often heavily engineered by experts to integrate
detailed instructions and domain insights based on a deep understanding of both instincts of …

Large language models are temporal and causal reasoners for video question answering

D Ko, JS Lee, W Kang, B Roh, HJ Kim - arXiv preprint arXiv:2310.15747, 2023 - arxiv.org
Large Language Models (LLMs) have shown remarkable performances on a wide range of
natural language understanding and generation tasks. We observe that the LLMs provide …

Holistic evaluation of gpt-4v for biomedical imaging

Z Liu, H Jiang, T Zhong, Z Wu, C Ma, Y Li, X Yu… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and
limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial …

Gpt is becoming a turing machine: Here are some ways to program it

A Jojic, Z Wang, N Jojic - arXiv preprint arXiv:2303.14310, 2023 - arxiv.org
We demonstrate that, through appropriate prompting, GPT-3 family of models can be
triggered to perform iterative behaviours necessary to execute (rather than just write or …

Iterated decomposition: Improving science q&a by supervising reasoning processes

J Reppert, B Rachbach, C George, L Stebbing… - arXiv preprint arXiv …, 2023 - arxiv.org
Language models (LMs) can perform complex reasoning either end-to-end, with hidden
latent state, or compositionally, with transparent intermediate state. Composition offers …

Disentangling extraction and reasoning in multi-hop spatial reasoning

R Mirzaee, P Kordjamshidi - arXiv preprint arXiv:2310.16731, 2023 - arxiv.org
Spatial reasoning over text is challenging as the models not only need to extract the direct
spatial information from the text but also reason over those and infer implicit spatial relations …

Prompt Sketching for Large Language Models

L Beurer-Kellner, MN Müller, M Fischer… - arXiv preprint arXiv …, 2023 - arxiv.org
Many recent prompting strategies for large language models (LLMs) query the model
multiple times sequentially--first to produce intermediate results and then the final answer …