Thinksum: Probabilistic reasoning over sets using large language models

S Hao, T Liu, Z Wang, Z Hu - Advances in neural …, 2024 - proceedings.neurips.cc

Integrating large language models (LLMs) with various tools has led to increased attention
in the field. Existing approaches either involve fine-tuning the LLM, which is both …

被引用次数：73 相关文章所有 8 个版本

[PDF] acm.org

Prompting is programming: A query language for large language models

L Beurer-Kellner, M Fischer, M Vechev - Proceedings of the ACM on …, 2023 - dl.acm.org

Large language models have demonstrated outstanding performance on a wide range of
tasks such as question answering and code generation. On a high level, given an input, a …

被引用次数：82 相关文章所有 4 个版本

[PDF] arxiv.org

The life cycle of knowledge in big language models: A survey

B Cao, H Lin, X Han, L Sun - Machine Intelligence Research, 2024 - Springer

Abstract Knowledge plays a critical role in artificial intelligence. Recently, the extensive
success of pre-trained language models (PLMs) has raised significant attention about how …

被引用次数：14 相关文章所有 5 个版本

[PDF] arxiv.org

Promptagent: Strategic planning with language models enables expert-level prompt optimization

X Wang, C Li, Z Wang, F Bai, H Luo, J Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Highly effective, task-specific prompts are often heavily engineered by experts to integrate
detailed instructions and domain insights based on a deep understanding of both instincts of …

被引用次数：29 相关文章所有 3 个版本

[PDF] arxiv.org

Large language models are temporal and causal reasoners for video question answering

D Ko, JS Lee, W Kang, B Roh, HJ Kim - arXiv preprint arXiv:2310.15747, 2023 - arxiv.org

Large Language Models (LLMs) have shown remarkable performances on a wide range of
natural language understanding and generation tasks. We observe that the LLMs provide …

被引用次数：21 相关文章所有 5 个版本

[PDF] arxiv.org

Holistic evaluation of gpt-4v for biomedical imaging

Z Liu, H Jiang, T Zhong, Z Wu, C Ma, Y Li, X Yu… - arXiv preprint arXiv …, 2023 - arxiv.org

In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and
limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial …

被引用次数：8 相关文章所有 2 个版本

[PDF] arxiv.org

Gpt is becoming a turing machine: Here are some ways to program it

A Jojic, Z Wang, N Jojic - arXiv preprint arXiv:2303.14310, 2023 - arxiv.org

We demonstrate that, through appropriate prompting, GPT-3 family of models can be
triggered to perform iterative behaviours necessary to execute (rather than just write or …

被引用次数：15 相关文章所有 4 个版本

[PDF] arxiv.org

Iterated decomposition: Improving science q&a by supervising reasoning processes

J Reppert, B Rachbach, C George, L Stebbing… - arXiv preprint arXiv …, 2023 - arxiv.org

Language models (LMs) can perform complex reasoning either end-to-end, with hidden
latent state, or compositionally, with transparent intermediate state. Composition offers …

被引用次数：13 相关文章所有 2 个版本

[PDF] arxiv.org

Disentangling extraction and reasoning in multi-hop spatial reasoning

R Mirzaee, P Kordjamshidi - arXiv preprint arXiv:2310.16731, 2023 - arxiv.org

Spatial reasoning over text is challenging as the models not only need to extract the direct
spatial information from the text but also reason over those and infer implicit spatial relations …

被引用次数：6 相关文章所有 5 个版本

[PDF] arxiv.org

Prompt Sketching for Large Language Models

L Beurer-Kellner, MN Müller, M Fischer… - arXiv preprint arXiv …, 2023 - arxiv.org

Many recent prompting strategies for large language models (LLMs) query the model
multiple times sequentially--first to produce intermediate results and then the final answer …

被引用次数：1 相关文章所有 4 个版本

高级搜索

QQ 群