Language model cascades, 2022

A Chen, D Dohan, D So - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Given the recent impressive accomplishments of language models (LMs) for code
generation, we explore the use of LMs as general adaptive mutation and crossover …

被引用次数：44 相关文章所有 5 个版本

[PDF] acm.org

Prompting is programming: A query language for large language models

L Beurer-Kellner, M Fischer, M Vechev - Proceedings of the ACM on …, 2023 - dl.acm.org

Large language models have demonstrated outstanding performance on a wide range of
tasks such as question answering and code generation. On a high level, given an input, a …

被引用次数：80 相关文章所有 4 个版本

[PDF] neurips.cc

When does confidence-based cascade deferral suffice?

W Jitkrittum, N Gupta, AK Menon… - Advances in …, 2024 - proceedings.neurips.cc

Cascades are a classical strategy to enable inference cost to vary adaptively across
samples, wherein a sequence of classifiers are invoked in turn. A deferral rule determines …

被引用次数：6 相关文章所有 5 个版本

[PDF] arxiv.org

Despite" super-human" performance, current LLMs are unsuited for decisions about ethics and safety

J Albrecht, E Kitanidis, AJ Fetterman - arXiv preprint arXiv:2212.06295, 2022 - arxiv.org

Large language models (LLMs) have exploded in popularity in the past few years and have
achieved undeniably impressive results on benchmarks as varied as question answering …

被引用次数：9 相关文章所有 3 个版本

[PDF] academia.edu

[PDF][PDF] Minimum levels of interpretability for artificial moral agents

A Vijayaraghavan, C Badea - arXiv preprint arXiv:2307.00660, 2023 - academia.edu

As artificial intelligence (AI) models continue to scale up, they are becoming more capable
and integrated into various forms of decision-making systems. For models involved in moral …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

S Welleck, A Bertsch, M Finlayson… - arXiv preprint arXiv …, 2024 - arxiv.org

One of the most striking findings in modern research on large language models (LLMs) is
that scaling up compute during training leads to better results. However, less attention has …

Learned interpreters: structural and learned systematicity in neural networks for program execution

D Bieber - 2023 - papyrus.bib.umontreal.ca

General purpose deep neural network architectures have made startling advances in
machine learning for code, advancing code completion, enabling natural language …

高级搜索

QQ 群

EvoPrompting: language models for code-level neural architecture search