EvoPrompting: language models for code-level neural architecture search

A Chen, D Dohan, D So - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Given the recent impressive accomplishments of language models (LMs) for code
generation, we explore the use of LMs as general adaptive mutation and crossover …

Prompting is programming: A query language for large language models

L Beurer-Kellner, M Fischer, M Vechev - Proceedings of the ACM on …, 2023 - dl.acm.org
Large language models have demonstrated outstanding performance on a wide range of
tasks such as question answering and code generation. On a high level, given an input, a …

When does confidence-based cascade deferral suffice?

W Jitkrittum, N Gupta, AK Menon… - Advances in …, 2024 - proceedings.neurips.cc
Cascades are a classical strategy to enable inference cost to vary adaptively across
samples, wherein a sequence of classifiers are invoked in turn. A deferral rule determines …

Despite" super-human" performance, current LLMs are unsuited for decisions about ethics and safety

J Albrecht, E Kitanidis, AJ Fetterman - arXiv preprint arXiv:2212.06295, 2022 - arxiv.org
Large language models (LLMs) have exploded in popularity in the past few years and have
achieved undeniably impressive results on benchmarks as varied as question answering …

[PDF][PDF] Minimum levels of interpretability for artificial moral agents

A Vijayaraghavan, C Badea - arXiv preprint arXiv:2307.00660, 2023 - academia.edu
As artificial intelligence (AI) models continue to scale up, they are becoming more capable
and integrated into various forms of decision-making systems. For models involved in moral …

From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models

S Welleck, A Bertsch, M Finlayson… - arXiv preprint arXiv …, 2024 - arxiv.org
One of the most striking findings in modern research on large language models (LLMs) is
that scaling up compute during training leads to better results. However, less attention has …

Learned interpreters: structural and learned systematicity in neural networks for program execution

D Bieber - 2023 - papyrus.bib.umontreal.ca
General purpose deep neural network architectures have made startling advances in
machine learning for code, advancing code completion, enabling natural language …