Pythia: A suite for analyzing large language models across training and scaling

S Biderman, H Schoelkopf… - International …, 2023 - proceedings.mlr.press
How do large language models (LLMs) develop and evolve over the course of training?
How do these patterns change as models scale? To answer these questions, we introduce …

Bloom: A 176b-parameter open-access multilingual language model

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023 - inria.hal.science
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …

Emergent and predictable memorization in large language models

S Biderman, U Prashanth, L Sutawika… - Advances in …, 2024 - proceedings.neurips.cc
Memorization, or the tendency of large language models (LLMs) to output entire sequences
from their training data verbatim, is a key concern for deploying language models. In …

Opening up ChatGPT: Tracking openness, transparency, and accountability in instruction-tuned text generators

A Liesenfeld, A Lopez, M Dingemanse - Proceedings of the 5th …, 2023 - dl.acm.org
Large language models that exhibit instruction-following behaviour represent one of the
biggest recent upheavals in conversational interfaces, a trend in large part fuelled by the …

Reclaiming the digital commons: A public data trust for training data

A Chan, H Bradley, N Rajkumar - Proceedings of the 2023 AAAI/ACM …, 2023 - dl.acm.org
Democratization of AI means not only that people can freely use AI, but also that people can
collectively decide how AI is to be used. In particular, collective decision-making power is …

Automated Program Repair: Emerging trends pose and expose problems for benchmarks

J Renzullo, P Reiter, W Weimer, S Forrest - arXiv preprint arXiv …, 2024 - arxiv.org
Machine learning (ML) now pervades the field of Automated Program Repair (APR).
Algorithms deploy neural machine translation and large language models (LLMs) to …

Bloom: A 176b-parameter open-access multilingual language model

BS Workshop, TL Scao, A Fan, C Akiki… - arXiv preprint arXiv …, 2022 - arxiv.org
Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …

The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency and Usability in AI

M White, I Haddad, C Osborne, A Abdelmonsef… - arXiv preprint arXiv …, 2024 - arxiv.org
Generative AI (GAI) offers unprecedented possibilities but its commercialization has raised
concerns about transparency, reproducibility, bias, and safety. Many" open-source" GAI …

[PDF][PDF] THE MODEL OPENNESS FRAMEWORK: PROMOTING COMPLETENESS AND OPENNESS FOR REPRODUCIBILITY, TRANSPARENCY, AND USABILITY IN …

M White, I Haddad, C Osborne… - arXiv preprint arXiv …, 2024 - ibrahimatlinux.com
Generative AI (GAI) offers unprecedented opportunities for research and innovation, but its
commercialization has raised concerns about transparency, reproducibility, and safety …

LENTES DIALÓGICAS SOBRE A INTELIGÊNCIA ARTIFICIAL E SUA APLICABILIDADE NO SETOR EDUCACIONAL

WKF de Santana, PKO Souza, RM de Carvalho… - Revista …, 2024 - periodicos.ufac.br
A Inteligência Artificial (IA) tem se constituído como uma área do conhecimento que
convoca esferas multidisciplinares, apresentando-se como um dos horizontes mais …