- 学术资源搜索

Machine culture

L Brinkmann, F Baumann, JF Bonnefon… - Nature Human …, 2023 - nature.com

The ability of humans to create and disseminate culture is often credited as the single most
important factor of our success as a species. In this Perspective, we explore the notion of …

被引用次数：58 相关文章所有 22 个版本

[PDF] arxiv.org

Pre-trained language models for text generation: A survey

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org

Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

被引用次数：309 相关文章所有 7 个版本

[PDF] arxiv.org

A categorical archive of chatgpt failures

A Borji - arXiv preprint arXiv:2302.03494, 2023 - arxiv.org

Large language models have been demonstrated to be valuable in different fields. ChatGPT,
developed by OpenAI, has been trained using massive amounts of data and simulates …

被引用次数：447 相关文章所有 2 个版本

[PDF] mlr.press

Using large language models to simulate multiple humans and replicate human subject studies

GV Aher, RI Arriaga, AT Kalai - International Conference on …, 2023 - proceedings.mlr.press

We introduce a new type of test, called a Turing Experiment (TE), for evaluating to what
extent a given language model, such as GPT models, can simulate different aspects of …

被引用次数：314 相关文章所有 7 个版本

[PDF] neurips.cc

Towards automated circuit discovery for mechanistic interpretability

A Conmy, A Mavor-Parker, A Lynch… - Advances in …, 2023 - proceedings.neurips.cc

Through considerable effort and intuition, several recent works have reverse-engineered
nontrivial behaviors oftransformer models. This paper systematizes the mechanistic …

被引用次数：133 相关文章所有 6 个版本

[PDF] arxiv.org

Mass-editing memory in a transformer

K Meng, AS Sharma, A Andonian, Y Belinkov… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent work has shown exciting promise in updating large language models with new
memories, so as to replace obsolete information or add specialized knowledge. However …

被引用次数：320 相关文章所有 5 个版本

[PDF] neurips.cc

Locating and editing factual associations in GPT

K Meng, D Bau, A Andonian… - Advances in Neural …, 2022 - proceedings.neurips.cc

We analyze the storage and recall of factual associations in autoregressive transformer
language models, finding evidence that these associations correspond to localized, directly …

被引用次数：713 相关文章所有 7 个版本

[PDF] neurips.cc

Training language models to follow instructions with human feedback

L Ouyang, J Wu, X Jiang, D Almeida… - Advances in neural …, 2022 - proceedings.neurips.cc

Making language models bigger does not inherently make them better at following a user's
intent. For example, large language models can generate outputs that are untruthful, toxic, or …

被引用次数：8949 相关文章所有 18 个版本

[HTML] nature.com Full View

[HTML][HTML] Large language models propagate race-based medicine

JA Omiye, JC Lester, S Spichak, V Rotemberg… - NPJ Digital …, 2023 - nature.com

Large language models (LLMs) are being integrated into healthcare systems; but these
models may recapitulate harmful, race-based medicine. The objective of this study is to …

被引用次数：133 相关文章所有 8 个版本

[PDF] arxiv.org

Interpretability in the wild: a circuit for indirect object identification in gpt-2 small

K Wang, A Variengien, A Conmy, B Shlegeris… - arXiv preprint arXiv …, 2022 - arxiv.org

Research in mechanistic interpretability seeks to explain behaviors of machine learning
models in terms of their internal components. However, most previous work either focuses …

被引用次数：266 相关文章所有 4 个版本

高级搜索

QQ 群

Machine culture

Pre-trained language models for text generation: A survey

A categorical archive of chatgpt failures

Using large language models to simulate multiple humans and replicate human subject studies

Towards automated circuit discovery for mechanistic interpretability

Mass-editing memory in a transformer

Locating and editing factual associations in GPT

Training language models to follow instructions with human feedback

[HTML][HTML] Large language models propagate race-based medicine

Interpretability in the wild: a circuit for indirect object identification in gpt-2 small

引用