Filling gaps in trustworthy development of AI

CE Richards, A Tzachor, S Avin, R Fenner - Nature Water, 2023 - nature.com

Artificial intelligence (AI) is increasingly proposed to address deficiencies across water
systems, which currently leave about 25% of the global population without clean water …

Cited by 30 Related articles

Ethics-based AI auditing: A systematic literature review on conceptualizations of ethical principles and knowledge contributions to stakeholders

J Laine, M Minkkinen, M Mäntymäki - Information & Management, 2024 - Elsevier

This systematic literature review synthesizes the conceptualizations of ethical principles in AI
auditing literature and the knowledge contributions to the stakeholders of AI auditing. We …

Cited by 9 Related articles

Red teaming language models to reduce harms: Methods, scaling behaviors, and lessons learned

D Ganguli, L Lovitt, J Kernion, A Askell, Y Bai… - arXiv preprint arXiv …, 2022 - arxiv.org

We describe our early efforts to red team language models in order to simultaneously
discover, measure, and attempt to reduce their potentially harmful outputs. We make three …

Cited by 327 Related articles All 2 versions

Auditing large language models: a three-layered approach

J Mökander, J Schuett, HR Kirk, L Floridi - AI and Ethics, 2023 - Springer

Large language models (LLMs) represent a major advance in artificial intelligence (AI)
research. However, the widespread use of LLMs is also coupled with significant ethical and …

Cited by 155 Related articles All 6 versions

Predictability and surprise in large generative models

D Ganguli, D Hernandez, L Lovitt, A Askell… - Proceedings of the …, 2022 - dl.acm.org

Large-scale pre-training has recently emerged as a technique for creating capable,
general-purpose, generative models such as GPT-3, Megatron-Turing NLG, Gopher, and many …

Cited by 236 Related articles All 6 versions

Plex: Towards reliability using pretrained large model extensions

D Tran, J Liu, MW Dusenberry, D Phan… - arXiv preprint arXiv …, 2022 - arxiv.org

A recent trend in artificial intelligence is the use of pretrained models for language and
vision tasks, which have achieved extraordinary performance but also puzzling failures …

Cited by 100 Related articles All 6 versions

AI regulation is (not) all you need

L Lucaj, P Van Der Smagt, D Benbouzid - Proceedings of the 2023 ACM …, 2023 - dl.acm.org

The development of processes and tools for ethical, trustworthy, and legal AI is only
beginning. At the same time, legal requirements are emerging in various jurisdictions …

Cited by 20 Related articles All 4 versions

Automating in-network machine learning

C Zheng, M Zang, X Hong, R Bensoussane… - arXiv preprint arXiv …, 2022 - arxiv.org

Using programmable network devices to aid in-network machine learning has been the
focus of significant research. However, most of the research was of a limited scope …

Cited by 47 Related articles All 4 versions

Regulating advanced artificial agents

MK Cohen, N Kolt, Y Bengio, GK Hadfield, S Russell - Science, 2024 - science.org

Technical experts and policy-makers have increasingly emphasized the need to address
extinction risk from artificial intelligence (AI) systems that might circumvent safeguards and …

Cited by 15 Related articles All 8 versions

Certification labels for trustworthy AI: Insights from an empirical mixed-method study

N Scharowski, M Benk, SJ Kühne, L Wettstein… - Proceedings of the …, 2023 - dl.acm.org

Auditing plays a pivotal role in the development of trustworthy AI. However, current research
primarily focuses on creating auditable AI documentation, which is intended for regulators …

Cited by 11 Related articles All 7 versions
