J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As AI systems grow more capable, the potential large-scale risks associated with misaligned AI …
This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories …
In this short consensus paper, we outline risks from upcoming, advanced AI systems. We examine large-scale social harms and malicious uses, as well as an irreversible loss of …
Artificial intelligence (AI) is progressing rapidly, and companies are shifting their focus to developing generalist AI systems that can autonomously act and pursue goals. Increases in …
Most applications of Artificial Intelligence (AI) are designed for a confined and specific task. However, there are many scenarios that call for a more general AI, capable of …
This is the interim publication of the first International Scientific Report on the Safety of Advanced AI. The report synthesises the scientific understanding of general-purpose AI--AI …
In what sense does a large language model (LLM) have knowledge? We answer by granting LLMs 'instrumental knowledge': knowledge gained by using next-word generation …
AI progress is creating a growing range of risks and opportunities, but it is often unclear how they should be navigated. In many cases, the barriers and uncertainties faced are at least …
Increased delegation of commercial, scientific, governmental, and personal activities to AI agents—systems capable of pursuing complex goals with limited supervision—may …