相关文章- 学术资源搜索

[PDF][PDF] The AI alignment problem: why it is hard, and where to start

E Yudkowsky - Symbolic Systems Distinguished Speaker, 2016 - intelligence.org

If we can build sufficiently advanced machine intelligences, what goals should we point
them at? The frontier open problems on this subject are less,“A robot may not injure a …

被引用次数：63 相关文章

[HTML] springer.com

[HTML][HTML] Current cases of AI misalignment and their implications for future risks

L Dung - Synthese, 2023 - Springer

How can one build AI systems such that they pursue the goals their designers want them to
pursue? This is the alignment problem. Numerous authors have raised concerns that, as …

被引用次数：14 相关文章所有 3 个版本

[PDF] arxiv.org

Of Models and Tin Men--a behavioural economics study of principal-agent problems in AI alignment using large-language models

S Phelps, R Ranson - arXiv preprint arXiv:2307.11137, 2023 - arxiv.org

AI Alignment is often presented as an interaction between a single designer and an artificial
agent in which the designer attempts to ensure the agent's behavior is consistent with its …

被引用次数：3 相关文章所有 6 个版本

[HTML] springer.com

[HTML][HTML] Artificial intelligence, values, and alignment

I Gabriel - Minds and machines, 2020 - Springer

This paper looks at philosophical questions that arise in the context of AI alignment. It
defends three propositions. First, normative and technical aspects of the AI alignment …

被引用次数：572 相关文章所有 15 个版本

[PDF] arxiv.org

Alignment of language agents

Z Kenton, T Everitt, L Weidinger, I Gabriel… - arXiv preprint arXiv …, 2021 - arxiv.org

For artificial intelligence to be beneficial to humans the behaviour of AI agents needs to be
aligned with what humans want. In this paper we discuss some behavioural issues for …

被引用次数：136 相关文章所有 4 个版本

[PDF] openphilanthropy.org

Agent foundations for aligning machine intelligence with human interests: a technical research agenda

N Soares, B Fallenstein - The technological singularity: Managing the …, 2017 - Springer

In this chapter, we discuss a host of technical problems that we think AI scientists could work
on to ensure that the creation of smarter-than-human machine intelligence has a positive …

被引用次数：82 相关文章所有 9 个版本

[PDF] nber.org

Aligned with whom? Direct and social goals for AI systems

A Korinek, A Balwit - 2022 - nber.org

As artificial intelligence (AI) becomes more powerful and widespread, the AI alignment
problem—how to ensure that AI systems pursue the goals that we want them to pursue—has …

被引用次数：12 相关文章所有 13 个版本

[HTML] springer.com

[HTML][HTML] The obscure politics of artificial intelligence: a Marxian socio-technical critique of the AI alignment problem thesis

F Cugurullo - AI and Ethics, 2024 - Springer

There is a growing feeling that artificial intelligence (AI) is getting out of control. Many AI
experts worldwide stress that great care must be taken on the so-called alignment problem …

被引用次数：2 相关文章所有 2 个版本

Ready for robots: how to think about the future of AI

K Cukier - Foreign Aff., 2019 - HeinOnline

EDITED BY JOHN BROCKMAN. Penguin Press, 2019, 320 pp. n 1955, John McCarthy
coined the term" artificial intelligence"(AI) in a grant proposal that he co-wrote with his …

被引用次数：36 相关文章所有 2 个版本

[PDF] amazonaws.com

[图书][B] The alignment problem: How can machines learn human values?

B Christian - 2021 - books.google.com

'Vital reading. This is the book on artificial intelligence we need right now.'Mike Krieger,
cofounder of Instagram Artificial intelligence is rapidly dominating every aspect of our …

被引用次数：489 相关文章所有 7 个版本

高级搜索

QQ 群

[PDF][PDF] The AI alignment problem: why it is hard, and where to start

[HTML][HTML] Current cases of AI misalignment and their implications for future risks

Of Models and Tin Men--a behavioural economics study of principal-agent problems in AI alignment using large-language models

[HTML][HTML] Artificial intelligence, values, and alignment

Alignment of language agents

Agent foundations for aligning machine intelligence with human interests: a technical research agenda

Aligned with whom? Direct and social goals for AI systems

[HTML][HTML] The obscure politics of artificial intelligence: a Marxian socio-technical critique of the AI alignment problem thesis

Ready for robots: how to think about the future of AI

[图书][B] The alignment problem: How can machines learn human values?

相关搜索

引用