[PDF][PDF] The AI alignment problem: why it is hard, and where to start

E Yudkowsky - Symbolic Systems Distinguished Speaker, 2016 - intelligence.org
If we can build sufficiently advanced machine intelligences, what goals should we point
them at? The frontier open problems on this subject are less,“A robot may not injure a …

[HTML][HTML] Current cases of AI misalignment and their implications for future risks

L Dung - Synthese, 2023 - Springer
How can one build AI systems such that they pursue the goals their designers want them to
pursue? This is the alignment problem. Numerous authors have raised concerns that, as …

Of Models and Tin Men--a behavioural economics study of principal-agent problems in AI alignment using large-language models

S Phelps, R Ranson - arXiv preprint arXiv:2307.11137, 2023 - arxiv.org
AI Alignment is often presented as an interaction between a single designer and an artificial
agent in which the designer attempts to ensure the agent's behavior is consistent with its …

[HTML][HTML] Artificial intelligence, values, and alignment

I Gabriel - Minds and machines, 2020 - Springer
This paper looks at philosophical questions that arise in the context of AI alignment. It
defends three propositions. First, normative and technical aspects of the AI alignment …

Alignment of language agents

Z Kenton, T Everitt, L Weidinger, I Gabriel… - arXiv preprint arXiv …, 2021 - arxiv.org
For artificial intelligence to be beneficial to humans the behaviour of AI agents needs to be
aligned with what humans want. In this paper we discuss some behavioural issues for …

Agent foundations for aligning machine intelligence with human interests: a technical research agenda

N Soares, B Fallenstein - The technological singularity: Managing the …, 2017 - Springer
In this chapter, we discuss a host of technical problems that we think AI scientists could work
on to ensure that the creation of smarter-than-human machine intelligence has a positive …

Aligned with whom? Direct and social goals for AI systems

A Korinek, A Balwit - 2022 - nber.org
As artificial intelligence (AI) becomes more powerful and widespread, the AI alignment
problem—how to ensure that AI systems pursue the goals that we want them to pursue—has …

[HTML][HTML] The obscure politics of artificial intelligence: a Marxian socio-technical critique of the AI alignment problem thesis

F Cugurullo - AI and Ethics, 2024 - Springer
There is a growing feeling that artificial intelligence (AI) is getting out of control. Many AI
experts worldwide stress that great care must be taken on the so-called alignment problem …

Ready for robots: how to think about the future of AI

K Cukier - Foreign Aff., 2019 - HeinOnline
EDITED BY JOHN BROCKMAN. Penguin Press, 2019, 320 pp. n 1955, John McCarthy
coined the term" artificial intelligence"(AI) in a grant proposal that he co-wrote with his …

[图书][B] The alignment problem: How can machines learn human values?

B Christian - 2021 - books.google.com
'Vital reading. This is the book on artificial intelligence we need right now.'Mike Krieger,
cofounder of Instagram Artificial intelligence is rapidly dominating every aspect of our …