Artificial intelligence, values, and alignment

I Gabriel - Minds and machines, 2020 - Springer
This paper looks at philosophical questions that arise in the context of AI alignment. It
defends three propositions. First, normative and technical aspects of the AI alignment …

Aligning artificial intelligence with human values: reflections from a phenomenological perspective

S Han, E Kelly, S Nikou, EO Svee - AI & SOCIETY, 2022 - Springer
Artificial Intelligence (AI) must be directed at humane ends. The development of AI has
produced great uncertainties of ensuring AI alignment with human values (AI value …

STELA: a community-centred approach to norm elicitation for AI alignment

S Bergman, N Marchal, J Mellor, S Mohamed… - Scientific Reports, 2024 - nature.com
Value alignment, the process of ensuring that artificial intelligence (AI) systems are aligned
with human values and goals, is a critical issue in AI research. Existing scholarship has …

AI alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Taking principles seriously: A hybrid approach to value alignment in artificial intelligence

TW Kim, J Hooker, T Donaldson - Journal of Artificial Intelligence Research, 2021 - jair.org
An important step in the development of value alignment (VA) systems in artificial
intelligence (AI) is understanding how VA can reflect valid ethical principles. We propose …

Should we trust artificial intelligence?

M Sutrop - Trames, 2019 - ceeol.com
Trust is believed to be a foundational cornerstone for artificial intelligence (AI). In April 2019
the European Commission High Level Expert Group on AI adopted the Ethics Guidelines for …

Agent foundations for aligning machine intelligence with human interests: a technical research agenda

N Soares, B Fallenstein - The technological singularity: Managing the …, 2017 - Springer
In this chapter, we discuss a host of technical problems that we think AI scientists could work
on to ensure that the creation of smarter-than-human machine intelligence has a positive …

Building ethically bounded AI

F Rossi, N Mattei - Proceedings of the AAAI Conference on Artificial …, 2019 - aaai.org
The more AI agents are deployed in scenarios with possibly unexpected situations, the more
they need to be flexible, adaptive, and creative in achieving the goal we have given them …

[PDF] The AI alignment problem: why it is hard, and where to start

E Yudkowsky - Symbolic Systems Distinguished Speaker, 2016 - intelligence.org
If we can build sufficiently advanced machine intelligences, what goals should we point
them at? The frontier open problems on this subject are less, “A robot may not injure a …

[Book] The promise of artificial intelligence: reckoning and judgment

BC Smith - 2019 - books.google.com
An argument that—despite dramatic advances in the field—artificial intelligence is nowhere
near developing systems that are genuinely intelligent. In this provocative book, Brian …