Artificial intelligence, values, and alignment

E Liscio, R Lera-Leri, F Bistaffa… - Proceedings of the …, 2023 - enricoliscio.github.io

As artificial agents become increasingly embedded in our society, we must ensure that their
behavior aligns with human values. Value alignment entails value inference, the process of …

被引用次数：22 相关文章所有 13 个版本

[PDF] arxiv.org

The challenge of value alignment: From fairer algorithms to AI safety

I Gabriel, V Ghazavi - arXiv preprint arXiv:2101.06060, 2021 - arxiv.org

This paper addresses the question of how to align AI systems with human values and
situates it within a wider body of thought regarding technology and value. Far from existing …

被引用次数：42 相关文章所有 5 个版本

[PDF] arxiv.org

AI alignment in the design of interactive AI: Specification alignment, process alignment, and evaluation support

M Terry, C Kulkarni, M Wattenberg, L Dixon… - arXiv preprint arXiv …, 2023 - arxiv.org

AI alignment considers the overall problem of ensuring an AI produces desired outcomes,
without undesirable side effects. While often considered from the perspectives of safety and …

被引用次数：9 相关文章所有 2 个版本

[PDF] arxiv.org

Value fulcra: Mapping large language models to the multidimensional spectrum of basic human values

J Yao, X Yi, X Wang, Y Gong, X Xie - arXiv preprint arXiv:2311.10766, 2023 - arxiv.org

The rapid advancement of Large Language Models (LLMs) has attracted much attention to
value alignment for their responsible development. However, how to define values in this …

被引用次数：9 相关文章所有 3 个版本

[HTML] springer.com

[HTML][HTML] What values should an agent align with? An empirical comparison of general and context-specific values

E Liscio, M van der Meer, LC Siebert… - Autonomous Agents and …, 2022 - Springer

The pursuit of values drives human behavior and promotes cooperation. Existing research is
focused on general values (eg, Schwartz) that transcend contexts. However, context-specific …

被引用次数：26 相关文章所有 15 个版本

[HTML] springer.com

[HTML][HTML] On the philosophy of unsupervised learning

DS Watson - Philosophy & Technology, 2023 - Springer

Unsupervised learning algorithms are widely used for many important statistical tasks with
numerous applications in science and industry. Yet despite their prevalence, they have …

被引用次数：17 相关文章所有 6 个版本

[HTML] springer.com

[HTML][HTML] Current cases of AI misalignment and their implications for future risks

L Dung - Synthese, 2023 - Springer

How can one build AI systems such that they pursue the goals their designers want them to
pursue? This is the alignment problem. Numerous authors have raised concerns that, as …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

Exploring the psychology of GPT-4's Moral and Legal Reasoning

GFCF Almeida, JL Nunes, N Engelmann… - arXiv preprint arXiv …, 2023 - arxiv.org

Large language models have been used as the foundation of highly sophisticated artificial
intelligences, capable of delivering human-like responses to probes about legal and moral …

被引用次数：13 相关文章所有 2 个版本

[HTML] springer.com

[HTML][HTML] Ethics of generative AI and manipulation: a design-oriented research agenda

M Klenk - Ethics and Information Technology, 2024 - Springer

Generative AI enables automated, effective manipulation at scale. Despite the growing
general ethical discussion around generative AI, the specific manipulation risks remain …

被引用次数：12 相关文章所有 7 个版本

[PDF] mit.edu

Artificial intelligence, humanistic ethics

J Tasioulas - Daedalus, 2022 - direct.mit.edu

Ethics is concerned with what it is to live a flourishing life and what it is we morally owe to
others. The optimizing mindset prevalent among computer scientists and economists …

被引用次数：25 相关文章所有 7 个版本

高级搜索

QQ 群