Value inference in sociotechnical systems

E Liscio, R Lera-Leri, F Bistaffa… - Proceedings of the …, 2023 - enricoliscio.github.io
As artificial agents become increasingly embedded in our society, we must ensure that their
behavior aligns with human values. Value alignment entails value inference, the process of …

The challenge of value alignment: From fairer algorithms to AI safety

I Gabriel, V Ghazavi - arXiv preprint arXiv:2101.06060, 2021 - arxiv.org
This paper addresses the question of how to align AI systems with human values and
situates it within a wider body of thought regarding technology and value. Far from existing …

AI alignment in the design of interactive AI: Specification alignment, process alignment, and evaluation support

M Terry, C Kulkarni, M Wattenberg, L Dixon… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment considers the overall problem of ensuring an AI produces desired outcomes,
without undesirable side effects. While often considered from the perspectives of safety and …

Value fulcra: Mapping large language models to the multidimensional spectrum of basic human values

J Yao, X Yi, X Wang, Y Gong, X Xie - arXiv preprint arXiv:2311.10766, 2023 - arxiv.org
The rapid advancement of Large Language Models (LLMs) has attracted much attention to
value alignment for their responsible development. However, how to define values in this …

What values should an agent align with? An empirical comparison of general and context-specific values

E Liscio, M van der Meer, LC Siebert… - Autonomous Agents and …, 2022 - Springer
The pursuit of values drives human behavior and promotes cooperation. Existing research is
focused on general values (e.g., Schwartz) that transcend contexts. However, context-specific …

On the philosophy of unsupervised learning

DS Watson - Philosophy & Technology, 2023 - Springer
Unsupervised learning algorithms are widely used for many important statistical tasks with
numerous applications in science and industry. Yet despite their prevalence, they have …

Current cases of AI misalignment and their implications for future risks

L Dung - Synthese, 2023 - Springer
How can one build AI systems such that they pursue the goals their designers want them to
pursue? This is the alignment problem. Numerous authors have raised concerns that, as …

Exploring the psychology of GPT-4's Moral and Legal Reasoning

GFCF Almeida, JL Nunes, N Engelmann… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models have been used as the foundation of highly sophisticated artificial
intelligences, capable of delivering human-like responses to probes about legal and moral …

Ethics of generative AI and manipulation: a design-oriented research agenda

M Klenk - Ethics and Information Technology, 2024 - Springer
Generative AI enables automated, effective manipulation at scale. Despite the growing
general ethical discussion around generative AI, the specific manipulation risks remain …

Artificial intelligence, humanistic ethics

J Tasioulas - Daedalus, 2022 - direct.mit.edu
Ethics is concerned with what it is to live a flourishing life and what it is we morally owe to
others. The optimizing mindset prevalent among computer scientists and economists …