The moral psychology of Artificial Intelligence

JF Bonnefon, I Rahwan, A Shariff - Annual review of psychology, 2024 - annualreviews.org
Moral psychology was shaped around three categories of agents and patients: humans,
other animals, and supernatural beings. Rapid progress in artificial intelligence has …

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Camel: Communicative agents for" mind" exploration of large language model society

G Li, H Hammoud, H Itani… - Advances in Neural …, 2023 - proceedings.neurips.cc
The rapid advancement of chat-based language models has led to remarkable progress in
complex task-solving. However, their success heavily relies on human input to guide the …

Human-level play in the game of Diplomacy by combining language models with strategic reasoning

Meta Fundamental AI Research Diplomacy Team … - Science, 2022 - science.org
Despite much progress in training artificial intelligence (AI) systems to imitate human
language, building agents that use language to communicate intentionally with humans in …

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arXiv preprint arXiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Restoring and attributing ancient texts using deep neural networks

Y Assael, T Sommerschield, B Shillingford, M Bordbar… - Nature, 2022 - nature.com
Ancient history relies on disciplines such as epigraphy—the study of inscribed texts known
as inscriptions—for evidence of the thought, language, society and history of past …

[HTML][HTML] Rethinking the entwinement between artificial intelligence and human learning: What capabilities do learners need for a world with AI?

L Markauskaite, R Marrone, O Poquet, S Knight… - … and Education: Artificial …, 2022 - Elsevier
The proliferation of AI in many aspects of human life—from personal leisure, to collaborative
professional work, to global policy decisions—poses a sharp question about how to prepare …

Designing creative AI partners with COFI: A framework for modeling interaction in human-AI co-creative systems

J Rezwana, ML Maher - ACM Transactions on Computer-Human …, 2023 - dl.acm.org
Human-AI co-creativity involves both humans and AI collaborating on a shared creative
product as partners. In a creative collaboration, interaction dynamics, such as turn-taking …

Studying up machine learning data: Why talk about bias when we mean power?

M Miceli, J Posada, T Yang - Proceedings of the ACM on Human …, 2022 - dl.acm.org
Research in machine learning (ML) has argued that models trained on incomplete or biased
datasets can lead to discriminatory outputs. In this commentary, we propose moving the …

Negotiation and honesty in artificial intelligence methods for the board game of Diplomacy

J Kramár, T Eccles, I Gemp, A Tacchetti… - Nature …, 2022 - nature.com
The success of human civilization is rooted in our ability to cooperate by communicating and
making joint plans. We study how artificial agents may use communication to better …