[HTML][HTML] Reward is enough

D Silver, S Singh, D Precup, RS Sutton - Artificial Intelligence, 2021 - Elsevier
In this article we hypothesise that intelligence, and its associated abilities, can be
understood as subserving the maximisation of reward. Accordingly, reward is enough to …

Scalable agent alignment via reward modeling: a research direction

J Leike, D Krueger, T Everitt, M Martic, V Maini… - arXiv preprint arXiv …, 2018 - arxiv.org
One obstacle to applying reinforcement learning algorithms to real-world problems is the
lack of suitable reward functions. Designing such reward functions is difficult in part because …

Research priorities for robust and beneficial artificial intelligence

S Russell, D Dewey, M Tegmark - AI magazine, 2015 - ojs.aaai.org
Success in the quest for artificial intelligence has the potential to bring unprecedented
benefits to humanity, and it is therefore worthwhile to investigate how to maximize these …

AI safety gridworlds

J Leike, M Martic, V Krakovna, PA Ortega… - arXiv preprint arXiv …, 2017 - arxiv.org
We present a suite of reinforcement learning environments illustrating various safety
properties of intelligent agents. These problems include safe interruptibility, avoiding side …

Three dogmas of reinforcement learning

D Abel, MK Ho, A Harutyunyan - arXiv preprint arXiv:2407.10583, 2024 - arxiv.org
Modern reinforcement learning has been conditioned by at least three dogmas. The first is
the environment spotlight, which refers to our tendency to focus on modeling environments …

Self-predictive universal AI

E Catt, J Grau-Moya, M Hutter… - Advances in …, 2023 - proceedings.neurips.cc
Reinforcement Learning (RL) algorithms typically utilize learning and/or planning
techniques to derive effective policies. The integration of both approaches has proven to be …

Embedded agency

A Demski, S Garrabrant - arXiv preprint arXiv:1902.09469, 2019 - arxiv.org
Traditional models of rational action treat the agent as though it is cleanly separated from its
environment, and can act on that environment from the outside. Such agents have a known …

Towards safe artificial general intelligence

T Everitt - 2019 - search.proquest.com
The field of artificial intelligence has recently experienced a number of breakthroughs thanks
to progress in deep learning and reinforcement learning. Computer algorithms now …

Ethical artificial intelligence

B Hibbard - arXiv preprint arXiv:1411.1373, 2014 - arxiv.org
This book-length article combines several peer reviewed papers and new material to
analyze the issues of ethical artificial intelligence (AI). The behavior of future AI systems can …

Information-theoretic bounded rationality

PA Ortega, DA Braun, J Dyer, KE Kim… - arXiv preprint arXiv …, 2015 - arxiv.org
Bounded rationality, that is, decision-making and planning under resource limitations, is
widely regarded as an important open problem in artificial intelligence, reinforcement …