[HTML][HTML] Reward is enough

D Silver, S Singh, D Precup, RS Sutton - Artificial Intelligence, 2021 - Elsevier
In this article we hypothesise that intelligence, and its associated abilities, can be
understood as subserving the maximisation of reward. Accordingly, reward is enough to …

Machine culture

L Brinkmann, F Baumann, JF Bonnefon… - Nature Human …, 2023 - nature.com
The ability of humans to create and disseminate culture is often credited as the single most
important factor of our success as a species. In this Perspective, we explore the notion of …

A social path to human-like artificial intelligence

EA Duéñez-Guzmán, S Sadedin, JX Wang… - Nature Machine …, 2023 - nature.com
Traditionally, cognitive and computer scientists have viewed intelligence solipsistically, as a
property of unitary agents devoid of social context. Given the success of contemporary …

Learning few-shot imitation as cultural transmission

A Bhoopchand, B Brownfield, A Collister… - Nature …, 2023 - nature.com
Cultural transmission is the domain-general social skill that allows agents to acquire and
use information from each other in real-time with high fidelity and recall. It can be thought of …

Emergent social learning via multi-agent reinforcement learning

KK Ndousse, D Eck, S Levine… - … conference on machine …, 2021 - proceedings.mlr.press
Social learning is a key component of human and animal intelligence. By taking cues from
the behavior of experts in their environment, social learners can acquire sophisticated …

Imitating interactive intelligence

J Abramson, A Ahuja, I Barr, A Brussee… - arXiv preprint arXiv …, 2020 - arxiv.org
A common vision from science fiction is that robots will one day inhabit our physical spaces,
sense the world as we do, assist our physical labours, and communicate with us through …

Conditioned reinforcement learning for few-shot imitation

CR Dance, J Perez, T Cachet - International Conference on …, 2021 - proceedings.mlr.press
In few-shot imitation, an agent is given a few demonstrations of a previously unseen task,
and must then successfully perform that task. We propose a novel approach to learning few …

Psiphi-learning: Reinforcement learning with demonstrations using successor features and inverse temporal difference learning

A Filos, C Lyle, Y Gal, S Levine… - International …, 2021 - proceedings.mlr.press
We study reinforcement learning (RL) with no-reward demonstrations, a setting in which an
RL agent has access to additional data from the interaction of other agents with the same …

Reinforcement learning for personalization: A systematic literature review

F Den Hengst, EM Grua, A el Hassouni… - Data …, 2020 - content.iospress.com
The major application areas of reinforcement learning (RL) have traditionally been game
playing and continuous control. In recent years, however, RL has been increasingly applied …

[HTML][HTML] Fast and accurate data-driven goal recognition using process mining techniques

Z Su, A Polyvyanyy, N Lipovetzky, S Sardiña… - Artificial Intelligence, 2023 - Elsevier
The problem of goal recognition requests to automatically infer an accurate probability
distribution over possible goals an autonomous agent is attempting to achieve in the …