Reinforcement learning, fast and slow

M Botvinick, S Ritter, JX Wang, Z Kurth-Nelson… - Trends in cognitive …, 2019 - cell.com
Deep reinforcement learning (RL) methods have driven impressive advances in artificial
intelligence in recent years, exceeding human performance in domains ranging from Atari to …

Reinforcement learning and episodic memory in humans and animals: an integrative framework

SJ Gershman, ND Daw - Annual review of psychology, 2017 - annualreviews.org
We review the psychology and neuroscience of reinforcement learning (RL), which has
experienced significant progress in the past two decades, enabled by the comprehensive …

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com
Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Prefrontal cortex as a meta-reinforcement learning system

JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala… - Nature …, 2018 - nature.com
Over the past 20 years, neuroscience research on reward-based learning has converged on
a canonical model, under which the neurotransmitter dopamine 'stamps in'associations …

State and rate-of-change encoding in parallel mesoaccumbal dopamine pathways

JW de Jong, Y Liang, JPH Verharen, KM Fraser… - Nature …, 2024 - nature.com
The nervous system uses fast-and slow-adapting sensory detectors in parallel to enable
neuronal representations of external states and their temporal dynamics. It is unknown …

The successor representation in human reinforcement learning

I Momennejad, EM Russek, JH Cheong… - Nature human …, 2017 - nature.com
Theories of reward learning in neuroscience have focused on two families of algorithms
thought to capture deliberative versus habitual choice.'Model-based'algorithms compute the …

Prioritized memory access explains planning and hippocampal replay

MG Mattar, ND Daw - Nature neuroscience, 2018 - nature.com
To make decisions, animals must evaluate candidate choices by accessing memories of
relevant experiences. Yet little is known about which experiences are considered or ignored …

Humans primarily use model-based inference in the two-stage task

C Feher da Silva, TA Hare - Nature Human Behaviour, 2020 - nature.com
Distinct model-free and model-based learning processes are thought to drive both typical
and dysfunctional behaviours. Data from two-stage decision tasks have seemingly shown …

Believing in dopamine

SJ Gershman, N Uchida - Nature Reviews Neuroscience, 2019 - nature.com
Midbrain dopamine signals are widely thought to report reward prediction errors that drive
learning in the basal ganglia. However, dopamine has also been implicated in various …

Predictive representations can link model-based reinforcement learning to model-free mechanisms

EM Russek, I Momennejad, MM Botvinick… - PLoS computational …, 2017 - journals.plos.org
Humans and animals are capable of evaluating actions by considering their long-run future
rewards through a process described using model-based reinforcement learning (RL) …