A rubric for human-like agents and NeuroAI

I Momennejad - … Transactions of the Royal Society B, 2023 - royalsocietypublishing.org
Researchers across cognitive, neuro-and computer sciences increasingly reference 'human-
like'artificial intelligence and 'neuroAI'. However, the scope and use of the terms are often …

Evaluating cognitive maps and planning in large language models with CogEval

I Momennejad, H Hasanbeig… - Advances in …, 2024 - proceedings.neurips.cc
Recently an influx of studies claims emergent cognitive abilities in large language models
(LLMs). Yet, most rely on anecdotes, overlook contamination of training sets, or lack …

Curious replay for model-based adaptation

I Kauvar, C Doyle, L Zhou, N Haber - arXiv preprint arXiv:2306.15934, 2023 - arxiv.org
Agents must be able to adapt quickly as an environment changes. We find that existing
model-based reinforcement learning agents are unable to do this well, in part because of …

Multi-step generalized policy improvement by leveraging approximate models

LN Alegre, A Bazzan, A Nowé… - Advances in Neural …, 2024 - proceedings.neurips.cc
We introduce a principled method for performing zero-shot transfer in reinforcement learning
(RL) by exploiting approximate models of the environment. Zero-shot transfer in RL has …

Importance of prefrontal meta control in human-like reinforcement learning

JH Lee, JZ Leibo, SJ An, SW Lee - Frontiers in Computational …, 2022 - frontiersin.org
Recent investigation on reinforcement learning (RL) has demonstrated considerable
flexibility in dealing with various problems. However, such models often experience difficulty …

The effectiveness of world models for continual reinforcement learning

S Kessler, M Ostaszewski… - Conference on …, 2023 - proceedings.mlr.press
World models power some of the most efficient reinforcement learning algorithms. In this
work, we showcase that they can be harnessed for continual learning–a situation when the …

Learning and Planning with the Average-Reward Formulation

Y Wan - 2023 - era.library.ualberta.ca
The average-reward formulation is a natural and important formulation of learning and
planning problems, yet has received much less attention than the episodic and discounted …

Reinforcement Learning Recommendation Algorithm Based on Environmental Information Exploration

J Jia, H Wang, F Wang, T Li, S Liu - 2024 - researchsquare.com
Most of the current recommendation algorithms based on reinforcement learning are biased
towards the design of exploration strategies at the model level, ignoring the full use of …

Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning

A Rahimi-Kalahroudi, J Rajendran… - Conference on …, 2023 - proceedings.mlr.press
One of the key behavioral characteristics used in neuroscience to determine whether the
subject of study—be it a rodent or a human—exhibits model-based learning is effective …

Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning

A Rahimi-Kalahroudi, J Rajendran… - Deep Reinforcement …, 2023 - openreview.net
One of the key behavioral characteristics used in neuroscience to determine whether the
subject of study---be it a rodent or a human---exhibits model-based learning is effective …