Evaluating cognitive maps and planning in large language models with CogEval

I Momennejad, H Hasanbeig… - Advances in …, 2024 - proceedings.neurips.cc
Recently an influx of studies claims emergent cognitive abilities in large language models
(LLMs). Yet, most rely on anecdotes, overlook contamination of training sets, or lack …

Reward machines: Exploiting reward function structure in reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

Formal synthesis of controllers for safety-critical autonomous systems: Developments and challenges

X Yin, B Gao, X Yu - Annual Reviews in Control, 2024 - Elsevier
In recent years, formal methods have been extensively used in the design of autonomous
systems. By employing mathematically rigorous techniques, formal methods can provide …

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

Safe reinforcement learning using probabilistic shields

N Jansen, B Könighofer, S Junges… - 31st International …, 2020 - drops.dagstuhl.de
This paper concerns the efficient construction of a safety shield for reinforcement learning.
We specifically target scenarios that incorporate uncertainty and use Markov decision …

Control for smart systems: Challenges and trends in smart cities

QS Jia, H Panetto, M Macchi, S Siri, G Weichhart… - Annual Reviews in …, 2022 - Elsevier
There have been tremendous developments in theories and technologies in control for
smart systems. In this paper we review applications to various systems that are crucial for the …

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

Cautious reinforcement learning with logical constraints

M Hasanbeig, A Abate, D Kroening - arXiv preprint arXiv:2002.12156, 2020 - arxiv.org
This paper presents the concept of an adaptive safe padding that forces Reinforcement
Learning (RL) to synthesise optimal control policies while ensuring safety during the …

Deep reinforcement learning with temporal logics

M Hasanbeig, D Kroening, A Abate - … and Analysis of Timed Systems: 18th …, 2020 - Springer
The combination of data-driven learning methods with formal reasoning has seen a surge of
interest, as either area has the potential to bolstering the other. For instance, formal methods …

Compositional policy learning in stochastic control systems with formal guarantees

Đ Žikelić, M Lechner, A Verma… - Advances in …, 2024 - proceedings.neurips.cc
Reinforcement learning has shown promising results in learning neural network policies for
complicated control tasks. However, the lack of formal guarantees about the behavior of …