Interpretable concept bottlenecks to align reinforcement learning agents

Q Delfosse, S Sztwiertnia, M Rothermel… - arXiv preprint arXiv …, 2024 - arxiv.org
Goal misalignment, reward sparsity and difficult credit assignment are only a few of the many
issues that make it difficult for deep reinforcement learning (RL) agents to learn optimal …

Interpretable end-to-end Neurosymbolic Reinforcement Learning agents

N Grandien, Q Delfosse, K Kersting - arXiv preprint arXiv:2410.14371, 2024 - arxiv.org
Deep reinforcement learning (RL) agents rely on shortcut learning, preventing them from
generalizing to slightly different environments. To address this problem, symbolic method …

EXPIL: Explanatory Predicate Invention for Learning in Games

J Sha, H Shindo, Q Delfosse, K Kersting… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement learning (RL) has proven to be a powerful tool for training agents that excel
in various games. However, the black-box nature of neural network models often hinders our …

Reducing Carbon Emissions at Scale: Interpretable and Efficient to Implement Reinforcement Learning via Policy Extraction

J Goldfeder, J Sipple - Proceedings of the 11th ACM International …, 2024 - dl.acm.org
Machine Learning is a promising avenue for combating climate change by developing
control policies that can lead to greater efficiency, reducing cost and carbon footprint …

OCALM: Object-Centric Assessment with Language Models

T Kaufmann, J Blüml, A Wüst, Q Delfosse… - arXiv preprint arXiv …, 2024 - arxiv.org
Properly defining a reward signal to efficiently train a reinforcement learning (RL) agent is a
challenging task. Designing balanced objective functions from which a desired behavior can …

An Attentive Approach for Building Partial Reasoning Agents from Pixels

S Alver, D Precup - Transactions on Machine Learning Research - openreview.net
We study the problem of building reasoning agents that are able to generalize in an effective
manner. Towards this goal, we propose an end-to-end approach for building model-based …

Hierarchical Programmatic Option Framework

YA Lin, CT Lee, CH Yang, GT Liu, SH Sun - The Thirty-eighth Annual … - openreview.net
Deep reinforcement learning aims to learn deep neural network policies to solve large-scale
decision-making problems. However, approximating policies using deep neural networks …