Authors
Zhiyu An, Xianzhong Ding, Wan Du
Publication date
2024/2/20
Journal
arXiv preprint arXiv:2402.13419
Description
Recent years have seen an emerging interest in the trustworthiness of machine learning-based agents deployed in the wild, especially in robotics, to provide safety assurances for industry. Obtaining behavioral guarantees for these agents remains an important problem. In this work, we focus on guaranteeing that a model-based planning agent reaches a goal state within a specific future time step. We show that there exists a lower bound on the reward at the goal state, such that if the goal-state reward is below this bound, it is impossible to obtain such a guarantee. By extension, we show how to enforce preferences over multiple goals.