查看文章

mlr.press 中的 [PDF]

Black-box policy search with probabilistic programs

作者

Jan-Willem Vandemeent, Brooks Paige, David Tolpin, Frank Wood

发表日期

2016/5/2

研讨会论文

Artificial Intelligence and Statistics

页码范围

1195-1204

出版商

PMLR

简介

In this work we show how to represent policies as programs: that is, as stochastic simulators with tunable parameters. To learn the parameters of such policies we develop connections between black box variational inference and existing policy search approaches. We then explain how such learning can be implemented in a probabilistic programming system. Using our own novel implementation of such a system we demonstrate both conciseness of policy representation and automatic policy parameter learning for a set of canonical reinforcement learning problems.

引用总数

被引用次数：30

201620172018201920202021202220237 1 5 6 4 4 1 1

学术搜索中的文章

Black-box policy search with probabilistic programs

JW Vandemeent, B Paige, D Tolpin, F Wood - Artificial Intelligence and Statistics, 2016

被引用次数：30 相关文章所有 13 个版本