J Beitelspacher, J Fager, G Henriques… - School of Computer …, 2006 - mcgovern-fagg.org
This paper compares the performance of policy gradient techniques with traditional value
function approximation methods for reinforcement learning in a difficult problem domain. We …