M Tagorti, B Scherrer - International Conference on Machine …, 2015 - proceedings.mlr.press
We consider LSTD (λ), the least-squares temporal-difference algorithm with eligibility traces
algorithm proposed by Boyan (2002). It computes a linear approximation of the value …