processes. Using relative value functions, we present methods for evaluating optimal bias,
this leads to a probabilistic analysis which transforms the original reward problem into a
minimum average cost problem. The result is an explanation of how and why bias implicitly
discounts future rewards.