P Geibel, F Wysotzki - Journal of Artificial Intelligence Research, 2005 - jair.org
… , and formalize it as a constrained MDP with two criteria. The … We present a model free,
heuristic reinforcement learning … a feasible solution for the constrained problem that has a good …