J Seznec,
P Menard,
A Lazaric… - … Conference on Artificial …, 2020 - proceedings.mlr.press
In many application domains (eg, recommender systems, intelligent tutoring systems), the
rewards associated to the available actions tend to decrease over time. This decay is either …