Online learning in Markov decision processes with changing cost sequences

T Dick, A György, C Szepesvári - … Conference on Machine …, 2014 - proceedings.mlr.press
In this paper we consider online learning in finite Markov decision processes (MDPs) with
changing cost sequences under full and bandit-information. We propose to view this …

Online learning in Markov decision processes with changing cost sequences

T Dick, A György, C Szepesvári - … of the 31st International Conference on …, 2014 - dl.acm.org
In this paper we consider online learning in finite Markov decision processes (MDPs) with
changing cost sequences under full and bandit-information. We propose to view this …

Online Learning in Markov Decision Processes with Changing Cost Sequences

T Dick, A György, C Szepesvári - International Conference on …, 2014 - jmlr.csail.mit.edu
In this paper we consider online learning in finite Markov decision processes (MDPs) with
changing cost sequences under full and bandit-information. We propose to view this …
