B Gaudeta, R Linaresc, R Furfaroa - arXiv preprint arXiv …, 2019 - dspace.mit.edu
This paper proposes a novel adaptive guidance system developed using reinforcement
meta-learning with a recurrent policy and value function approximator. The use of recurrent …