systems to reach contrasts necessary to directly imaged exoplanets. Telescope vibrations
and the temporal error induced by the latency of the control loop limit the performance of
these systems. One way to reduce these effects is to use predictive control. We describe how
model-free reinforcement learning can be used to optimize a recurrent neural network
controller for closed-loop predictive control. First, we verify our proposed approach for tip–tilt …