O Scheel, L Bergamini, M Wołczyk, B Osiński… - arXiv preprint arXiv …, 2021 - arxiv.org
In this work we are the first to present an offline policy gradient method for learning imitative
policies for complex urban driving from a large corpus of real-world demonstrations. This is …