Urban driver: Learning to drive from real-world demonstrations using policy gradients

O Scheel, L Bergamini, M Wolczyk… - … on Robot Learning, 2022 - proceedings.mlr.press
In this work we are the first to present an offline policy gradient method for learning imitative
policies for complex urban driving from a large corpus of real-world demonstrations. This is …

Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

O Scheel, L Bergamini, M Wołczyk, B Osiński… - arXiv preprint arXiv …, 2021 - arxiv.org
In this work we are the first to present an offline policy gradient method for learning imitative
policies for complex urban driving from a large corpus of real-world demonstrations. This is …

Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

O Scheel, L Bergamini, M Wolczyk… - … Conference on Robot …, 2021 - openreview.net
In this work we are the first to present an offline policy gradient method for learning imitative
policies for complex urban driving from a large corpus of real-world demonstrations. This is …

Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

O Scheel, L Bergamini, M Wołczyk, B Osiński… - arXiv e …, 2021 - ui.adsabs.harvard.edu
In this work we are the first to present an offline policy gradient method for learning imitative
policies for complex urban driving from a large corpus of real-world demonstrations. This is …

[PDF][PDF] Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

O Scheel, L Bergamini, M Wołczyk, B Osinski… - proceedings.mlr.press
In this section we report additional results of deploying our trained policy to SDVs. Figure 2
shows our planner navigating through a multitude of challenging scenarios. For more results …

[PDF][PDF] Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients

O Scheel, L Bergamini, M Wołczyk, B Osinski… - openreview.net
In this work we are the first to present an offline policy gradient method for learning imitative
policies for complex urban driving from a large corpus of real-world demonstrations. This is …