Model-based reinforcement learning for atari L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ... arXiv preprint arXiv:1903.00374, 2019 | 936 | 2019 |
Simulation-based reinforcement learning for real-world autonomous driving B Osiński, A Jakubowski, P Zięcina, P Miłoś, C Galias, S Homoceanu, ... 2020 IEEE international conference on robotics and automation (ICRA), 6411-6418, 2020 | 140 | 2020 |
Learning to run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments Ł Kidziński, SP Mohanty, CF Ong, Z Huang, S Zhou, A Pechenko, ... The NIPS'17 Competition: Building Intelligent Systems, 121-153, 2018 | 94 | 2018 |
Inequality decomposition by population subgroups for ordinal data M Kobus, P Miłoś Journal of Health Economics 31 (1), 15-21, 2012 | 86 | 2012 |
Continual world: A robotic benchmark for continual reinforcement learning M Wołczyk, M Zając, R Pascanu, Ł Kuciński, P Miłoś Advances in Neural Information Processing Systems 34, 28496-28510, 2021 | 71 | 2021 |
Thor: Wielding hammers to integrate language models and automated theorem provers AQ Jiang, W Li, S Tworkowski, K Czechowski, T Odrzygóźdź, P Miłoś, ... Advances in Neural Information Processing Systems 35, 8360-8373, 2022 | 58 | 2022 |
Maximal displacement of a supercritical branching random walk in a time-inhomogeneous random environment B Mallein, P Miłoś | 57* | |
Focused transformer: Contrastive training for context scaling S Tworkowski, K Staniszewski, M Pacek, Y Wu, H Michalewski, P Miłoś Advances in Neural Information Processing Systems 36, 2024 | 56 | 2024 |
CLT for Ornstein-Uhlenbeck branching particle system R Adamczak, P Miłoś | 34* | 2015 |
Subgoal search for complex reasoning tasks K Czechowski, T Odrzygóźdź, M Zbysiński, M Zawalski, K Olejnik, Y Wu, ... Advances in Neural Information Processing Systems 34, 624-638, 2021 | 27 | 2021 |
CARLA Real Traffic Scenarios--novel training ground and benchmark for autonomous driving B Osiński, P Miłoś, A Jakubowski, P Zięcina, M Martyniak, C Galias, ... arXiv preprint arXiv:2012.11329, 2020 | 26 | 2020 |
Disentangling transfer in continual reinforcement learning M Wolczyk, M Zając, R Pascanu, Ł Kuciński, P Miłoś Advances in Neural Information Processing Systems 35, 6304-6317, 2022 | 24 | 2022 |
Moe-mamba: Efficient selective state space models with mixture of experts M Pióro, K Ciebiera, K Król, J Ludziejewski, S Jaszczur arXiv preprint arXiv:2401.04081, 2024 | 23 | 2024 |
The random interchange process on the hypercube R Kotecký, P Miłoś, D Ueltschi | 22 | 2016 |
Occupation time fluctuations of Poisson and equilibrium finite variance branching systems P Milos arXiv preprint math/0512414, 2005 | 22 | 2005 |
Delocalization of two-dimensional random surfaces with hard-core constraints P Miłoś, R Peled Communications in Mathematical Physics 340 (1), 1-46, 2015 | 21 | 2015 |
On truncated variation, upward truncated variation and downward truncated variation for diffusions RM Łochowski, P Miłoś Stochastic Processes and their Applications 123 (2), 446-474, 2013 | 20 | 2013 |
Magnushammer: A transformer-based approach to premise selection M Mikuła, S Antoniak, S Tworkowski, AQ Jiang, JP Zhou, C Szegedy, ... arXiv preprint arXiv:2303.04488, 2023 | 19 | 2023 |
-Statistics of Ornstein–Uhlenbeck Branching Particle System R Adamczak, P Miłoś Journal of Theoretical Probability 27 (4), 1071-1111, 2014 | 17 | 2014 |
Occupation times of subcritical branching immigration systems with Markov motions P Miłoś Stochastic processes and their applications 119 (10), 3211-3237, 2009 | 16 | 2009 |