A comprehensive survey of multiagent reinforcement learning L Busoniu, R Babuska, B De Schutter IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and …, 2008 | 2762 | 2008 |
Reinforcement learning and dynamic programming using function approximators L Busoniu, R Babuska, B De Schutter, D Ernst CRC press, 2017 | 1242 | 2017 |
A survey of actor-critic reinforcement learning: Standard and natural policy gradients I Grondman, L Busoniu, GAD Lopes, R Babuska IEEE Transactions on Systems, Man, and Cybernetics, part C (applications and …, 2012 | 1121 | 2012 |
Multi-agent reinforcement learning: An overview L Buşoniu, R Babuška, B De Schutter Innovations in multi-agent systems and applications-1, 183-221, 2010 | 943 | 2010 |
Reinforcement learning for control: Performance, stability, and deep approximators L Buşoniu, T De Bruin, D Tolić, J Kober, I Palunko Annual Reviews in Control 46, 8-28, 2018 | 402 | 2018 |
Experience replay for real-time reinforcement learning control S Adam, L Busoniu, R Babuska IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and …, 2011 | 306 | 2011 |
Vision and control for UAVs: A survey of general methods and of inexpensive platforms for infrastructure inspection K Máthé, L Buşoniu Sensors 15 (7), 14887-14916, 2015 | 241 | 2015 |
Multi-agent reinforcement learning: A survey L Busoniu, R Babuska, B De Schutter 2006 9th International Conference on Control, Automation, Robotics and …, 2006 | 168 | 2006 |
Efficient model learning methods for actor–critic control I Grondman, M Vaandrager, L Busoniu, R Babuska, E Schuitema IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 42 …, 2011 | 128 | 2011 |
Stability analysis of discrete-time infinite-horizon optimal control with discounted cost R Postoyan, L Buşoniu, D Nešić, J Daafouz IEEE Transactions on Automatic Control 62 (6), 2736-2749, 2016 | 110 | 2016 |
Cross-entropy optimization of control policies with adaptive basis functions L Busoniu, D Ernst, B De Schutter, R Babuska IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 41 …, 2010 | 97 | 2010 |
Online least-squares policy iteration for reinforcement learning control L Buşoniu, D Ernst, B De Schutter, R Babuška Proceedings of the 2010 American Control Conference, 486-491, 2010 | 91 | 2010 |
Approximate reinforcement learning: An overview L Buşoniu, D Ernst, B De Schutter, R Babuška 2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011 | 83 | 2011 |
Decentralized reinforcement learning control of a robotic manipulator L Busoniu, B De Schutter, R Babuska 2006 9th International Conference on Control, Automation, Robotics and …, 2006 | 75 | 2006 |
Approximate dynamic programming with a fuzzy parameterization L Buşoniu, D Ernst, B De Schutter, R Babuška Automatica 46 (5), 804-814, 2010 | 71 | 2010 |
Control delay in reinforcement learning for real-time dynamic systems: A memoryless approach E Schuitema, L Buşoniu, R Babuška, P Jonker 2010 IEEE/RSJ international conference on intelligent robots and systems …, 2010 | 66 | 2010 |
Railway track following with the AR. Drone using vanishing point detection E Páli, K Mathe, L Tamas, L Buşoniu 2014 IEEE International Conference on Automation, Quality and Testing …, 2014 | 56 | 2014 |
Optimistic planning for Markov decision processes L Busoniu, R Munos Artificial Intelligence and Statistics, 182-189, 2012 | 51 | 2012 |
Spectral normalisation for deep reinforcement learning: an optimisation perspective F Gogianu, T Berariu, MC Rosca, C Clopath, L Busoniu, R Pascanu International Conference on Machine Learning, 3734-3744, 2021 | 43 | 2021 |
Least-squares methods for policy iteration L Buşoniu, A Lazaric, M Ghavamzadeh, R Munos, R Babuška, ... Reinforcement learning: state-of-the-art, 75-109, 2012 | 37 | 2012 |