Deep Q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 1220 | 2018 |
Noisy Networks for Exploration SL Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian ... International Conference on Learning Representations (ICLR), 2018 | 1172* | 2018 |
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ... arXiv preprint arXiv:1707.08817, 2017 | 779 | 2017 |
Modulating early visual processing by language H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville Advances in neural information processing systems 30, 2017 | 552 | 2017 |
GuessWhat?! Visual object discovery through multi-modal dialogue H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017 | 452 | 2017 |
AudioLM: a language modeling approach to audio generation Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ... IEEE/ACM transactions on audio, speech, and language processing 31, 2523-2533, 2023 | 357 | 2023 |
Listen and translate: A proof of concept for end-to-end speech-to-text translation A Bérard, O Pietquin, C Servan, L Besacier arXiv preprint arXiv:1612.01744, 2016 | 315 | 2016 |
A theory of regularized Markov decision processes M Geist, B Scherrer, O Pietquin International Conference on Machine Learning, 2160-2169, 2019 | 310 | 2019 |
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020 | 238 | 2020 |
What matters for on-policy deep actor-critic methods? A large-scale empirical study M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ... arXiv preprint arXiv:2006.05990, 2020 | 236* | 2020 |
End-to-end automatic speech translation of audiobooks A Bérard, L Besacier, AC Kocabiyikoglu, O Pietquin 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 220 | 2018 |
A probabilistic framework for dialog simulation and optimal strategy learning O Pietquin, T Dutoit IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006 | 203 | 2006 |
Learning from demonstrations for real world reinforcement learning T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... arXiv preprint arXiv:1704.03732, 2017, 2018 | 189 | 2018 |
What matters for on-policy deep actor-critic methods? a large-scale study M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ... International conference on learning representations, 2020 | 164 | 2020 |
Machine learning for spoken dialogue systems O Lemon, O Pietquin European Conference on Speech Communication and Technologies (Interspeech'07 …, 2007 | 150 | 2007 |
A framework for unsupervised learning of dialogue strategies O Pietquin Presses univ. de Louvain, 2005 | 148 | 2005 |
A survey on metrics for the evaluation of user simulations O Pietquin, H Hastie The knowledge engineering review 28 (1), 59-73, 2013 | 140 | 2013 |
Observe and look further: Achieving consistent performance on Atari T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ... arXiv preprint arXiv:1805.11593, 2018 | 136 | 2018 |
Primal Wasserstein imitation learning R Dadashi, L Hussenot, M Geist, O Pietquin arXiv preprint arXiv:2006.04678, 2020 | 128 | 2020 |
Kalman temporal differences M Geist, O Pietquin Journal of artificial intelligence research 39, 483-532, 2010 | 124 | 2010 |