Deep reinforcement learning that matters P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 2324 | 2018 |
Building end-to-end dialogue systems using generative hierarchical neural network models I Serban, A Sordoni, Y Bengio, A Courville, J Pineau Proceedings of the AAAI conference on artificial intelligence 30 (1), 2016 | 2063 | 2016 |
An introduction to deep reinforcement learning V François-Lavet, P Henderson, R Islam, MG Bellemare, J Pineau Foundations and Trends® in Machine Learning 11 (3-4), 219-354, 2018 | 1697 | 2018 |
How not to evaluate your dialogue system: An empirical study of unsupervised evaluation metrics for dialogue response generation CW Liu, R Lowe, IV Serban, M Noseworthy, L Charlin, J Pineau arXiv preprint arXiv:1603.08023, 2016 | 1506 | 2016 |
Point-based value iteration: An anytime algorithm for POMDPs J Pineau, G Gordon, S Thrun IJCAI 3, 1025-1032, 2003 | 1455 | 2003 |
A hierarchical latent variable encoder-decoder model for generating dialogues I Serban, A Sordoni, R Lowe, L Charlin, J Pineau, A Courville, Y Bengio Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017 | 1262 | 2017 |
The Ubuntu dialogue corpus: A large dataset for research in unstructured multi-turn dialogue systems R Lowe, N Pow, I Serban, J Pineau arXiv preprint arXiv:1506.08909, 2015 | 1119 | 2015 |
Towards robotic assistants in nursing homes: Challenges and results J Pineau, M Montemerlo, M Pollack, N Roy, S Thrun Robotics and autonomous systems 42 (3-4), 271-281, 2003 | 861 | 2003 |
A survey of point-based POMDP solvers G Shani, J Pineau, R Kaplow Autonomous Agents and Multi-Agent Systems 27, 1-51, 2013 | 748 | 2013 |
Online planning algorithms for POMDPs S Ross, J Pineau, S Paquet, B Chaib-Draa Journal of Artificial Intelligence Research 32, 663-704, 2008 | 720 | 2008 |
An actor-critic algorithm for sequence prediction D Bahdanau, P Brakel, K Xu, A Goyal, R Lowe, J Pineau, A Courville, ... arXiv preprint arXiv:1607.07086, 2016 | 704 | 2016 |
Bayesian reinforcement learning: A survey M Ghavamzadeh, S Mannor, J Pineau, A Tamar Foundations and Trends® in Machine Learning 8 (5-6), 359-483, 2015 | 541 | 2015 |
Anytime point-based approximations for large POMDPs J Pineau, G Gordon, S Thrun Journal of Artificial Intelligence Research 27, 335-380, 2006 | 516 | 2006 |
Pearl: A mobile robotic assistant for the elderly ME Pollack, L Brown, D Colbry, C Orosz, B Peintner, S Ramakrishnan, ... AAAI workshop on automation as eldercare 2002, 85-91, 2002 | 491 | 2002 |
Towards the systematic reporting of the energy and carbon footprints of machine learning P Henderson, J Hu, J Romoff, E Brunskill, D Jurafsky, J Pineau Journal of Machine Learning Research 21 (248), 1-43, 2020 | 469 | 2020 |
Experiences with a mobile robotic guide for the elderly M Montemerlo, J Pineau, N Roy, S Thrun, V Verma AAAI/IAAI 2002, 587-592, 2002 | 461 | 2002 |
A survey of available corpora for building data-driven dialogue systems IV Serban, R Lowe, P Henderson, L Charlin, J Pineau arXiv preprint arXiv:1512.05742, 2015 | 428 | 2015 |
Towards an automatic Turing test: Learning to evaluate dialogue responses R Lowe, M Noseworthy, IV Serban, N Angelard-Gontier, Y Bengio, ... arXiv preprint arXiv:1708.07149, 2017 | 416 | 2017 |
Spoken dialogue management using probabilistic reasoning N Roy, J Pineau, S Thrun Proceedings of the 38th annual meeting of the association for computational …, 2000 | 415 | 2000 |
TarMAC: Targeted multi-agent communication A Das, T Gervet, J Romoff, D Batra, D Parikh, M Rabbat, J Pineau International Conference on Machine Learning, 1538-1546, 2019 | 400 | 2019 |