Title | Authors | Venue | Cited by | Year |
Extrapolating beyond suboptimal demonstrations via inverse reinforcement learning from observations | D Brown, W Goo, P Nagarajan, S Niekum | International Conference on Machine Learning, 783-792, 2019 | 375 | 2019 |
ChainerRL: A Deep Reinforcement Learning Library | Y Fujita, P Nagarajan, T Kataoka, T Ishikawa | Journal of Machine Learning Research 22 (77), 1-14, 2021 | 133 | 2021 |
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | P Nagarajan, G Warnell, P Stone | AAAI 2019 Workshop on Reproducible AI, 2019 | 62 | 2019 |
The Impact of Nondeterminism on Reproducibility in Deep Reinforcement Learning | P Nagarajan, G Warnell, P Stone | 2nd Reproducibility in Machine Learning Workshop at ICML 2018, Stockholm, Sweden, 2018 | 32 | 2018 |
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators | Y Fujita, K Uenishi, A Ummadisingu, P Nagarajan, S Masuda, MY Castro | 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020 | 22 | 2020 |
Learning Latent State Spaces for Planning through Reward Prediction | A Havens, Y Ouyang, P Nagarajan, Y Fujita | Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019 | 7 | 2019 |
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning | ZW Hong, P Nagarajan, G Maeda | European Conference on Machine Learning and Principles and Practice of …, 2021 | 4 | 2021 |
Reconnaissance for Reinforcement Learning with Safety Constraints | S Maeda, H Watahiki, Y Ouyang, S Okada, M Koyama, P Nagarajan | European Conference on Machine Learning and Principles and Practice of …, 2021 | 2 | 2021 |
When is Offline Policy Selection Sample Efficient for Reinforcement Learning? | V Liu, P Nagarajan, A Patterson, M White | arXiv preprint arXiv:2312.02355, 2023 | | 2023 |
Swarm-inspired Reinforcement Learning via Collaborative Inter-agent Knowledge Distillation | ZW Hong, P Nagarajan, G Maeda | Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019 | | 2019 |
Nondeterminism as a Reproducibility Challenge for Deep Reinforcement Learning | PM Nagarajan | The University of Texas at Austin, 2018 | | 2018 |