Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap G Swamy, S Choudhury, JA Bagnell, ZS Wu 38th International Conference on Machine Learning (ICML), 2021 | 72* | 2021 |
On the Utility of Model Learning in HRI G Swamy, J Schulz, R Choudhury, D Hadfield-Menell, A Dragan arXiv preprint arXiv:1901.01291, 2019 | 61* | 2019 |
Scaled autonomy: Enabling human operators to control robot fleets G Swamy, S Reddy, S Levine, AD Dragan 2020 IEEE International Conference on Robotics and Automation (ICRA), 5942-5948, 2020 | 41 | 2020 |
A Minimaximalist Approach to Reinforcement Learning from Human Feedback G Swamy, C Dann, R Kidambi, ZS Wu, A Agarwal arXiv preprint arXiv:2401.04056, 2024 | 27 | 2024 |
Causal imitation learning under temporally correlated noise G Swamy, S Choudhury, D Bagnell, S Wu International Conference on Machine Learning, 20877-20890, 2022 | 22 | 2022 |
Sequence model imitation learning with unobserved contexts G Swamy, S Choudhury, J Bagnell, SZ Wu Advances in Neural Information Processing Systems 35, 17665-17676, 2022 | 18 | 2022 |
Inverse Reinforcement Learning without Reinforcement Learning G Swamy, S Choudhury, D Bagnell, S Wu International Conference on Machine Learning, 33299-33318, 2023 | 15 | 2023 |
Minimax Optimal Online Imitation Learning via Replay Estimation G Swamy, N Rajaraman, M Peng, S Choudhury, J Bagnell, SZ Wu, J Jiao, ... Advances in Neural Information Processing Systems 35, 7077-7088, 2022 | 13 | 2022 |
REBEL: Reinforcement Learning via Regressing Relative Rewards Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ... arXiv preprint arXiv:2404.16767, 2024 | 6 | 2024 |
Learning Shared Safety Constraints from Multi-task Demonstrations K Kim, G Swamy, Z Liu, D Zhao, S Choudhury, SZ Wu Advances in Neural Information Processing Systems 36, 2024 | 5 | 2024 |
Hybrid Inverse Reinforcement Learning J Ren, G Swamy, ZS Wu, JA Bagnell, S Choudhury arXiv preprint arXiv:2402.08848, 2024 | 4 | 2024 |
A Critique of Strictly Batch Imitation Learning G Swamy, S Choudhury, JA Bagnell, ZS Wu arXiv preprint arXiv:2110.02063, 2021 | 3 | 2021 |
Generative Models for Pose Transfer P Chao, A Li, G Swamy arXiv preprint arXiv:1806.09070, 2018 | 3 | 2018 |
EvIL: Evolution Strategies for Generalisable Imitation Learning S Sapora, G Swamy, C Lu, YW Teh, JN Foerster arXiv preprint arXiv:2406.11905, 2024 | | 2024 |
Multi-Agent Imitation Learning: Value is Easy, Regret is Hard J Tang, G Swamy, F Fang, ZS Wu arXiv preprint arXiv:2406.04219, 2024 | | 2024 |
Understanding Preference Fine-Tuning Through the Lens of Coverage Y Song, G Swamy, A Singh, JA Bagnell, W Sun arXiv preprint arXiv:2406.01462, 2024 | | 2024 |
Game-Theoretic Algorithms for Conditional Moment Matching G Swamy, S Choudhury, JA Bagnell, ZS Wu arXiv preprint arXiv:2208.09551, 2022 | | 2022 |
Learning with Humans in the Loop G Swamy https://www2.eecs.berkeley.edu/Pubs/TechRpts/2020/EECS-2020-76.html, 2020 | | 2020 |
Repeatability and Steadiness of Fingertip Force using Depth Feedback K Cao, G Swamy, SR Chaudhuri, P Cosman IEEE Signal Processing in Medicine and Biology Symposium 2016, 2016 | | 2016 |
Efficient Inverse Reinforcement Learning without Compounding Errors NE Dice, G Swamy, S Choudhury, W Sun ICML 2024 Workshop on Models of Human Feedback for AI Alignment, 0 | | |