Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning. R Das, S Dhuliawala, M Zaheer, L Vilnis, I Durugkar, A Krishnamurthy, et al. arXiv preprint arXiv:1711.05851, 2017. Cited by 565.
Generative Multi-Adversarial Networks. I Durugkar, I Gemp, S Mahadevan. International Conference on Learning Representations, 2017. Cited by 455.
Cohort intelligence: a self supervised learning behavior. AJ Kulkarni, IP Durugkar, M Kumar. 2013 IEEE International Conference on Systems, Man, and Cybernetics, 1396-1400, 2013. Cited by 132.
Predictive off-policy policy evaluation for nonstationary decision problems, with applications to digital marketing. P Thomas, G Theocharous, M Ghavamzadeh, I Durugkar, E Brunskill. Proceedings of the AAAI Conference on Artificial Intelligence 31 (2), 4740-4745, 2017. Cited by 64.
An imitation from observation approach to transfer learning with dynamics mismatch. S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone. Advances in Neural Information Processing Systems 33, 3917-3929, 2020. Cited by 43.
Adversarial intrinsic motivation for reinforcement learning. I Durugkar, M Tec, S Niekum, P Stone. Advances in Neural Information Processing Systems 34, 8622-8636, 2021. Cited by 30.
Deep reinforcement learning with macro-actions. IP Durugkar, C Rosenbaum, S Dernbach, S Mahadevan. arXiv preprint arXiv:1606.04615, 2016. Cited by 27.
Balancing individual preferences and shared objectives in multiagent reinforcement learning. I Durugkar, E Liebman, P Stone. International Joint Conference on Artificial Intelligence, 2020. Cited by 20.
Reducing sampling error in batch temporal difference learning. B Pavse, I Durugkar, J Hanna, P Stone. International Conference on Machine Learning, 7543-7552, 2020. Cited by 14.
TD learning with constrained gradients. I Durugkar, P Stone. 2018. Cited by 14.
Towards a real-time, low-resource, end-to-end object detection pipeline for robot soccer. SK Narayanaswami, M Tec, I Durugkar, S Desai, B Masetty, S Narvekar, et al. Robot World Cup, 62-74, 2022. Cited by 6.
Wasserstein distance maximizing intrinsic control. I Durugkar, S Hansen, S Spencer, V Mnih. arXiv preprint arXiv:2110.15331, 2021. Cited by 4.
Unmixing in the presence of nuisances with deep generative models. M Parente, I Gemp, I Durugkar. 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS …, 2017. Cited by 4.
An imitation from observation approach to sim-to-real transfer. S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone, A Sony. 2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics, RSS, 2020. Cited by 3.
ABC: Adversarial Behavioral Cloning for Offline Mode-Seeking Imitation Learning. E Hudson, I Durugkar, G Warnell, P Stone. arXiv preprint arXiv:2211.04005, 2022. Cited by 2.
DM²: Distributed multi-agent reinforcement learning via distribution matching. C Wang. 2022. Cited by 2.
Multi-preference actor critic. I Durugkar, M Hausknecht, A Swaminathan, P MacAlpine. arXiv preprint arXiv:1904.03295, 2019. Cited by 2.
Inverting variational autoencoders for improved generative accuracy. I Gemp, I Durugkar, M Parente, MD Dyar, S Mahadevan. arXiv preprint arXiv:1608.05983, 2016. Cited by 2.
f-Policy Gradients: A General Framework for Goal-Conditioned RL using f-Divergences. S Agarwal, I Durugkar, P Stone, A Zhang. Advances in Neural Information Processing Systems 36, 2024. Cited by 1.
Estimation and control of visitation distributions for reinforcement learning. I Durugkar. 2023. Cited by 1.