Saurabh Kumar 个人学术档案

引用次数

	总计	2019 年至今
引用	1774	1759
h 指数	9	9
i10 指数	9	9

560

280

140

420

201820192020202120222023202413 65 166 273 419 546 290

开放获取的出版物数量

查看全部

3 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Marc G. BellemareGoogle Brain在 google.com 的电子邮件经过验证
Carles GeladaOpenAI在 openai.com 的电子邮件经过验证
Chelsea FinnStanford University, Google在 cs.stanford.edu 的电子邮件经过验证
Tianhe YuGoogle DeepMind在 google.com 的电子邮件经过验证
Pablo Samuel CastroGoogle在 google.com 的电子邮件经过验证
Ofir NachumOpenAI在 openai.com 的电子邮件经过验证
Jacob BuckmanPhD Student, Mila在 mail.mcgill.ca 的电子邮件经过验证
Robert DadashiGoogle DeepMind在 google.com 的电子邮件经过验证
Dale SchuurmansUniversity of Alberta, Google DeepMind在 cs.ualberta.ca 的电子邮件经过验证
Himanshu SahniStudent at Georgia Institute of Technology在 gatech.edu 的电子邮件经过验证
Benjamin Van RoyStanford University在 stanford.edu 的电子邮件经过验证
Mark RowlandResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Rémi MunosGoogle DeepMind在 inria.fr 的电子邮件经过验证
Will DabneyDeepMind在 google.com 的电子邮件经过验证
Larry HeckProfessor, Georgia Institute of Technology在 ieee.org 的电子邮件经过验证
Pararth ShahSenior Staff Software Engineer, Google在 google.com 的电子邮件经过验证
Dilek Hakkani-TurProfessor of Computer Science, Univ. Illinois Urbana-Champaign在 ieee.org 的电子邮件经过验证
Junfeng WenAssistant Professor, Carleton University在 carleton.ca 的电子邮件经过验证

关注

Saurabh Kumar

Stanford

在 stanford.edu 的电子邮件经过验证

Continual Learning Reinforcement Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Gradient surgery for multi-task learning T Yu, S Kumar, A Gupta, S Levine, K Hausman, C Finn Advances in Neural Information Processing Systems 33, 5824-5836, 2020	865	2020
Deepmdp: Learning continuous latent space models for representation learning C Gelada, S Kumar, J Buckman, O Nachum, MG Bellemare International conference on machine learning, 2170-2179, 2019	303	2019
Dopamine: A research framework for deep reinforcement learning PS Castro, S Moitra, C Gelada, S Kumar, MG Bellemare arXiv preprint arXiv:1812.06110, 2018	287	2018
Statistics and samples in distributional reinforcement learning M Rowland, R Dadashi, S Kumar, R Munos, MG Bellemare, W Dabney International Conference on Machine Learning, 5528-5536, 2019	96	2019
One solution is not all you need: Few-shot extrapolation via structured maxent rl S Kumar, A Kumar, S Levine, C Finn Advances in Neural Information Processing Systems 33, 8198-8210, 2020	85	2020
Federated control with hierarchical multi-agent deep reinforcement learning S Kumar, P Shah, D Hakkani-Tur, L Heck arXiv preprint arXiv:1712.08266, 2017	45	2017
Learning to compose skills H Sahni, S Kumar, F Tejani, C Isbell arXiv preprint arXiv:1711.11289, 2017	40	2017
Characterizing the gap between actor-critic and policy gradient J Wen, S Kumar, R Gummadi, D Schuurmans International Conference on Machine Learning, 11101-11111, 2021	17	2021
Maintaining Plasticity in Continual Learning via Regenerative Regularization S Kumar, H Marklund, B Van Roy arXiv preprint arXiv:2308.11958, 2023	15	2023
Continual learning as computationally constrained reinforcement learning S Kumar, H Marklund, A Rao, Y Zhu, HJ Jeon, Y Liu, B Van Roy arXiv preprint arXiv:2307.04345, 2023	9	2023
Multi-task reinforcement learning without interference T Yu, S Kumar, A Gupta, S Levine, K Hausman, C Finn Proc. Optim. Found. Reinforcement Learn. Workshop NeurIPS, 2019	6	2019
State space decomposition and subgoal creation for transfer in deep reinforcement learning H Sahni, S Kumar, F Tejani, Y Schroecker, C Isbell arXiv preprint arXiv:1705.08997, 2017	4	2017
Generalized policy updates for policy optimization S Kumar, R Dadashi, Z Ahmed, D Schuurmans, MG Bellemare NeurIPS 2019 Optimization Foundations for Reinforcement Learning Workshop, 2019	2	2019
Learning Continually by Spectral Regularization A Lewandowski, S Kumar, D Schuurmans, A György, MC Machado arXiv preprint arXiv:2406.06811, 2024		2024
A Parametric Class of Approximate Gradient Updates for Policy Optimization R Gummadi, S Kumar, J Wen, D Schuurmans International Conference on Machine Learning, 7998-8015, 2022		2022
Dopamine: A framework for flexible Reinforcement Learning research PS Castro, S Moitra, C Gelada, S Kumar, MG Bellemare

系统目前无法执行此操作，请稍后再试。

文章 1–16

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用