Shangtong Zhang 个人学术档案

引用次数

	总计	2019 年至今
引用	1185	1147
h 指数	16	16
i10 指数	23	21

300

150

225

201720182019202020212022202320245 27 62 159 227 277 291 127

开放获取的出版物数量

查看全部

10 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo在 cs.ox.ac.uk 的电子邮件经过验证
Hengshuai YaoSony AI在 ualberta.ca 的电子邮件经过验证
Richard S. SuttonKeen, Amii, and University of Alberta在 richsutton.com 的电子邮件经过验证
Bo LiuPhD, AAAI SM, IEEE SM在 cs.umass.edu 的电子邮件经过验证
Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, Amii在 ualberta.ca 的电子邮件经过验证
Wendelin BöhmerSequential Decision Making Group, Delft University of Technology在 tudelft.nl 的电子邮件经过验证
Ray JiangResearch Scientist, DeepMind在 google.com 的电子邮件经过验证
Remi Tachet des Combes在 alpacaml.com 的电子邮件经过验证
Romain LarocheMicrosoft Research在 polytechnique.org 的电子邮件经过验证
Marcus EdelComputer Science, Free University of Berlin在 fu-berlin.de 的电子邮件经过验证
Ryan R. CurtinFree agent在 ratml.org 的电子邮件经过验证
Nando de FreitasCIFAR & DeepMind在 google.com 的电子邮件经过验证
Tom Le PaineStaff Research Scientist at Google DeepMind在 google.com 的电子邮件经过验证
Julian SchrittwieserDeepMind在 furidamu.org 的电子邮件经过验证
Roman RingDeepMind在 deepmind.com 的电子邮件经过验证
Petko GeorgievGoogle DeepMind, University of Cambridge在 cam.ac.uk 的电子邮件经过验证
Michael MathieuDeepMind在 google.com 的电子邮件经过验证
Aäron van den OordGoogle DeepMind在 google.com 的电子邮件经过验证
Sergio Gómez ColmenarejoResearch Engineer, DeepMind在 google.com 的电子邮件经过验证
Caglar GulcehreAI Researcher, Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMind在 google.com 的电子邮件经过验证

关注

Shangtong Zhang

University of Virginia

在 virginia.edu 的电子邮件经过验证 - 首页

reinforcement learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
A Deeper Look at Experience Replay S Zhang, RS Sutton Deep Reinforcement Learning Symposium, NIPS 2017, 2017	340	2017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values S Zhang, B Liu, S Whiteson ICML 2020, 2020	99	2020
Distributional Reinforcement Learning for Efficient Exploration B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu ICML 2019, 2019	89	2019
mlpack 3: a fast, flexible machine learning library R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang Journal of Open Source Software 3 (26), 726, 2018	85	2018
DAC: The Double Actor-Critic Architecture for Learning Options S Zhang, S Whiteson NeurIPS 2019, 2019	77	2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation S Zhang, B Liu, H Yao, S Whiteson ICML 2020, 2019	54	2019
Generalized Off-Policy Actor-Critic S Zhang, W Boehmer, S Whiteson NeurIPS 2019, 2019	51	2019
Breaking the Deadly Triad with a Target Network S Zhang, H Yao, S Whiteson ICML 2021, 2021	43	2021
Mean-variance policy iteration for risk-averse reinforcement learning S Zhang, B Liu, S Whiteson Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021	36	2021
Average-Reward Off-Policy Policy Evaluation with Function Approximation S Zhang, Y Wan, RS Sutton, S Whiteson ICML 2021, 2021	34	2021
QUOTA: The Quantile Option Architecture for Reinforcement Learning S Zhang, B Mavrin, L Kong, B Liu, H Yao AAAI 2019, 2018	32	2018
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search S Zhang, H Chen, H Yao AAAI 2019, 2018	31	2018
Modularized Implementation of Deep RL Algorithms in PyTorch S Zhang	30*	2018
A deep neural network for modeling music P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang Proceedings of the 5th ACM on International Conference on Multimedia …, 2015	27	2015
Deep Residual Reinforcement Learning S Zhang, W Boehmer, S Whiteson AAMAS 2020, 2019	26	2019
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... arXiv preprint arXiv:2308.03526, 2023	22*	2023
Learning expected emphatic traces for deep RL R Jiang, S Zhang, V Chelu, A White, H van Hasselt Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022	14	2022
Learning Retrospective Knowledge with Reverse Reinforcement Learning S Zhang, V Veeriah, S Whiteson NeurIPS 2020, 2020	13	2020
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu AAAI 2020, 2019	13	2019
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control S Zhang, OR Zaiane Deep Reinforcement Learning Symposium, NIPS 2017, 2017	12	2017

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用