A Deeper Look at Experience Replay | S Zhang, RS Sutton | Deep Reinforcement Learning Symposium, NIPS 2017, 2017 | 340 | 2017 |
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values | S Zhang, B Liu, S Whiteson | ICML 2020, 2020 | 99 | 2020 |
Distributional Reinforcement Learning for Efficient Exploration | B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu | ICML 2019, 2019 | 89 | 2019 |
mlpack 3: a fast, flexible machine learning library | R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang | Journal of Open Source Software 3 (26), 726, 2018 | 85 | 2018 |
DAC: The Double Actor-Critic Architecture for Learning Options | S Zhang, S Whiteson | NeurIPS 2019, 2019 | 77 | 2019 |
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation | S Zhang, B Liu, H Yao, S Whiteson | ICML 2020, 2019 | 54 | 2019 |
Generalized Off-Policy Actor-Critic | S Zhang, W Boehmer, S Whiteson | NeurIPS 2019, 2019 | 51 | 2019 |
Breaking the Deadly Triad with a Target Network | S Zhang, H Yao, S Whiteson | ICML 2021, 2021 | 43 | 2021 |
Mean-variance policy iteration for risk-averse reinforcement learning | S Zhang, B Liu, S Whiteson | Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021 | 36 | 2021 |
Average-Reward Off-Policy Policy Evaluation with Function Approximation | S Zhang, Y Wan, RS Sutton, S Whiteson | ICML 2021, 2021 | 34 | 2021 |
QUOTA: The Quantile Option Architecture for Reinforcement Learning | S Zhang, B Mavrin, L Kong, B Liu, H Yao | AAAI 2019, 2018 | 32 | 2018 |
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search | S Zhang, H Chen, H Yao | AAAI 2019, 2018 | 31 | 2018 |
Modularized Implementation of Deep RL Algorithms in PyTorch | S Zhang | | 30* | 2018 |
A deep neural network for modeling music | P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang | Proceedings of the 5th ACM on International Conference on Multimedia …, 2015 | 27 | 2015 |
Deep Residual Reinforcement Learning | S Zhang, W Boehmer, S Whiteson | AAMAS 2020, 2019 | 26 | 2019 |
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... | arXiv preprint arXiv:2308.03526, 2023 | 22* | 2023 |
Learning expected emphatic traces for deep RL | R Jiang, S Zhang, V Chelu, A White, H van Hasselt | Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022 | 14 | 2022 |
Learning Retrospective Knowledge with Reverse Reinforcement Learning | S Zhang, V Veeriah, S Whiteson | NeurIPS 2020, 2020 | 13 | 2020 |
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards | Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu | AAAI 2020, 2019 | 13 | 2019 |
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control | S Zhang, OR Zaiane | Deep Reinforcement Learning Symposium, NIPS 2017, 2017 | 12 | 2017 |