关注
Oleg Klimov
Oleg Klimov
未知所在单位机构
在 smallcloud.tech 的电子邮件经过验证
标题
引用次数
引用次数
年份
Proximal policy optimization algorithms
J Schulman, F Wolski, P Dhariwal, A Radford, O Klimov
arXiv preprint arXiv:1707.06347, 2017
190432017
Exploration by random network distillation
Y Burda, H Edwards, A Storkey, O Klimov
arXiv preprint arXiv:1810.12894, 2018
13852018
Openai baselines
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
10262017
Stable baselines
A Hill, A Raffin, M Ernestus, A Gleave, A Kanervisto, R Traore, P Dhariwal, ...
8962018
Quantifying generalization in reinforcement learning
K Cobbe, O Klimov, C Hesse, T Kim, J Schulman
International conference on machine learning, 1282-1289, 2019
6902019
Gotta learn fast: A new benchmark for generalization in rl
A Nichol, V Pfau, C Hesse, O Klimov, J Schulman
arXiv preprint arXiv:1804.03720, 2018
2102018
Phasic policy gradient
KW Cobbe, J Hilton, O Klimov, J Schulman
International Conference on Machine Learning, 2020-2027, 2021
1692021
Proximal policy optimization algorithms. CoRR abs/1707.06347 (2017)
J Schulman, F Wolski, P Dhariwal, A Radford, O Klimov
arXiv preprint arXiv:1707.06347, 2017
1442017
Openai baselines (2017)
P Dhariwal, C Hesse, O Klimov, A Nichol, M Plappert, A Radford, ...
URL https://github. com/openai/baselines, 2016
612016
Proximal policy optimization
J Schulman, O Klimov, F Wolski, P Dhariwal, A Radford
arXiv preprint arXiv:1707.06347, 2017
392017
Carracing-v0
O Klimov
URL https://gym. openai. com/envs/CarRacing-v0, 2016
222016
Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
I Kanitscheider, J Huizinga, D Farhi, WH Guss, B Houghton, R Sampedro, ...
arXiv preprint arXiv:2106.14876, 2021
182021
Carracing-v0. 2016
O Klimov
URL https://gym. openai. com/envs/CarRacing-v0, 2016
102016
系统目前无法执行此操作,请稍后再试。
文章 1–13