Stephen McAleer: Academic Profile

Cited by

	All	Since 2019
Citations	2665	2660
h-index	20	20
i10-index	30	30

Citations per year

Year	2019	2020	2021	2022	2023	2024
Citations	60	182	323	515	864	714

Public access

16 articles available
0 articles not available

Based on funding mandates

Co-authors

Pierre Baldi, Professor, University of California, Irvine (verified email at ics.uci.edu)
Yaodong Yang, BOYA (博雅) Assistant Professor at Peking University (verified email at pku.edu.cn)
Roy Fox, Assistant Professor, UC Irvine (verified email at uci.edu)
JB Lanier, UC Irvine (verified email at uci.edu)
Alexander Shmakov, University of California Irvine (verified email at uci.edu)
Forest Agostinelli, Assistant Professor at the University of South Carolina (verified email at cse.sc.edu)
Tuomas Sandholm, Angel Jordan University Professor of Computer Science, Carnegie Mellon University (verified email at cs.cmu.edu)
Jun Wang, Professor, Computer Science, University College London (verified email at cs.ucl.ac.uk)
Oliver Slumbers, University College London (verified email at ucl.ac.uk)
Kevin A. Wang, Brown University (verified email at kevinwang.us)
Gabriele Farina, Assistant Professor, Massachusetts Institute of Technology (verified email at mit.edu)
Marc Lanctot, Research Scientist, Google DeepMind (verified email at google.com)
Shauharda (Shaw) Khadka, Senior Applied Scientist at Microsoft (verified email at microsoft.com)
Somdeb Majumdar, Intel Corp (verified email at intel.com)
Kagan Tumer, Oregon State University (verified email at oregonstate.edu)
Ioannis Panageas, Assistant Professor, University of California, Irvine (verified email at ics.uci.edu)
Pieter Abbeel, UC Berkeley | Covariant (verified email at cs.berkeley.edu)
Alexander Ihler, University of California, Irvine (verified email at ics.uci.edu)
Michael Dennis, Google DeepMind (verified email at cs.berkeley.edu)
Karl Tuyls, Co-Founder at H (chief Research & Operations), ex-Google DeepMind, Prof at University of Liverpool (verified email at hcompany.ai)


Stephen McAleer

OpenAI

Verified email at openai.com

Artificial Intelligence


Title	Cited by	Year
Highly accurate machine fault diagnosis using deep transfer learning. S Shao, S McAleer, R Yan, P Baldi. IEEE Transactions on Industrial Informatics 15 (4), 2446-2455, 2018	1079	2018
Solving the Rubik’s cube with deep reinforcement learning and search. F Agostinelli, S McAleer, A Shmakov*, P Baldi. Nature Machine Intelligence 1 (8), 356-363, 2019	221	2019
Language Models can Solve Computer Tasks. G Kim, P Baldi, S McAleer. Neural Information Processing Systems (NeurIPS), 2023	174	2023
Mastering the game of stratego with model-free multiagent reinforcement learning. J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	156	2022
Llemma: An Open Language Model for Mathematics. Z Azerbayev, H Schoelkopf, K Paster, M Dos Santos, S McAleer, AQ Jiang, ... International Conference on Learning Representations (ICLR), 2023	98	2023
Solving the Rubik's Cube with Approximate Policy Iteration. S McAleer, F Agostinelli, A Shmakov*, P Baldi. International Conference on Learning Representations (ICLR), 2018	96*	2018
AI Alignment: A Comprehensive Survey. J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ... arXiv preprint arXiv:2310.19852, 2023	88	2023
Pipeline PSRO: A scalable approach for finding approximate nash equilibria in large games. S McAleer, J Lanier, R Fox, P Baldi. 34th Conference on Neural Information Processing Systems (NeurIPS), 2020	75	2020
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning. Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ... 36th Conference on Neural Information Processing Systems (NeurIPS 2022 …, 2022	66	2022
Evolutionary reinforcement learning for sample-efficient multiagent coordination. S Majumdar, S Khadka, S Miret, S McAleer, K Tumer. International Conference on Machine Learning (ICML), 2020	63	2020
XDO: A double oracle algorithm for extensive-form games. S McAleer, J Lanier, P Baldi, R Fox. Advances in Neural Information Processing Systems (NeurIPS), 2021	53	2021
Independent Natural Policy Gradient Always Converges in Markov Potential Games. R Fox, S McAleer, W Overman, I Panageas. AISTATS 2022, 2021	47	2021
Neural auto-curricula in two-player zero-sum games. X Feng, O Slumbers, Z Wan, B Liu, S McAleer, Y Wen, J Wang, Y Yang. Advances in Neural Information Processing Systems (NeurIPS), 2021	46*	2021
Online Double Oracle. LC Dinh, Y Yang, S McAleer, NP Nieves, O Slumbers, Z Tian, DH Mguni, ... Transactions on Machine Learning Research, 2021	29	2021
Deep-learning-based reconstruction of the neutrino direction and energy for in-ice radio detectors. C Glaser, S McAleer, S Stjärnholm, P Baldi, SW Barwick. Astroparticle Physics 145, 102781, 2023	27*	2023
White Paper: ARIANNA-200 high energy neutrino telescope. A Anker, P Baldi, SW Barwick, D Bergman, H Bernhoff, DZ Besson, ... arXiv preprint arXiv:2004.09841, 2020	26	2020
Alphazero-like tree-search can guide large language model decoding and training. X Feng, Z Wan, M Wen, S McAleer, Y Wen, W Zhang, J Wang. arXiv preprint arXiv:2309.17179, 2023	23	2023
Curiosity-Driven Multi-Criteria Hindsight Experience Replay. J Lanier, S McAleer, P Baldi. NeurIPS 2019 Deep RL Workshop, 2019	22	2019
Reducing variance in temporal-difference value estimation via ensemble of deep networks. L Liang, Y Xu, S McAleer, D Hu, A Ihler, P Abbeel, R Fox. International Conference on Machine Learning (ICML), 2022	21*	2022
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games. S McAleer, JB Lanier, K Wang, P Baldi, R Fox, T Sandholm. International Conference on Learning Representations (ICLR), 2022	20*	2022
