Ryan D'Orazio 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	204	204
h 指数	7	7
i10 指数	6	6

0

70

35

2019202020212022202320242 5 23 58 61 54

开放获取的出版物数量

2 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

Ryan D'Orazio

Ryan D'Orazio

Mila, Université de Montréal

在 mila.quebec 的电子邮件经过验证 - 首页

Optimization Reinforcement Learning Game Theory


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
A unified approach to reinforcement learning, quantal response equilibria, and two-player zero-sum games S Sokota, R D'Orazio, JZ Kolter, N Loizou, M Lanctot, I Mitliagkas, ... arXiv preprint arXiv:2206.05825, 2022	50	2022
Hindsight and sequential rationality of correlated play D Morrill, R D'Orazio, R Sarfati, M Lanctot, JR Wright, AR Greenwald, ... Proceedings of the AAAI Conference on Artificial Intelligence 35 (6), 5584-5594, 2021	37	2021
Efficient deviation types and learning for hindsight rationality in extensive-form games D Morrill, R D’Orazio, M Lanctot, JR Wright, M Bowling, AR Greenwald International Conference on Machine Learning, 7818-7828, 2021	35	2021
Stochastic mirror descent: Convergence analysis and adaptive variants via the mirror stochastic polyak stepsize R D'Orazio, N Loizou, I Laradji, I Mitliagkas arXiv preprint arXiv:2110.15412, 2021	31	2021
Solving common-payoff games with approximate policy iteration S Sokota, E Lockhart, F Timbers, E Davoodi, R D'Orazio, N Burch, ... Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9695-9703, 2021	16	2021
Alternative Function Approximation Parameterizations for Solving Games: An Analysis of -Regression Counterfactual Regret Minimization R D'Orazio, D Morrill, JR Wright, M Bowling arXiv preprint arXiv:1912.02967, 2019	11	2019
Simultaneous prediction intervals for patient-specific survival curves S Sokota, R D'Orazio, K Javed, H Haider, R Greiner arXiv preprint arXiv:1906.10780, 2019	7	2019
Regret minimization with function approximation in extensive-form games R D'Orazio	6	2020
Optimistic and adaptive lagrangian hedging R D'Orazio, R Huang arXiv preprint arXiv:2101.09603, 2021	4	2021
Abstracting imperfect information away from two-player zero-sum games S Sokota, R D’Orazio, CK Ling, DJ Wu, JZ Kolter, N Brown International Conference on Machine Learning, 32169-32193, 2023	3	2023
On stochastic mirror descent: Convergence analysis and adaptive variants R D’Orazio, N Loizou, I Laradji, I Mitliagkas Beyond First-Order Methods in ML Systems Workshop, Int. Conf. Machine Learning, 2021	2	2021
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections D Morrill, R D'Orazio, M Lanctot, JR Wright, M Bowling, AR Greenwald arXiv preprint arXiv:2205.12031, 2022	1	2022
Bounds for approximate regret-matching algorithms R D'Orazio, D Morrill, JR Wright arXiv preprint arXiv:1910.01706, 2019	1	2019
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games Supplementary D Morrill, R D’Orazio, M Lanctot, JR Wright, M Bowling, AR Greenwald

系统目前无法执行此操作，请稍后再试。

文章 1–14