Joshua Romoff: personal scholar profile

Cited by

	All	Since 2019
Citations	1565	1487
h-index	12	12
i10-index	13	13

Citations per year: 2017: 21, 2018: 53, 2019: 100, 2020: 148, 2021: 222, 2022: 335, 2023: 466, 2024: 213

Co-authors

Joelle Pineau, School of Computer Science, McGill University; FAIR, Meta AI; Mila (verified email at cs.mcgill.ca)
Peter Henderson, Princeton University (verified email at princeton.edu)
Romain Laroche, Microsoft Research (verified email at polytechnique.org)
Harm van Seijen, Sony AI (verified email at sony.com)
Mehdi Fatemi, Wand.ai (verified email at wand.ai)
Emma Brunskill, Associate Professor of Computer Science, Stanford University (verified email at cs.stanford.edu)
Michael Rabbat, Research Scientist at Facebook (verified email at fb.com)
Ahmed Touati, Meta AI (verified email at umontreal.ca)
Dan Jurafsky, Professor of Linguistics and Computer Science, Stanford University (verified email at stanford.edu)
Jieru Hu, DeepMind (verified email at deepmind.com)
Devi Parikh, Previously: FAIR and GenAI @ Meta. Georgia Tech (verified email at gatech.edu)
Dhruv Batra, Georgia Tech | Prev: FAIR (Meta AI) (verified email at gatech.edu)
Theophile Gervet, Carnegie Mellon University (verified email at andrew.cmu.edu)
Jeffrey Tsang, Google (verified email at google.com)
Tavian Barnes, PhD Student, University of Waterloo (verified email at uwaterloo.ca)
Pierre-Luc Bacon, University of Montreal (verified email at mila.quebec)
Maxim Peter, Research engineer (verified email at polytechnique.edu)
Emmanuel Bengio, McGill University (verified email at mail.mcgill.ca)
Alexandre Piché, ServiceNow Research, Mila (verified email at mail.mcgill.ca)
Vincent François-Lavet, VU Amsterdam (verified email at vu.nl)


Joshua Romoff

Ubisoft La Forge

Verified email at ubisoft.com

Reinforcement Learning, Deep Learning


Title	Cited by	Year
Towards the systematic reporting of the energy and carbon footprints of machine learning. P Henderson, J Hu, J Romoff, E Brunskill, D Jurafsky, J Pineau. Journal of Machine Learning Research 21 (248), 1-43, 2020	474	2020
TarMAC: Targeted multi-agent communication. A Das, T Gervet, J Romoff, D Batra, D Parikh, M Rabbat, J Pineau. International Conference on Machine Learning, 1538-1546, 2019	401	2019
Hybrid reward architecture for reinforcement learning. H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang. Advances in Neural Information Processing Systems 30, 2017	332*	2017
Reward estimation for variance reduction in deep reinforcement learning. J Romoff, P Henderson, A Piché, V Francois-Lavet, J Pineau. Conference on Robot Learning, 674-699, 2018	51	2018
Where did my optimum go?: An empirical analysis of gradient descent optimization in policy gradient methods. P Henderson, J Romoff, J Pineau. arXiv preprint arXiv:1810.02525, 2018	45*	2018
Deep reinforcement learning for navigation in AAA video games. E Alonso, M Peter, D Goumard, J Romoff. Proceedings of the Thirtieth International Joint Conference on Artificial …, 2021	43	2021
Randomized value functions via multiplicative normalizing flows. A Touati, H Satija, J Romoff, J Pineau, P Vincent. Uncertainty in Artificial Intelligence, 422-432, 2020	42	2020
Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning. M Assran, J Romoff, N Ballas, J Pineau, M Rabbat. Advances in Neural Information Processing Systems, 13299-13309, 2019	34	2019
Direct behavior specification via constrained reinforcement learning. J Roy, R Girgis, J Romoff, PL Bacon, C Pal. arXiv preprint arXiv:2112.12228, 2021	31	2021
Multi-advisor reinforcement learning. R Laroche, M Fatemi, J Romoff, H van Seijen. arXiv preprint arXiv:1704.00756, 2017	28	2017
Separation of concerns in reinforcement learning. H van Seijen, M Fatemi, J Romoff, R Laroche. arXiv preprint arXiv:1612.05159, 2016	28*	2016
Separating value functions across time-scales. J Romoff, P Henderson, A Touati, E Brunskill, J Pineau, Y Ollivier. International Conference on Machine Learning, 5468-5477, 2019	26	2019
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? J Romoff, P Henderson, D Kanaa, E Bengio, A Touati, PL Bacon, J Pineau. Proceedings of the 20th International Conference on Autonomous Agents and …, 2021	10*	2021
Graph augmented deep reinforcement learning in the GameRLand3D environment. E Beeching, M Peter, P Marcotte, J Debangoye, O Simonin, J Romoff, ... arXiv preprint arXiv:2112.11731, 2021	9	2021
Deep conditional multi-task learning in Atari. J Romoff, E Bengio, J Pineau. Workshop on Abstraction in Reinforcement Learning at ICML, 2016	5	2016
Improving Intrinsic Exploration by Creating Stationary Objectives. RC Castanyer, J Romoff, G Berseth. arXiv preprint arXiv:2310.18144, 2023	2	2023
Learning Computational Efficient Bots with Costly Features. A Kobanda, CA Valliappan, J Romoff, L Denoyer. 2023 IEEE Conference on Games (CoG), 1-8, 2023	2	2023
Decomposing the Bellman Equation in Reinforcement Learning. J Romoff. McGill University (Canada), 2021	2	2021
Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play. D Bairamian, P Marcotte, J Romoff, G Robert, D Nowrouzezahrai. arXiv preprint arXiv:2311.17190, 2023		2023
About the attractor phenomenon in decomposed reinforcement learning. R Laroche, M Fatemi, J Romoff, H van Seijen
