Michael Gimelfarb: Scholar Profile

Citations

	All	Since 2019
Citations	104	103
h-index	6	6
i10-index	3	3

Citations per year: 2018: 1, 2019: 2, 2020: 7, 2021: 13, 2022: 21, 2023: 40, 2024: 20

Open-access publications

2 articles available
0 articles not available

Based on funder open-access mandates

Co-authors

Scott Sanner, University of Toronto (verified email at mie.utoronto.ca)
Chi-Guhn Lee, University of Toronto (verified email at mie.utoronto.ca)
Jihwan Jeong, University of Toronto (verified email at mail.utoronto.ca)
Ayal Taitler, University of Toronto (verified email at utoronto.ca)
Noah Patton, PhD Student, University of Texas at Austin (verified email at cs.utexas.edu)
Andre Barreto, Research Scientist, Google DeepMind (verified email at google.com)
Martin Mladenov, Google Research (verified email at google.com)
Sriram Gopalakrishnan, JP Morgan AI Research (verified email at asu.edu)
Xiaotian Liu, Queen's University (verified email at queensu.ca)
Baher Abdulhai, Professor, University of Toronto (verified email at utoronto.ca)
Hyunwoo Kim, Zhejiang Lab (verified email at zhejianglab.com)
Xiaoyu Wang, Department of Civil Engineering, University of Toronto (verified email at mail.utoronto.ca)
Daniel Fišer, Saarland University (verified email at danfis.cz)
Gregor Behnke, ILLC - University of Amsterdam (verified email at uva.nl)
Enrico Scala, Università di Brescia (verified email at unibs.it)
Ron Alford, The MITRE Corporation (verified email at mitre.org)
Javier Segovia-Aguas, Universitat Pompeu Fabra (verified email at upf.edu)
Dominik P. Schreiber, Karlsruhe Institute of Technology (verified email at kit.edu)
Jendrik Seipp, Associate Professor, Linköping University (verified email at liu.se)
Joan Espasa Arxer, University of St Andrews (verified email at st-andrews.ac.uk)


Michael Gimelfarb

Computer Science, University of Toronto

Verified email at mail.utoronto.ca

Research interests: machine learning, deep learning, reinforcement learning, robotics, Bayesian statistics


Title	Cited by	Year
Reinforcement learning with multiple experts: A Bayesian model combination approach. M Gimelfarb, S Sanner, CG Lee. Advances in Neural Information Processing Systems (NeurIPS) 31, 9528-9538, 2018	29	2018
ε-BMC: A Bayesian Ensemble Approach to Epsilon-Greedy Exploration in Model-Free Reinforcement Learning. M Gimelfarb, S Sanner, CG Lee. Uncertainty in Artificial Intelligence (UAI-19) 35, 476-485, 2019	26	2019
Risk-Aware Transfer in Reinforcement Learning using Successor Features. M Gimelfarb, A Barreto, S Sanner, CG Lee. Advances in Neural Information Processing Systems (NeurIPS) 34, 2021	17	2021
pyRDDLGym: From RDDL to Gym Environments. A Taitler, M Gimelfarb, J Jeong, S Gopalakrishnan, M Mladenov, X Liu, ... PRL Workshop – Bridging the Gap Between AI Planning and Reinforcement Learning, 2023	9	2023
Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization. J Jeong, X Wang, M Gimelfarb, H Kim, B Abdulhai, S Sanner. International Conference on Learning Representations (ICLR), 2023	8	2023
Contextual policy transfer in reinforcement learning domains via deep mixtures-of-experts. M Gimelfarb, S Sanner, CG Lee. Uncertainty in Artificial Intelligence (UAI-21) 37, 1787-1797, 2021	6*	2021
A Distributional Framework for Risk-Sensitive End-to-End Planning in Continuous MDPs. N Patton, J Jeong, M Gimelfarb, S Sanner. AAAI Conference on Artificial Intelligence (AAAI) 6 (9), 9894-9901, 2022	5*	2022
The 2023 International Planning Competition. A Taitler, R Alford, J Espasa, G Behnke, D Fišer, M Gimelfarb, ... AI Magazine, 2024	2	2024
Bayesian Experience Reuse for Learning from Multiple Demonstrators. M Gimelfarb, S Sanner, CG Lee. International Joint Conference on Artificial Intelligence (IJCAI) 30, 2021	2	2021
JaxPlan and GurobiPlan: Optimization Baselines for Replanning in Discrete and Mixed Discrete-Continuous Probabilistic Domains. M Gimelfarb, A Taitler, S Sanner. Proceedings of the International Conference on Automated Planning and …, 2024		2024
Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs. M Gimelfarb, A Taitler, S Sanner. arXiv preprint arXiv:2401.12243, 2024		2024
Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions. M Gimelfarb, MJ Kim. arXiv preprint arXiv:2305.07844, 2023		2023
Who Should I Trust?: Uncertainty and Risk for Knowledge Transfer from Multiple Sources in Reinforcement Learning Domains. M Gimelfarb. University of Toronto (Canada), 2023		2023
Distributional Reward Shaping: Point Estimates Are All You Need. M Gimelfarb, S Sanner, CG Lee. The Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2022		2022
End-to-End Risk-Aware Planning by Gradient Descent. N Patton, J Jeong, M Gimelfarb, S Sanner. PRL Workshop – Bridging the Gap Between AI Planning and Reinforcement Learning, 2021		2021
Thompson Sampling for the Control of a Queue with Demand Uncertainty. M Gimelfarb. University of Toronto (Canada), 2017		2017


Articles 1–16
