Amir-massoud Farahmand 个人学术档案

引用次数

	总计	2019 年至今
引用	2207	1388
h 指数	23	19
i10 指数	42	32

300

150

225

200820092010201120122013201420152016201720182019202020212022202320249 38 45 65 63 81 81 78 98 102 138 156 212 230 298 283 208

开放获取的出版物数量

查看全部

6 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Csaba SzepesvariDeepMind & University of Alberta在 cs.ualberta.ca 的电子邮件经过验证
Mohammad GhavamzadehAmazon在 amazon.com 的电子邮件经过验证
Daniel NikovskiChief Scientist, Mitsubishi Electric Research Labs在 merl.com 的电子邮件经过验证
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia在 technion.ac.il 的电子邮件经过验证
Doina PrecupDeepMind and McGill University在 cs.mcgill.ca 的电子邮件经过验证
Azad ShademanIntuitive Surgical Inc.在 intusurg.com 的电子邮件经过验证
Martin JagersandUniversity of Alberta在 cs.ualberta.ca 的电子邮件经过验证
Yangchen PanUniversity of Oxford在 eng.ox.ac.uk 的电子邮件经过验证
Andre BarretoResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Martha WhiteUniversity of Alberta在 ualberta.ca 的电子邮件经过验证
Majid Nili AhmadabadiProfessor of ECE, University of Tehran在 ut.ac.ir 的电子邮件经过验证
Saleh NabiAI Researcher, Schneider Electric在 se.com 的电子邮件经过验证
Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; Mila在 cs.mcgill.ca 的电子邮件经过验证
Claas VoelckerPhD student at University of Toronto在 cs.toronto.edu 的电子邮件经过验证
Babak N AraabiProfessor of ECE, University of Tehran在 ut.ac.ir 的电子邮件经过验证
Beomjoon KimKorea Advanced Institute of Science & Technology (KAIST)在 kaist.ac.kr 的电子邮件经过验证
J. Andrew BagnellCarnegie Mellon University在 ri.cmu.edu 的电子邮件经过验证
Rémi MunosGoogle DeepMind在 inria.fr 的电子邮件经过验证
mouhacine benosmanMERL- Dynamical systems team- Lead在 merl.com 的电子邮件经过验证
Romina AbachiUniversity of Toronto, Vector Institute在 mail.utoronto.ca 的电子邮件经过验证

关注

Amir-massoud Farahmand

University of Toronto

在 cs.toronto.edu 的电子邮件经过验证 - 首页

Machine Learning Reinforcement Learning Sequential Decision Making Statistical Learning Theory


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Error propagation for approximate policy and value iteration A Farahmand, C Szepesvári, R Munos Advances in Neural Information Processing Systems (NeurIPS), 568-576, 2010	273	2010
Regularized Policy Iteration A Farahmand, M Ghavamzadeh, S Mannor, C Szepesvári Advances in Neural Information Processing Systems 21 (NeurIPS 2008), 441-448, 2009	162	2009
Manifold-adaptive dimension estimation A Farahmand, C Szepesvári, JY Audibert Proceedings of the 24th International Conference on Machine Learning (ICML …, 2007	139	2007
Learning from Limited Demonstrations B Kim, A Farahmand, J Pineau, D Precup Advances in Neural Information Processing Systems (NeurIPS), 2859-2867, 2013	136	2013
Value-aware loss function for model-based reinforcement learning A Farahmand, A Barreto, D Nikovski Artificial Intelligence and Statistics (AISTATS), 1486-1494, 2017	127	2017
Regularized policy iteration with nonparametric function spaces A Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor Journal of Machine Learning Research (JMLR) 17 (1), 4809-4874, 2016	124*	2016
Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems A Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor American Control Conference (ACC), 725-730, 2009	96*	2009
Robust jacobian estimation for uncalibrated visual servoing A Shademan, A Farahmand, M Jägersand IEEE International Conference on Robotics and Automation (ICRA), 5564-5569, 2010	86	2010
Model Selection in Reinforcement Learning AM Farahmand, C Szepesvári Machine learning 85 (3), 299-332, 2011	74	2011
Iterative Value-Aware Model Learning A Farahmand Advances in Neural Information Processing Systems (NeurIPS), 9072-9083, 2018	67	2018
Action-Gap Phenomenon in Reinforcement Learning AM Farahmand Neural Information Processing Systems (NeurIPS), 2011	63	2011
Global visual-motor estimation for uncalibrated visual servoing A Farahmand, A Shademan, M Jagersand IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS …, 2007	53*	2007
Deep reinforcement learning for partial differential equation control A Farahmand, S Nabi, DN Nikovski American Control Conference (ACC), 3120-3127, 2017	49	2017
Regularization in Reinforcement Learning AM Farahmand Department of Computing Science, University of Alberta, 2011	45	2011
Attentional network for visual object detection K Hara, MY Liu, O Tuzel, A Farahmand arXiv preprint arXiv:1702.01478, 2017	39	2017
Model-based and model-free reinforcement learning for visual servoing A Farahmand, A Shademan, M Jagersand, C Szepesvári IEEE International Conference on Robotics and Automation (ICRA), 2917-2924, 2009	39*	2009
Policy-aware model learning for policy gradient methods R Abachi, M Ghavamzadeh, A Farahmand arXiv:2003.00030, 2020	35	2020
Approximate MaxEnt Inverse Optimal Control and its Application for Mental Simulation of Human Interactions DA Huang, AM Farahmand, KM Kitani, JA Bagnell AAAI Conference on Artificial Intelligence (AAAI), 2015	32	2015
Improving Skin Condition Classification with a Visual Symptom Checker Trained using Reinforcement Learning M Akrout, A Farahmand, T Jarmain, L Abid International Conference on Medical Image Computing and Computer Assisted …, 2019	28	2019
Method for Data-Driven Learning-based Control of HVAC Systems using High-Dimensional Sensory Observations A Farahmand, S Nabi, P Grover, DN Nikovski US Patent App. 15/290,038, 2018	28	2018

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用