Pablo Hernandez-Leal 个人学术档案

引用次数

	总计	2019 年至今
引用	2327	2018
h 指数	20	17
i10 指数	30	26

480

240

120

360

20132014201520162017201820192020202120222023202416 15 28 39 79 113 163 309 449 466 419 210

开放获取的出版物数量

查看全部

14 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Matthew E. TaylorProfessor, University of Alberta在 ualberta.ca 的电子邮件经过验证
Luis Enrique SucarSenior Research Scientist, INAOE, Mexico在 inaoep.mx 的电子邮件经过验证
Bilal KartalNVIDIA在 nvidia.com 的电子邮件经过验证
Enrique Munoz de CoteProfessor of Computer Science at INAOE; CSO @ People & AI; past: Director at Secondmind在 people-ai.com 的电子邮件经过验证
Eduardo MoralesComputer Science, INAOE在 inaoep.mx 的电子邮件经过验证
Michael KaisersDeepMind在 deepmind.com 的电子邮件经过验证
Oscar MayoraFBK - Head of Digital Health Lab在 fbk.eu 的电子邮件经过验证
Alban Maxhuni, PhDTechnical University of Denmark在 dtu.dk 的电子邮件经过验证
Jesus A. GonzalezThe University of Texas at Arlington在 uta.edu 的电子邮件经过验证
Tim BaarslagCentrum Wiskunde & Informatica (CWI)在 cwi.nl 的电子邮件经过验证
Fernando LezamaPolytechnic Institute of Porto, ISEP, GECAD在 isep.ipp.pt 的电子邮件经过验证
Joao SoaresPolytechnic Institute of Porto, ISEP, GECAD在 isep.ipp.pt 的电子邮件经过验证
Tiago PintoUTAD / INESC-TEC在 utad.pt 的电子邮件经过验证
Orihuela-Espina, FelipeUniversity of Birmingham; Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE); Imperial在 bham.ac.uk 的电子邮件经过验证
Felipe Leno da Silva (Leno)Lawrence Livermore National Lab在 llnl.gov 的电子邮件经过验证
Concha BielzaProfessor of Statistics and Operations Research, Technical University of Madrid在 fi.upm.es 的电子邮件经过验证
Benjamin RosmanProfessor at the University of the Witwatersrand, South Africa在 wits.ac.za 的电子邮件经过验证
Pablo H. IbarguengoytiaIIE在 iie.org.mx 的电子邮件经过验证
Jesús Ariel Carrasco OchoaInstituto Nacional de Astrofísica, Óptica y Electrónoca (INAOE), México在 inaoep.mx 的电子邮件经过验证
J. Arturo Olvera-LopezAutonomous University of Puebla (Faculty of Computer Science)在 correo.buap.mx 的电子邮件经过验证

关注

Pablo Hernandez-Leal

Quantfury

在 quantfury.com 的电子邮件经过验证 - 首页

Machine Learning Reinforcement Learning Intelligent Agents


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
A survey and critique of multiagent deep reinforcement learning P Hernandez-Leal, B Kartal, ME Taylor Autonomous Agents and Multi-Agent Systems 33 (6), 750-797, 2019	703*	2019
A survey of learning in multiagent environments: Dealing with non-stationarity P Hernandez-Leal, M Kaisers, T Baarslag, EM De Cote arXiv preprint arXiv:1707.09183, 2017	331	2017
Local energy markets: Paving the path toward fully transactive energy systems F Lezama, J Soares, P Hernandez-Leal, M Kaisers, T Pinto, Z Vale IEEE Transactions on Power Systems 34 (5), 4081-4088, 2018	324	2018
Multi-label classification with Bayesian network-based chain classifiers LE Sucar, C Bielza, EF Morales, P Hernandez-Leal, JH Zaragoza, ... Pattern Recognition Letters 41, 14-22, 2014	144	2014
Uncertainty-aware action advising for deep reinforcement learning agents FL Da Silva, P Hernandez-Leal, B Kartal, ME Taylor Proceedings of the AAAI conference on artificial intelligence 34 (04), 5792-5799, 2020	81	2020
Stress modelling and prediction in presence of scarce data A Maxhuni, P Hernandez-Leal, LE Sucar, V Osmani, EF Morales, ... Journal of biomedical informatics 63, 344-356, 2016	81	2016
Agent modeling as auxiliary task for deep reinforcement learning P Hernandez-Leal, B Kartal, ME Taylor Proceedings of the AAAI conference on artificial intelligence and …, 2019	55	2019
Efficiently detecting switches against non-stationary opponents P Hernandez-Leal, Y Zhan, ME Taylor, LE Sucar, E Munoz de Cote Autonomous Agents and Multi-Agent Systems 31, 767-789, 2017	44	2017
Identifying and tracking switching, non-stationary opponents: A Bayesian approach P Hernandez-Leal, ME Taylor, BS Rosman, LE Sucar, E Munoz de Cote Association for the Advancement of Artificial Intelligence (AAAI), 2016	35	2016
Terminal prediction as an auxiliary task for deep reinforcement learning B Kartal, P Hernandez-Leal, ME Taylor Proceedings of the AAAI Conference on Artificial Intelligence and …, 2019	34	2019
Towards a fast detection of opponents in repeated stochastic games P Hernandez-Leal, M Kaisers Autonomous Agents and Multiagent Systems: AAMAS 2017 Workshops, Best Papers …, 2017	34	2017
InstanceRank based on borders for instance selection P Hernandez-Leal, JA Carrasco-Ochoa, JF Martínez-Trinidad, ... Pattern Recognition 46 (1), 365-375, 2013	32	2013
Learning against sequential opponents in repeated stochastic games P Hernandez-Leal, M Kaisers The 3rd Multi-disciplinary Conference on Reinforcement Learning and Decision …, 2017	29	2017
Cdt: Cascading decision trees for explainable reinforcement learning Z Ding, P Hernandez-Leal, GW Ding, C Li, R Huang arXiv preprint arXiv:2011.07553, 2020	27	2020
A framework for learning and planning against switching strategies in repeated games P Hernandez-Leal, E Munoz de Cote, LE Sucar Connection Science 26 (2), 103-122, 2014	27	2014
Action guidance with MCTS for deep reinforcement learning B Kartal, P Hernandez-Leal, ME Taylor Proceedings of the AAAI conference on artificial intelligence and …, 2019	26	2019
Learning temporal nodes Bayesian networks P Hernandez-Leal, JA Gonzalez, EF Morales, LE Sucar International Journal of Approximate Reasoning 54 (8), 956-977, 2013	26	2013
A Bayesian approach for learning and tracking switching, non-stationary opponents P Hernandez-Leal, B Rosman, ME Taylor, LE Sucar, E Munoz de Cote Proceedings of the 2016 international conference on autonomous agents …, 2016	23	2016
Skynet: A top deep rl agent in the inaugural pommerman team competition C Gao, P Hernandez-Leal, B Kartal, ME Taylor arXiv preprint arXiv:1905.01360, 2019	22	2019
An exploration strategy for non-stationary opponents P Hernandez-Leal, Y Zhan, ME Taylor, LE Sucar, E Munoz de Cote Autonomous Agents and Multi-Agent Systems 31, 971-1002, 2017	22	2017

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用