Thomas Mesnard 个人学术档案

引用次数

	总计	2019 年至今
引用	1449	1251
h 指数	12	12
i10 指数	12	12

540

270

135

405

201520162017201820192020202120222023202416 31 70 74 84 119 134 138 236 535

开放获取的出版物数量

查看全部

2 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFAR在 umontreal.ca 的电子邮件经过验证
Rémi MunosGoogle DeepMind在 inria.fr 的电子邮件经过验证
Bilal PiotGoogle Deepmind在 google.com 的电子邮件经过验证
Will DabneyDeepMind在 google.com 的电子邮件经过验证
Doina PrecupDeepMind and McGill University在 cs.mcgill.ca 的电子邮件经过验证
Theophane WeberResearch Scientist at DeepMind在 google.com 的电子邮件经过验证
Eric MoulinesProfesseur, Ecole Polytechnique, Membre de l'Académie des Sciences在 polytechnique.edu 的电子邮件经过验证
Armand JoulinGoogle DeepMind在 google.com 的电子邮件经过验证
Laurent SifreGoogle DeepMind在 polytechnique.edu 的电子邮件经过验证
Demis HassabisDeepMind
Jeff DeanGoogle Chief Scientist, Google Research and Google DeepMind在 google.com 的电子邮件经过验证
koray kavukcuogluDeepMind在 kavukcuoglu.org 的电子邮件经过验证
Clement FarabetEx Research Scientist, New York University在 nyu.edu 的电子邮件经过验证
Oriol VinyalsResearch Scientist at Google DeepMind在 google.com 的电子邮件经过验证
Noah FiedelGoogle在 engineeralum.berkeley.edu 的电子邮件经过验证

关注

Thomas Mesnard

Research Scientist at Google DeepMind

在 google.com 的电子邮件经过验证 - 首页

LLM Reinforcement Learning Artificial Intelligence


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Towards biologically plausible deep learning Y Bengio, DH Lee, J Bornschein, T Mesnard, Z Lin arXiv preprint arXiv:1502.04156, 2015	429	2015
Rlaif: Scaling reinforcement learning from human feedback with ai feedback H Lee, S Phatale, H Mansoor, K Lu, T Mesnard, C Bishop, V Carbune, ... arXiv preprint arXiv:2309.00267, 2023	238	2023
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	220	2024
An objective function for STDP Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu arXiv preprint arXiv:1509.05936 5 (6.2), 6.3, 2015	182*	2015
Hindsight credit assignment A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ... Advances in neural information processing systems 32, 2019	96	2019
Counterfactual credit assignment in model-free reinforcement learning T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ... arXiv preprint arXiv:2011.09464, 2020	66	2020
Generalization of equilibrium propagation to vector field dynamics B Scellier, A Goyal, J Binas, T Mesnard, Y Bengio arXiv preprint arXiv:1808.04873, 2018	47*	2018
Nash learning from human feedback R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023	43	2023
Geometric entropic exploration ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ... arXiv preprint arXiv:2101.02055, 2021	40	2021
Direct language model alignment from online ai feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024	32	2024
Towards deep learning with spiking neurons in energy based models with contrastive hebbian plasticity T Mesnard, W Gerstner, J Brea arXiv preprint arXiv:1612.03214, 2016	27	2016
Curiosity in hindsight: Intrinsic exploration in stochastic environments D Jarrett, C Tallec, F Altché, T Mesnard, R Munos, M Valko	13	2023
Ghost units yield biologically plausible backprop in deep neural networks T Mesnard, G Vignoud, J Sacramento, W Senn, Y Bengio arXiv preprint arXiv:1911.08585, 2019	6	2019
A survey of temporal credit assignment in deep reinforcement learning E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni arXiv preprint arXiv:2312.01072, 2023	4	2023
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024	2	2024
Quantile credit assignment T Mesnard, W Chen, A Saade, Y Tang, M Rowland, T Weber, C Lyle, ... International Conference on Machine Learning, 24517-24531, 2023	2	2023
Activation alignment: exploring the use of approximate activity gradients in multilayer networks T Mesnard, B Richards 2018 Conference on Cognitive Computational Neuroscience, Brentwood …, 2018	1	2018
Connectionist Temporal Classification: Labelling Unsegmented Sequences with Recurrent Neural Networks A AUVOLAT, T MESNARD	1	2006
Credit Assignment in Deep Reinforcement Learning T Mesnard Institut Polytechnique de Paris, 2023		2023

系统目前无法执行此操作，请稍后再试。

文章 1–19

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用