Thomas Mesnard

Cited by

	All	Since 2019
Citations	2077	1878
h-index	14	14
i10-index	14	14

1200

600

300

900

201520162017201820192020202120222023202416 31 71 73 84 119 134 138 235 1163

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Rémi MunosGoogle DeepMindVerified email at inria.fr
Bilal PiotGoogle DeepmindVerified email at google.com
Will DabneyDeepMindVerified email at google.com
Theophane WeberResearch Scientist at DeepMindVerified email at google.com
Doina PrecupDeepMind and McGill UniversityVerified email at cs.mcgill.ca
Eric MoulinesProfesseur, Ecole Polytechnique, Membre de l'Académie des SciencesVerified email at polytechnique.edu
Armand JoulinGoogle DeepMindVerified email at google.com
Laurent SifreGoogle DeepMindVerified email at polytechnique.edu
Demis HassabisDeepMind
Jeff DeanGoogle Chief Scientist, Google Research and Google DeepMindVerified email at google.com
koray kavukcuogluDeepMindVerified email at kavukcuoglu.org
Clement FarabetEx Research Scientist, New York UniversityVerified email at nyu.edu
Oriol VinyalsResearch Scientist at Google DeepMindVerified email at google.com
Noah FiedelGoogleVerified email at engineeralum.berkeley.edu

Thomas Mesnard

Research Scientist at Google DeepMind

Verified email at google.com

LLM Reinforcement Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	540	2024
Towards biologically plausible deep learning Y Bengio, DH Lee, J Bornschein, T Mesnard, Z Lin arXiv preprint arXiv:1502.04156, 2015	443	2015
Rlaif: Scaling reinforcement learning from human feedback with ai feedback H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ... arXiv preprint arXiv:2309.00267, 2023	348	2023
An objective function for STDP Y Bengio, T Mesnard, A Fischer, S Zhang, Y Wu arXiv preprint arXiv:1509.05936 5 (6.2), 6.3, 2015	186*	2015
Hindsight credit assignment A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ... Advances in neural information processing systems 32, 2019	97	2019
Gemma 2: Improving open language models at a practical size G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ... arXiv preprint arXiv:2408.00118, 2024	89	2024
Counterfactual credit assignment in model-free reinforcement learning T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ... arXiv preprint arXiv:2011.09464, 2020	70	2020
Nash learning from human feedback R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023	67	2023
Direct language model alignment from online ai feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024	62	2024
Generalization of equilibrium propagation to vector field dynamics B Scellier, A Goyal, J Binas, T Mesnard, Y Bengio arXiv preprint arXiv:1808.04873, 2018	48*	2018
Geometric entropic exploration ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ... arXiv preprint arXiv:2101.02055, 2021	40	2021
Towards deep learning with spiking neurons in energy based models with contrastive hebbian plasticity T Mesnard, W Gerstner, J Brea arXiv preprint arXiv:1612.03214, 2016	27	2016
Charline Le Lan, Christopher A T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, L Sifre, ...	21	2024
Curiosity in hindsight: Intrinsic exploration in stochastic environments D Jarrett, C Tallec, F Altché, T Mesnard, R Munos, M Valko	16	2023
Ghost units yield biologically plausible backprop in deep neural networks T Mesnard, G Vignoud, J Sacramento, W Senn, Y Bengio arXiv preprint arXiv:1911.08585, 2019	7	2019
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024	5	2024
A survey of temporal credit assignment in deep reinforcement learning E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni arXiv preprint arXiv:2312.01072, 2023	5	2023
Quantile credit assignment T Mesnard, W Chen, A Saade, Y Tang, M Rowland, T Weber, C Lyle, ... International Conference on Machine Learning, 24517-24531, 2023	4	2023
Activation alignment: exploring the use of approximate activity gradients in multilayer networks T Mesnard, B Richards 2018 Conference on Cognitive Computational Neuroscience, Brentwood …, 2018	1	2018
Connectionist Temporal Classification: Labelling Unsegmented Sequences with Recurrent Neural Networks A AUVOLAT, T MESNARD	1	2006

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors