Johan Ferret

Cited by

	All	Since 2019
Citations	2093	2093
h-index	12	12
i10-index	12	12

1700

850

425

1275

2020202120222023202424 83 134 216 1627

Co-authors

Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Verified email at univ-lorraine.fr
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Verified email at univ-lille.fr
Thomas MesnardResearch Scientist at Google DeepMindVerified email at google.com
Léonard HussenotGoogle DeepMindVerified email at google.com
Olivier BachemResearch Scientist, Google BrainVerified email at google.com
Ramé AlexandreGoogle DeepMindVerified email at google.com
Nino VieillardGoogle DeepMindVerified email at google.com
Geoffrey CideronGoogle DeepMindVerified email at google.com
Philippe PreuxProfessor of computer science, Université de Lille, LIFL, SequeL, INRIAVerified email at univ-lille.fr
Robert DadashiGoogle DeepMindVerified email at google.com
Harrison LeeGoogle DeepmindVerified email at google.com
Samrat PhataleGoogle DeepMindVerified email at google.com
Raphaël MarinierGoogle AIVerified email at google.com
Sabela RamosSoftware Engineer. Google.Verified email at google.com
Nathan GrinsztajnInriaVerified email at inria.fr
Yannis Flet-BerliacPostdoc, Stanford UniversityVerified email at stanford.edu
Alexis D. JacqGoogleVerified email at google.com
Roee AharoniGoogle ResearchVerified email at google.com
Mathieu BlondelGoogleVerified email at google.com
Eduardo PignatelliUniversity College LondonVerified email at ucl.ac.uk

Johan Ferret

Research Scientist, Google DeepMind

Verified email at google.com - Homepage

Reinforcement Learning Machine Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a Family of Highly Capable Multimodal Models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ... International Conference on Machine Learning (ICML 2024), 2023	255	2023
Gemma: Open Models Based on Gemini Research and Technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	254	2024
Acme: A Research Framework for Distributed Reinforcement Learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	241	2020
Adversarially Guided Actor-Critic Y Flet-Berliac, J Ferret, O Pietquin, P Preux, M Geist International Conference on Learning Representations (ICLR 2021), 2021	78	2021
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback P Roit, J Ferret, L Shani*, R Aharoni, G Cideron, R Dadashi, M Geist, ... ACL, 2023	44	2023
Direct Language Model Alignment from Online AI Feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024	36	2024
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning J Ferret, R Marinier, M Geist, O Pietquin International Joint Conference on Artificial Intelligence (IJCAI 2020), 2019	33	2019
WARM: On the Benefits of Weight Averaged Reward Models A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ... International Conference on Machine Learning (ICML 2024), 2024	26	2024
Self-Imitation Advantage Learning J Ferret, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	25	2020
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning N Grinsztajn, J Ferret, O Pietquin, P Preux, M Geist Advances in Neural Information Processing Systems (NeurIPS 2021), 2021	22	2021
Lazy-MDPs: Towards Interpretable Reinforcement Learning By Learning When To Act A Jacq, J Ferret, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2022	20*	2022
Credit assignment as a proxy for transfer in reinforcement learning J Ferret, R Marinier, M Geist, O Pietquin Learning Transferrable Skills Workshop, NeurIPS, 2019	6	2019
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni Transactions on Machine Learning Research (TMLR), 2023	5	2023
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024	2	2024
More efficient exploration with symbolic priors on action sequence equivalences T Johnstone, N Grinsztajn, J Ferret, P Preux Deep Reinforcement Learning Workshop, NeurIPS, 2022	2*	2022
On actions that matter: Credit assignment and interpretability in reinforcement learning J Ferret Université de Lille, 2022	2	2022
Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning K Wang, R Kidambi, R Sullivan, A Agarwal, C Dann, A Michi, M Gelmi, ... arXiv preprint arXiv:2407.15762, 2024		2024
BOND: Aligning LLMs with Best-of-N Distillation PG Sessa, R Dadashi, L Hussenot, J Ferret, N Vieillard, A Ramé, ... arXiv preprint arXiv:2407.14622, 2024		2024
WARP: On the Benefits of Weight Averaged Rewarded Policies A Ramé, J Ferret, N Vieillard, R Dadashi, L Hussenot, PL Cedoz, ... arXiv preprint arXiv:2406.16768, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors