Follow
Johan Ferret
Johan Ferret
Research Scientist, Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini: a Family of Highly Capable Multimodal Models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
7432023
Acme: A Research Framework for Distributed Reinforcement Learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2372020
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ...
International Conference on Machine Learning (ICML 2024), 2023
1972023
Gemma: Open Models Based on Gemini Research and Technology
G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...
arXiv preprint arXiv:2403.08295, 2024
1142024
Adversarially Guided Actor-Critic
Y Flet-Berliac*, J Ferret*, O Pietquin, P Preux, M Geist
International Conference on Learning Representations (ICLR 2021), 2021
702021
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback
P Roit*, J Ferret*, L Shani*, R Aharoni, G Cideron, R Dadashi, M Geist, ...
ACL, 2023
332023
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning
J Ferret, R Marinier, M Geist, O Pietquin
International Joint Conference on Artificial Intelligence (IJCAI 2020), 2019
332019
Self-Imitation Advantage Learning
J Ferret, O Pietquin, M Geist
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020
232020
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
N Grinsztajn*, J Ferret*, O Pietquin, P Preux, M Geist
Advances in Neural Information Processing Systems (NeurIPS 2021), 2021
222021
Direct Language Model Alignment from Online AI Feedback
S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ...
arXiv preprint arXiv:2402.04792, 2024
192024
Lazy-MDPs: Towards Interpretable Reinforcement Learning By Learning When To Act
A Jacq*, J Ferret*, O Pietquin, M Geist
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2022
19*2022
WARM: On the Benefits of Weight Averaged Reward Models
A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ...
International Conference on Machine Learning (ICML 2024), 2024
162024
Credit assignment as a proxy for transfer in reinforcement learning
J Ferret, R Marinier, M Geist, O Pietquin
Learning Transferrable Skills Workshop, NeurIPS, 2019
62019
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning
E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni
Transactions on Machine Learning Research (TMLR), 2023
42023
More efficient exploration with symbolic priors on action sequence equivalences
T Johnstone, N Grinsztajn, J Ferret, P Preux
Deep Reinforcement Learning Workshop, NeurIPS, 2022
2*2022
On actions that matter: Credit assignment and interpretability in reinforcement learning
J Ferret
Université de Lille, 2022
22022
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ...
arXiv preprint arXiv:2404.07839, 2024
2024
Offline Credit Assignment in Deep Reinforcement Learning with Hindsight Discriminator Networks
J Ferret, O Pietquin, M Geist
EWRL, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–18