Nir Levine
Research Engineer at DeepMind
Verified email at google.com
Title
Cited by
Year
Improved knowledge distillation via teacher assistant
SI Mirzadeh, M Farajtabar, A Li, N Levine, A Matsukawa, H Ghasemzadeh
Proceedings of the AAAI conference on artificial intelligence 34 (04), 5191-5198, 2020
Cited by: 1013; Year: 2020
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 568; Year: 2023
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis
G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ...
Machine Learning 110 (9), 2419-2468, 2021
Cited by: 369; Year: 2021
Rotting bandits
N Levine, K Crammer, S Mannor
Advances in neural information processing systems 30, 2017
Cited by: 129; Year: 2017
An empirical investigation of the challenges of real-world reinforcement learning
G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ...
arXiv preprint arXiv:2003.11881, 2020
Cited by: 118; Year: 2020
Robust reinforcement learning for continuous control with model misspecification
DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ...
arXiv preprint arXiv:1906.07516, 2019
Cited by: 113; Year: 2019
Shallow updates for deep reinforcement learning
N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor
Advances in Neural Information Processing Systems 30, 2017
Cited by: 52; Year: 2017
Prediction, consistency, curvature: Representation learning for locally-linear control
N Levine, Y Chow, R Shu, A Li, M Ghavamzadeh, H Bui
arXiv preprint arXiv:1909.01506, 2019
Cited by: 29; Year: 2019
Optimization and generalization of regularization-based continual learning: a loss approximation viewpoint
D Yin, M Farajtabar, A Li, N Levine, A Mott
arXiv preprint arXiv:2006.10974, 2020
Cited by: 27; Year: 2020
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 22; Year: 2024
Balancing constraints and rewards with meta-gradient d4pg
DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann
International Conference on Learning Representations, 2020
Cited by: 21; Year: 2020
An extended relevance model for session search
N Levine, H Roitman, D Cohen
Proceedings of the 40th International ACM SIGIR Conference on Research and …, 2017
Cited by: 21; Year: 2017
Task-agnostic continual learning with hybrid probabilistic models
P Kirichenko, M Farajtabar, D Rao, B Lakshminarayanan, N Levine, A Li, ...
arXiv preprint arXiv:2106.12772, 2021
Cited by: 16; Year: 2021
A maximum-entropy approach to off-policy evaluation in average-reward mdps
N Lazic, D Yin, M Farajtabar, N Levine, D Gorur, C Harris, D Schuurmans
Advances in Neural Information Processing Systems 33, 12461-12471, 2020
Cited by: 11; Year: 2020
Actively learning to attract followers on Twitter
N Levine, TA Mann, S Mannor
arXiv preprint arXiv:1504.04114, 2015
Cited by: 3; Year: 2015
Robust reinforcement learning for continuous control with model misspecification
DJ Mankowitz, N Levine, RC Jeong, A Abdolmaleki, JT Springenberg, ...
US Patent App. 17/620,164, 2022
Cited by: 2; Year: 2022
Relevance model for session search
H Roitman, D Cohen, N Levine
US Patent 10,956,409, 2021
Year: 2021
Neural Rate Control for Video Encoding using Imitation Learning
H Mao, C Gu, M Wang, A Chen, N Lazic, N Levine, D Pang, R Claus, ...
arXiv preprint arXiv:2012.05339, 2020
Year: 2020
Articles 1–18