Antoine Miech

Cited by

	All	Since 2019
Citations	5641	5548
h-index	16	16
i10-index	19	19

Citations per year

Year	Citations
2018	71
2019	103
2020	226
2021	521
2022	948
2023	2261
2024	1482

Public access

9 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Ivan Laptev: Visiting professor at MBZUAI, on leave from INRIA. Verified email at inria.fr
Josef Sivic: Czech Technical University, CIIRC, ELLIS Unit Prague. Verified email at cvut.cz
Jean-Baptiste Alayrac: DeepMind, London. Verified email at google.com
Cordelia Schmid: Research director, INRIA. Verified email at inria.fr
Andrew Zisserman: University of Oxford. Verified email at robots.ox.ac.uk
Antoine Yang: Google DeepMind. Verified email at google.com
Makarand Tapaswi: IIIT Hyderabad, Wadhwani AI. Verified email at iiit.ac.in
Dimitri Zhukov: Tractable. Verified email at tractable.ai
Lorenzo Torresani: Meta, Fundamental AI Research (FAIR). Verified email at meta.com
Heng Wang: TikTok. Verified email at fb.com
Du Tran: Google. Verified email at google.com
Piotr Bojanowski: Meta AI. Verified email at fb.com
Jeff Donahue: Research Scientist, DeepMind. Verified email at google.com
Karen Simonyan: Chief Scientist, Microsoft AI. Verified email at microsoft.com

Antoine Miech

Google DeepMind

Verified email at google.com

Computer Vision


Title	Authors	Venue	Cited by	Year
Flamingo: a visual language model for few-shot learning	JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ...	Advances in neural information processing systems 35, 23716-23736, 2022	1920	2022
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips	A Miech, D Zhukov, JB Alayrac, M Tapaswi, I Laptev, J Sivic	Proceedings of the IEEE International Conference on Computer Vision, 2630-2640, 2019	1033	2019
End-to-end learning of visual representations from uncurated instructional videos	A Miech, JB Alayrac, L Smaira, I Laptev, J Sivic, A Zisserman	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	693	2020
Gemini: a family of highly capable multimodal models	G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...	arXiv preprint arXiv:2312.11805, 2023	457	2023
Learnable pooling with context gating for video classification	A Miech, I Laptev, J Sivic	arXiv preprint arXiv:1706.06905, 2017	380	2017
Learning a text-video embedding from incomplete and heterogeneous data	A Miech, I Laptev, J Sivic	arXiv preprint arXiv:1804.02516, 2018	249	2018
Just ask: Learning to answer questions from millions of narrated videos	A Yang, A Miech, J Sivic, I Laptev, C Schmid	Proceedings of the IEEE/CVF international conference on computer vision …, 2021	237	2021
Zero-shot video question answering via frozen bidirectional language models	A Yang, A Miech, J Sivic, I Laptev, C Schmid	Advances in Neural Information Processing Systems 35, 124-141, 2022	137	2022
Thinking fast and slow: Efficient text-to-visual retrieval with transformers	A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	129	2021
Vid2seq: Large-scale pretraining of a visual language model for dense video captioning	A Yang, A Nagrani, PH Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	95	2023
Tubedetr: Spatio-temporal video grounding with transformers	A Yang, A Miech, J Sivic, I Laptev, C Schmid	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	76	2022
Leveraging the present to anticipate the future in videos	A Miech, I Laptev, J Sivic, H Wang, L Torresani, D Tran	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	74	2019
Learning from video and text via large-scale discriminative clustering	A Miech, JB Alayrac, P Bojanowski, I Laptev, J Sivic	Proceedings of the IEEE international conference on computer vision, 5257-5266, 2017	49	2017
Learning to answer visual questions from web videos	A Yang, A Miech, J Sivic, I Laptev, C Schmid	arXiv preprint arXiv:2205.05019, 2022	22	2022
Look for the change: Learning object states and state-modifying actions from untrimmed web videos	T Souček, JB Alayrac, A Miech, I Laptev, J Sivic	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	18	2022
Rareact: A video dataset of unusual interactions	A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman	arXiv preprint arXiv:2008.01018, 2020	18	2020
The end-of-end-to-end: A video understanding pentathlon challenge (2020)	S Albanie, Y Liu, A Nagrani, A Miech, E Coto, I Laptev, R Sukthankar, ...	arXiv preprint arXiv:2008.00744, 2020	14	2020
Zorro: the masked multimodal transformer	A Recasens, J Lin, J Carreira, D Jaegle, L Wang, J Alayrac, P Luc, ...	arXiv preprint arXiv:2301.09595, 2023	13	2023
Perception test: A diagnostic benchmark for multimodal video models	V Patraucean, L Smaira, A Gupta, A Recasens, L Markeeva, D Banarse, ...	Advances in Neural Information Processing Systems 36, 2024	11	2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context	M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...	arXiv preprint arXiv:2403.05530, 2024	6	2024
