Antoine Miech 个人学术档案

引用次数

	总计	2019 年至今
引用	6810	6717
h 指数	17	17
i10 指数	20	20

2800

1400

700

2100

201820192020202120222023202472 104 224 523 935 2129 2793

开放获取的出版物数量

查看全部

9 篇文章

2 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Ivan LaptevVisiting professor at MBZUAI, on leave from INRIA在 inria.fr 的电子邮件经过验证
Josef SivicCzech Technical University, CIIRC, ELLIS Unit Prague在 cvut.cz 的电子邮件经过验证
Jean-Baptiste AlayracDeepMind, London在 google.com 的电子邮件经过验证
Andrew ZissermanUniversity of Oxford在 robots.ox.ac.uk 的电子邮件经过验证
Cordelia SchmidResearch director INRIA 在 inria.fr 的电子邮件经过验证
Antoine YangGoogle DeepMind在 google.com 的电子邮件经过验证
Makarand TapaswiIIIT Hyderabad, Wadhwani AI在 iiit.ac.in 的电子邮件经过验证
Dimitri ZhukovTractable在 tractable.ai 的电子邮件经过验证
Lorenzo TorresaniMeta, Fundamental AI Research (FAIR)在 meta.com 的电子邮件经过验证
Heng WangTikTok在 fb.com 的电子邮件经过验证
Du TranGoogle在 google.com 的电子邮件经过验证
Piotr BojanowskiMeta AI在 fb.com 的电子邮件经过验证
Jeff DonahueResearch Scientist, DeepMind在 google.com 的电子邮件经过验证
Karen SimonyanChief Scientist, Microsoft AI在 microsoft.com 的电子邮件经过验证

关注

Antoine Miech

Google DeepMind

在 google.com 的电子邮件经过验证 - 首页

Computer Vision


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Flamingo: a visual language model for few-shot learning JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ... Advances in neural information processing systems 35, 23716-23736, 2022	2302	2022
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips A Miech, D Zhukov, JB Alayrac, M Tapaswi, I Laptev, J Sivic Proceedings of the IEEE International Conference on Computer Vision, 2630-2640, 2019	1096	2019
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	865	2023
End-to-end learning of visual representations from uncurated instructional videos A Miech, JB Alayrac, L Smaira, I Laptev, J Sivic, A Zisserman Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	726	2020
Learnable pooling with context gating for video classification A Miech, I Laptev, J Sivic arXiv preprint arXiv:1706.06905, 2017	382	2017
Just ask: Learning to answer questions from millions of narrated videos A Yang, A Miech, J Sivic, I Laptev, C Schmid Proceedings of the IEEE/CVF international conference on computer vision …, 2021	261	2021
Learning a text-video embedding from incomplete and heterogeneous data A Miech, I Laptev, J Sivic arXiv preprint arXiv:1804.02516, 2018	252	2018
Zero-shot video question answering via frozen bidirectional language models A Yang, A Miech, J Sivic, I Laptev, C Schmid Advances in Neural Information Processing Systems 35, 124-141, 2022	166	2022
Thinking fast and slow: Efficient text-to-visual retrieval with transformers A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	140	2021
Vid2seq: Large-scale pretraining of a visual language model for dense video captioning A Yang, A Nagrani, PH Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	136	2023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	134	2024
Tubedetr: Spatio-temporal video grounding with transformers A Yang, A Miech, J Sivic, I Laptev, C Schmid Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	80	2022
Leveraging the present to anticipate the future in videos A Miech, I Laptev, J Sivic, H Wang, L Torresani, D Tran Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	77	2019
Learning from video and text via large-scale discriminative clustering A Miech, JB Alayrac, P Bojanowski, I Laptev, J Sivic Proceedings of the IEEE international conference on computer vision, 5257-5266, 2017	48	2017
Learning to answer visual questions from web videos A Yang, A Miech, J Sivic, I Laptev, C Schmid arXiv preprint arXiv:2205.05019, 2022	28	2022
Look for the change: Learning object states and state-modifying actions from untrimmed web videos T Souček, JB Alayrac, A Miech, I Laptev, J Sivic Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	24	2022
Rareact: A video dataset of unusual interactions A Miech, JB Alayrac, I Laptev, J Sivic, A Zisserman arXiv preprint arXiv:2008.01018, 2020	20	2020
Zorro: the masked multimodal transformer A Recasens, J Lin, J Carreira, D Jaegle, L Wang, J Alayrac, P Luc, ... arXiv preprint arXiv:2301.09595, 2023	16	2023
Perception test: A diagnostic benchmark for multimodal video models V Patraucean, L Smaira, A Gupta, A Recasens, L Markeeva, D Banarse, ... Advances in Neural Information Processing Systems 36, 2024	15	2024
The end-of-end-to-end: A video understanding pentathlon challenge (2020) S Albanie, Y Liu, A Nagrani, A Miech, E Coto, I Laptev, R Sukthankar, ... arXiv preprint arXiv:2008.00744, 2020	14	2020

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用