Dmitry Lepikhin: Academic Profile

Citations

	Total	Since 2019
Citations	5737	5679
h-index	15	14
i10-index	16	15

Citations per year

Year	Citations
2019	37
2020	141
2021	279
2022	647
2023	1896
2024	2665

Dmitry Lepikhin

Google

Verified email at google.com


Title	Cited by	Year
LaMDA: Language models for dialog applications R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... arXiv preprint arXiv:2201.08239, 2022	1250	2022
PaLM 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	1031	2023
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	947	2023
GShard: Scaling giant models with conditional computation and automatic sharding D Lepikhin, HJ Lee, Y Xu, D Chen, O Firat, Y Huang, M Krikun, N Shazeer, ... arXiv preprint arXiv:2006.16668, 2020	786	2020
GLaM: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022	402	2022
Massively multilingual neural machine translation in the wild: Findings and challenges N Arivazhagan, A Bapna, O Firat, D Lepikhin, M Johnson, M Krikun, ... arXiv preprint arXiv:1907.05019, 2019	387	2019
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019	202	2019
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	166	2024
Toju Duke, Lucas Dixon, Kun Zhang, Quoc V N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... Le, Yonghui Wu, Zhifeng Chen, and Claire Cui, 2021	136	2021
GSPMD: general and scalable parallelization for ml computation graphs Y Xu, HJ Lee, D Chen, B Hechtman, Y Huang, R Joshi, M Krikun, ... arXiv preprint arXiv:2105.04663, 2021	98	2021
Renelito Delos Santos R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ...	94	2022
Beyond distillation: Task-level mixture-of-experts for efficient inference S Kudugunta, Y Huang, A Bapna, M Krikun, D Lepikhin, MT Luong, O Firat arXiv preprint arXiv:2110.03742, 2021	76	2021
Sunipa Dev R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... Jacob Devlin, Mark Díaz, Nan Du, Ethan Dyer, Vladimir Feinberg, Fangxiaoyu …, 2023	60	2023
A very large diversity space of synthetically accessible compounds for use with drug design programs S Nikitin, N Zaitseva, O Demina, V Solovieva, E Mazin, S Mikhalev, ... Journal of computer-aided molecular design 19, 47-63, 2005	43	2005
Sibyl: A system for large scale supervised machine learning K Canini, T Chandra, E Ie, J McFadden, K Goldman, M Gunter, J Harmsen, ... Technical Talk 1 (113), 2.3, 2012	39	2012
Massively multilingual neural machine translation in the wild: Findings and challenges A Naveen, B Ankur, F Orhan, L Dmitry, J Melvin, K Maxim, CM Xu, C Yuan, ... arXiv preprint arXiv: 1907.05019, 2019	14	2019
Exploring routing strategies for multilingual mixture-of-experts models S Kudugunta, Y Huang, A Bapna, M Krikun, D Lepikhin, T Luong, O Firat	4	2021
Systems and methods for routing within multitask mixture-of-experts models Y Huang, D Lepikhin, M Krikun, O Firat, A Bapna, T Luong, S Kudugunta US Patent App. 17/159,437, 2022	1	2022
Massively multilingual neural machine translation in the wild: Findings and challenges A Bapna, CA Cherry, DD Lepikhin, G Foster, M Krikun, M Johnson, ... July, 2019	1	2019
Attention neural networks with conditional computation D Lepikhin, Y Huang, O Firat, M Krikun, D Chen, NM Shazeer, HJ Lee, ... US Patent App. 18/009,841, 2023		2023
