Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 794 | 2023 |
Search query predictions by a keyboard J Cao, A Greenberg, A Sharma, Y Su, K Nicholas, M Mohsin, J Jurewicz, ... US Patent 9,720,955, 2017 | 72 | 2017 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 69 | 2024 |
PyGlove: Symbolic programming for automated machine learning D Peng, X Dong, E Real, M Tan, Y Lu, G Bender, H Liu, A Kraft, C Liang, ... Advances in Neural Information Processing Systems 33, 96-108, 2020 | 32 | 2020 |
Mapping images to search queries M Sharifi, D Petrou, A Sharma US Patent 10,489,410, 2019 | 22 | 2019 |
Search query predictions by a keyboard J Cao, A Greenberg, A Sharma, Y Su, K Nicholas, M Mohsin, J Jurewicz, ... US Patent 10,305,828, 2019 | 21 | 2019 |
Towards better semantic understanding of mobile interfaces S Sunkara, M Wang, L Liu, G Baechler, YC Hsiao, A Sharma, J Stout arXiv preprint arXiv:2210.02663, 2022 | 14 | 2022 |
On-device image recognition A Sharma, F Zubach, T Binder, L Mach, S El Ghazzal, M Sharifi US Patent 10,769,428, 2020 | 7 | 2020 |
ScreenAI: A Vision-Language Model for UI and Infographics Understanding G Baechler, S Sunkara, M Wang, F Zubach, H Mansoor, V Etter, ... arXiv preprint arXiv:2402.04615, 2024 | 3 | 2024 |
Visual recognition using user tap locations A Sharma, D Petrou, M Sharifi US Patent 10,664,519, 2020 | 2 | 2020 |
Chart-based reasoning: Transferring capabilities from llms to vlms V Carbune, H Mansoor, F Liu, R Aralikatte, G Baechler, J Chen, A Sharma arXiv preprint arXiv:2403.12596, 2024 | 1 | 2024 |
Visual recognition using user tap locations A Sharma, D Petrou, M Sharifi US Patent 11,461,386, 2022 | 1 | 2022 |
Surfacing images of a collection based on device context M Sharifi, K Naliuka, A Sharma | 1 | 2018 |
Automated assistant control of non-assistant applications via identification of synonymous term and/or speech processing biasing J Lange, A Sharma, A Coimbra, G Bakir, G Taubman, I Firman, J Chen, ... US Patent 11,967,321, 2024 | | 2024 |
Mapping Images to Search Queries M Sharifi, A Sharma, D Petrou US Patent App. 18/344,509, 2023 | | 2023 |
Mapping images to search queries M Sharifi, D Petrou, A Sharma US Patent 11,734,287, 2023 | | 2023 |
Visual Recognition Using User Tap Locations A Sharma, D Petrou, M Sharifi US Patent App. 17/958,728, 2023 | | 2023 |
Mapping images to search queries M Sharifi, D Petrou, A Sharma US Patent 11,269,897, 2022 | | 2022 |
Systems and Methods for Providing a Machine-Learned Model with Adjustable Computational Demand A Sharma, A Mordvintsev, M Sharifi US Patent App. 16/972,429, 2021 | | 2021 |
Scanning and Ranking a Stream of Visual Objects M Sharifi, A Sharma | | 2021 |