Title | Authors | Venue | Cited by | Year |
DINOv2: Learning robust visual features without supervision | M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ... | arXiv preprint arXiv:2304.07193, 2023 | 1214* | 2023 |
A survey of deep active learning | P Ren, Y Xiao, X Chang, PY Huang, Z Li, BB Gupta, X Chen, X Wang | ACM Computing Surveys (CSUR) 54 (9), 1-40, 2021 | 1098 | 2021 |
A comprehensive survey of neural architecture search: Challenges and solutions | P Ren, Y Xiao, X Chang, PY Huang, Z Li, X Chen, X Wang | ACM Computing Surveys (CSUR) 54 (4), 1-34, 2021 | 620 | 2021 |
VideoCLIP: Contrastive pre-training for zero-shot video-text understanding | H Xu, G Ghosh, PY Huang, D Okhonko, A Aghajanyan, F Metze, ... | EMNLP 2021, 2021 | 438 | 2021 |
Support-set bottlenecks for video-text representation learning | M Patrick*, PY Huang*, Y Asano*, F Metze, A Hauptmann, J Henriques, ... | ICLR 2021, 2020 | 261 | 2020 |
Self-Supervised Deep Correlation Tracking | D Yuan, X Chang, PY Huang, Q Liu, Z He | IEEE Transactions on Image Processing (TIP), 2020 | 240 | 2020 |
Attention-based multimodal neural machine translation | PY Huang, F Liu, SR Shiang, J Oh, C Dyer | First Conference on Machine Translation (WMT16), 2016 | 220 | 2016 |
Masked autoencoders that listen | PY Huang, H Xu, J Li, A Baevski, M Auli, W Galuba, F Metze, ... | NeurIPS 2022, 2022 | 160 | 2022 |
Structural analysis and optimization of convolutional neural networks with a small sample size | RN D’souza, PY Huang, FC Yeh | Scientific Reports 10 (1), 834, 2020 | 135 | 2020 |
CM3: A causal masked multimodal model of the internet | A Aghajanyan, B Huang, C Ross, V Karpukhin, H Xu, N Goyal, D Okhonko, ... | arXiv preprint arXiv:2201.07520, 2022 | 122 | 2022 |
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding | H Xu, G Ghosh, PY Huang, P Arora, M Aminzadeh, C Feichtenhofer, ... | ACL-Findings 2021, 2021 | 121 | 2021 |
RCAA: Relational context-aware agents for person search | X Chang, PY Huang, YD Shen, X Liang, Y Yang, AG Hauptmann | ECCV 2018, 2018 | 117 | 2018 |
Video pivoting unsupervised multi-modal machine translation | M Li, PY Huang, X Chang, J Hu, Y Yang, A Hauptmann | IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (3), 3918-3932, 2022 | 109 | 2022 |
Entity hierarchy embedding | Z Hu, PY Huang, Y Deng, Y Gao, E Xing | ACL 2015, 2015 | 88 | 2015 |
Introducing Meta Llama 3: The most capable openly available LLM to date | AI Meta | Meta AI, 2024 | 61* | 2024 |
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models | PY Huang*, M Patrick*, J Hu, G Neubig, F Metze, A Hauptmann | NAACL 2021, 2021 | 58 | 2021 |
Demystifying CLIP data | H Xu, S Xie, XE Tan, PY Huang, R Howes, V Sharma, SW Li, G Ghosh, ... | ICLR 2024, 2023 | 54 | 2023 |
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation | L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ... | arXiv preprint arXiv:2308.11596, 2023 | 48 | 2023 |
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting | PY Huang, J Hu, X Chang, A Hauptmann | ACL 2020, 2020 | 46 | 2020 |
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | C Ryali, YT Hu, D Bolya, C Wei, H Fan, PY Huang, V Aggarwal, ... | ICML 2023, 2023 | 44 | 2023 |