Vatt: Transformers for multimodal self-supervised learning from raw video, audio and text H Akbari, L Yuan, R Qian, WH Chuang, SF Chang, Y Cui, B Gong Advances in Neural Information Processing Systems 34, 24206-24221, 2021 | 556 | 2021 |
Unsupervised event-based learning of optical flow, depth, and egomotion AZ Zhu, L Yuan, K Chaney, K Daniilidis Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 446 | 2019 |
EV-FlowNet: Self-supervised optical flow estimation for event-based cameras AZ Zhu, L Yuan, K Chaney, K Daniilidis arXiv preprint arXiv:1802.06898, 2018 | 423 | 2018 |
Movinets: Mobile video networks for efficient video recognition D Kondratyuk, L Yuan, Y Li, L Zhang, M Tan, M Brown, B Gong Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 237 | 2021 |
Surrogate gap minimization improves sharpness-aware training J Zhuang, B Gong, L Yuan, Y Cui, H Adam, N Dvornek, S Tatikonda, ... arXiv preprint arXiv:2203.08065, 2022 | 122 | 2022 |
Unsupervised event-based optical flow using motion compensation A Zihao Zhu, L Yuan, K Chaney, K Daniilidis Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 0-0, 2018 | 79 | 2018 |
Deeplab2: A tensorflow library for deep labeling M Weber, H Wang, S Qiao, J Xie, MD Collins, Y Zhu, L Yuan, D Kim, Q Yu, ... arXiv preprint arXiv:2106.09748, 2021 | 47 | 2021 |
Human gaze-driven spatial tasking of an autonomous MAV L Yuan, C Reardon, G Warnell, G Loianno IEEE Robotics and Automation Letters 4 (2), 1343-1350, 2019 | 45 | 2019 |
Learning view-disentangled human pose representation by contrastive cross-view mutual information maximization L Zhao, Y Wang, J Zhao, L Yuan, JJ Sun, F Schroff, H Adam, X Peng, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 31 | 2021 |
Zoom-in-to-check: Boosting video interpolation via instance-level discrimination L Yuan, Y Chen, H Liu, T Kong, J Shi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 31 | 2019 |
View-invariant, occlusion-robust probabilistic embedding for human pose T Liu, JJ Sun, L Zhao, J Zhao, L Yuan, Y Wang, LC Chen, F Schroff, ... International Journal of Computer Vision 130 (1), 111-135, 2022 | 17 | 2022 |
Contextualized spatio-temporal contrastive learning with self-supervision L Yuan, R Qian, Y Cui, B Gong, F Schroff, MH Yang, H Adam, T Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 15 | 2022 |
Videoglue: Video general understanding evaluation of foundation models L Yuan, NB Gundavarapu, L Zhao, H Zhou, Y Cui, L Jiang, X Yang, M Jia, ... arXiv preprint arXiv:2307.03166, 2023 | 7 | 2023 |
Unified visual relationship detection with vision and language models L Zhao, L Yuan, B Gong, Y Cui, F Schroff, MH Yang, H Adam, T Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 7 | 2023 |
Unsupervised Event-based Learning of Optical Flow AZ Zhu, L Yuan, K Chaney, K Daniilidis Depth, and Egomotion. arXiv e-prints, page, 2019 | 7 | 2019 |
PolyMaX: General Dense Prediction with Mask Transformer X Yang, L Yuan, K Wilber, A Sharma, X Gu, S Qiao, S Debats, H Wang, ... Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 6 | 2024 |
Exploring temporal granularity in self-supervised video representation learning R Qian, Y Li, L Yuan, B Gong, T Liu, M Brown, S Belongie, MH Yang, ... arXiv preprint arXiv:2112.04480, 2021 | 6 | 2021 |
Distilling vision-language models on millions of videos Y Zhao, L Zhao, X Zhou, J Wu, CT Chu, H Miao, F Schroff, H Adam, T Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 5 | 2024 |
Learning from semantic alignment between unpaired multiviews for egocentric video recognition Q Wang, L Zhao, L Yuan, T Liu, X Peng Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 5 | 2023 |
VideoPrism: A Foundational Visual Encoder for Video Understanding L Zhao, NB Gundavarapu, L Yuan, H Zhou, S Yan, JJ Sun, L Friedman, ... arXiv preprint arXiv:2402.13217, 2024 | 3 | 2024 |