Learning similarity conditions without explicit supervision R Tan, MI Vasileva, K Saenko, BA Plummer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 107 | 2019 |
LoGAN: Latent graph co-attention network for weakly-supervised video moment retrieval R Tan, H Xu, K Saenko, BA Plummer Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021 | 71 | 2021 |
Detecting cross-modal inconsistency to defend against neural fake news R Tan, BA Plummer, K Saenko Empirical Methods in Natural Language Processing (EMNLP) 2020, 2081–2106, 2020 | 68 | 2020 |
Language features matter: Effective language representations for vision-language tasks A Burns, R Tan, K Saenko, S Sclaroff, BA Plummer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 36 | 2019 |
Look at what I’m doing: Self-supervised spatial grounding of narrations in instructional videos R Tan, B Plummer, K Saenko, H Jin, B Russell Advances in Neural Information Processing Systems 34, 14476-14487, 2021 | 21 | 2021 |
wMAN: Weakly-supervised moment alignment network for text-based video segment retrieval R Tan, H Xu, K Saenko, BA Plummer | 17 | 2019 |
Language-Guided Audio-Visual Source Separation via Trimodal Consistency R Tan, A Ray, A Burns, BA Plummer, J Salamon, O Nieto, B Russell, ... The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2023, 2023 | 8 | 2023 |
NewsStories: Illustrating articles with visual summaries R Tan, BA Plummer, K Saenko, JP Lewis, A Sud, T Leung European Conference on Computer Vision 2022 (Springer, Cham), 644-661, 2022 | 7 | 2022 |
Multiscale video pretraining for long-term activity forecasting R Tan, M De Lange, M Iuzzolino, BA Plummer, K Saenko, K Ridgeway, ... arXiv preprint arXiv:2307.12854, 2023 | 3 | 2023 |
Socratis: Are large multimodal models emotionally aware? K Deng, A Ray, R Tan, S Gabriel, BA Plummer, K Saenko ICCV 2023 WECIA, 2023 | 2 | 2023 |
EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video M De Lange, H Eghbalzadeh, R Tan, M Iuzzolino, F Meier, K Ridgeway arXiv preprint arXiv:2307.05784, 2023 | 1 | 2023 |
Koala: Key frame-conditioned long video-LLM R Tan, X Sun, P Hu, J Wang, H Deilamsalehy, BA Plummer, B Russell, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |
Localization Of Narrations In Image Data H Jin, B Russell, RXH Tan US Patent App. 17/499,193, 2023 | | 2023 |
Analytic system for measuring bat speed R Tan | | 2018 |
Language-Guided Audio-Visual Source Separation via Trimodal Consistency Supplemental Material R Tan, A Ray, A Burns, BA Plummer, J Salamon, O Nieto, B Russell, ... | | |
Language Features Matter: Effective Language Representations for Vision-Language Tasks Supplementary A Burns, R Tan, K Saenko, S Sclaroff, BA Plummer | | |