Nerv: Neural representations for videos H Chen, B He, H Wang, Y Ren, SN Lim, A Shrivastava NeurIPS 2021, 2021 | 162 | 2021 |
ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization B He, X Yang, L Kang, Z Cheng, X Zhou, A Shrivastava CVPR 2022, 2022 | 82 | 2022 |
Align and Attend: Multimodal Summarization with Dual Contrastive Losses B He, J Wang, J Qiu, T Bui, A Shrivastava, Z Wang CVPR 2023, 2023 | 34 | 2023 |
To see is to believe: Prompting gpt-4v for better visual instruction tuning J Wang, L Meng, Z Weng, B He, Z Wu, YG Jiang arXiv preprint arXiv:2311.07574, 2023 | 30 | 2023 |
Feature combination meets attention: Baidu soccer embeddings and transformer based temporal detection X Zhou, L Kang, Z Cheng, B He, J Xin arXiv preprint arXiv:2106.14447, 2021 | 30 | 2021 |
GTA: Global Temporal Attention for Video Action Understanding B He, X Yang, Z Wu, H Chen, SN Lim, A Shrivastava BMVC 2021, 2021 | 28 | 2021 |
Towards Scalable Neural Representation for Diverse Videos B He, X Yang, H Wang, Z Wu, H Chen, S Huang, Y Ren, SN Lim, ... CVPR 2023, 2023 | 20 | 2023 |
Learning Semantic Correspondence with Sparse Annotations S Huang, L Yang, B He, S Zhang, X He, A Shrivastava ECCV 2022, 2022 | 18 | 2022 |
CNeRV: Content-adaptive Neural Representation for Visual Data H Chen, M Gwilliam, B He, SN Lim, A Shrivastava BMVC 2022, 2022 | 15 | 2022 |
Recognizing actions using object states N Saini, B He, G Shrivastava, SS Rambhatla, A Shrivastava ICLR2022 Workshop on the Elements of Reasoning: Objects, Structure and Causality, 2022 | 12 | 2022 |
Chop & learn: Recognizing and generating object-state compositions N Saini, H Wang, A Swaminathan, V Jayasundara, B He, K Gupta, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 10 | 2023 |
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding B He, H Li, YK Jang, M Jia, X Cao, A Shah, A Shrivastava, SN Lim CVPR 2024, 2024 | 7 | 2024 |
Omnivid: A generative framework for universal video understanding J Wang, D Chen, C Luo, B He, L Yuan, Z Wu, YG Jiang CVPR 2024, 2024 | 4 | 2024 |
Transformer-based temporal detection in video Z Cheng, L Kang, X Zhou, H Tian, X Li, B He, XIN Jingyu US Patent App. 17/572,624, 2023 | 1 | 2023 |
Content-Aware Image Color Editing With Auxiliary Color Restoration Tasks Y Ren, J Shi, Z Zhang, Y Fan, Z Lin, B He, A Shrivastava WACV 2024, 5192-5201, 2024 | | 2024 |
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Supplementary Material B He, H Li, YK Jang, M Jia, X Cao, A Shah, A Shrivastava, SN Lim | | |
Appendix for Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks Y Ren, J Shi, Z Zhang, Y Fan, Z Lin, B He, A Shrivastava | | |
Align and Attend: Multimodal Summarization with Dual Contrastive Losses Supplementary Material B He, J Wang, J Qiu, T Bui, A Shrivastava, Z Wang | | |
ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization Supplementary Material B He, X Yang, ZC Le Kang, X Zhou, A Shrivastava | | |