TVSum: Summarizing web videos using titles Y Song, J Vallmitjana, A Stent, A Jaimes Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2015 | 733 | 2015 |
Learning from noisy labels with distillation Y Li, J Yang, Y Song, L Cao, J Luo, LJ Li Proceedings of the IEEE International Conference on Computer Vision, 1910-1918, 2017 | 654 | 2017 |
TGIF-QA: Toward spatio-temporal reasoning in visual question answering Y Jang, Y Song, Y Yu, Y Kim, G Kim Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017 | 536 | 2017 |
Polysemous visual-semantic embedding for cross-modal retrieval Y Song, M Soleymani Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 286 | 2019 |
TGIF: A new dataset and benchmark on animated gif description Y Li, Y Song, L Cao, J Tetreault, L Goldberg, A Jaimes, J Luo Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016 | 278 | 2016 |
Video co-summarization: Video summarization by visual co-occurrence WS Chu, Y Song, A Jaimes Proceedings of the IEEE conference on computer vision and pattern …, 2015 | 270 | 2015 |
Improving pairwise ranking for multi-label image classification Y Li, Y Song, J Luo Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 262 | 2017 |
# FluxFlow: Visual analysis of anomalous information spreading on social media J Zhao, N Cao, Z Wen, Y Song, YR Lin, C Collins IEEE transactions on visualization and computer graphics 20 (12), 1773-1782, 2014 | 229 | 2014 |
Continuous body and hand gesture recognition for natural human-computer interaction Y Song, D Demirdjian, R Davis ACM Transactions on Interactive Intelligent Systems (TiiS) 2 (1), 5, 2012 | 210 | 2012 |
Video2GIF: Automatic generation of animated gifs from video M Gygli, Y Song, L Cao Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016 | 180* | 2016 |
Tracking body and hands for gesture recognition: Natops aircraft handling signals database Y Song, D Demirdjian, R Davis 2011 IEEE International Conference on Automatic Face & Gesture Recognition …, 2011 | 155 | 2011 |
Action recognition by hierarchical sequence summarization Y Song, LP Morency, R Davis Proceedings of the IEEE conference on computer vision and pattern …, 2013 | 139 | 2013 |
Fast, cheap, and good: Why animated GIFs engage us S Bakhshi, DA Shamma, L Kennedy, Y Song, P de Juan, JJ Kaye Proceedings of the 2016 chi conference on human factors in computing systems …, 2016 | 128 | 2016 |
To click or not to click: Automatic selection of beautiful thumbnails from videos Y Song, M Redi, J Vallmitjana, A Jaimes Proceedings of the 25th ACM International on Conference on Information and …, 2016 | 115 | 2016 |
Active Contrastive Learning of Audio-Visual Video Representations S Ma, Z Zeng, D McDuff, Y Song International Conference on Learning Representations, 2021 | 112 | 2021 |
Multi-view latent variable discriminative models for action recognition Y Song, LP Morency, R Davis 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2120-2127, 2012 | 103 | 2012 |
Parameter Efficient Multimodal Transformers for Video Representation Learning S Lee, Y Yu, G Kim, T Breuel, J Kautz, Y Song International Conference on Learning Representations, 2021 | 88 | 2021 |
Multimodal Human Behavior Analysis: Learning Correlation and Interaction Across Modalities Y Song, LP Morency, R Davis Proceedings of the 14th ACM international conference on Multimodal …, 2012 | 88 | 2012 |
Computerized system and method for automatically detecting and rendering highlights from streaming videos Y Song, J Vallmitjana US Patent 10,390,082, 2019 | 83 | 2019 |
One-class conditional random fields for sequential anomaly detection Y Song, Z Wen, CY Lin, R Davis Proceedings of the Twenty-Third international joint conference on Artificial …, 2013 | 75* | 2013 |