A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild KR Prajwal, R Mukhopadhyay, VP Namboodiri, CV Jawahar Proceedings of the 28th ACM International Conference on Multimedia, 484-492, 2020 | 587 | 2020 |
DRUNET: a dilated-residual U-Net deep learning network to segment optic nerve head tissues in optical coherence tomography images SK Devalla, PK Renukanand, BK Sreedhar, G Subramanian, L Zhang, ... Biomedical optics express 9 (7), 3244-3265, 2018 | 197 | 2018 |
Towards Automatic Face-to-Face Translation P KR, R Mukhopadhyay, J Philip, A Jha, V Namboodiri, CV Jawahar Proceedings of the 27th ACM International Conference on Multimedia, 1428-1436, 2019 | 168 | 2019 |
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis KR Prajwal, R Mukhopadhyay, VP Namboodiri, CV Jawahar Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 102 | 2020 |
Sub-word Level Lip Reading With Visual Attention KR Prajwal, T Afouras, A Zisserman arXiv preprint arXiv:2110.07603, 2021 | 86 | 2021 |
IndicSpeech: Text-to-Speech Corpus for Indian Languages N Srivastava, R Mukhopadhyay, KR Prajwal, CV Jawahar Proceedings of The 12th Language Resources and Evaluation Conference, 6417-6422, 2020 | 29 | 2020 |
Automatic dense annotation of large-vocabulary sign language videos L Momeni, H Bull, KR Prajwal, S Albanie, G Varol, A Zisserman European Conference on Computer Vision, 671-690, 2022 | 19 | 2022 |
Visual Speech Enhancement Without A Real Visual Stream SB Hegde, KR Prajwal, R Mukhopadhyay, VP Namboodiri, CV Jawahar Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021 | 19 | 2021 |
Data-efficient training strategies for neural TTS systems KR Prajwal, CV Jawahar Proceedings of the 3rd ACM India Joint International Conference on Data …, 2021 | 12 | 2021 |
Visual Keyword Spotting with Attention KR Prajwal, L Momeni, T Afouras, A Zisserman arXiv preprint arXiv:2110.15957, 2021 | 11 | 2021 |
Towards Increased Accessibility of Meme Images with the Help of Rich Face Emotion Captions KR Prajwal, CV Jawahar, P Kumaraguru Proceedings of the 27th ACM International Conference on Multimedia, 202-210, 2019 | 11 | 2019 |
Weakly-supervised Fingerspelling Recognition in British Sign Language KR Prajwal, H Bull, L Momeni, S Albanie, G Varol, A Zisserman British Machine Vision Conference (BMVC) 2022, 2022 | 7* | 2022 |
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild SB Hegde, KR Prajwal, R Mukhopadhyay, VP Namboodiri, CV Jawahar Proceedings of the 30th ACM International Conference on Multimedia, 6250-6258, 2022 | 6 | 2022 |
System and method for lip-syncing a face to target speech using a machine learning model CV Jawahar, R Mukhopadhyay, KR Prajwal, V Namboodiri US Patent App. 17/567,120, 2022 | 1 | 2022 |
A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision C Raude, KR Prajwal, L Momeni, H Bull, S Albanie, A Zisserman, G Varol arXiv preprint arXiv:2405.10266, 2024 | | 2024 |
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation KR Prajwal, B Shi, M Le, A Vyas, A Tjandra, M Luthra, B Guo, H Wang, ... Forty-first International Conference on Machine Learning, 0 | | |
The Interplay of Speech and Lip Movements P KR, R Mukhopadhyay, S Hegde, V Namboodiri, CV Jawahar | | |