Learning Sound Localization Better From Semantically Similar Samples | A Senocak, H Ryu, J Kim, IS Kweon | ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2022 | 30 | 2022 |
Less can be more: Sound source localization with a classification model | A Senocak, H Ryu, J Kim, IS Kweon | Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022 | 26 | 2022 |
Generative bias for robust visual question answering | JW Cho, DJ Kim, H Ryu, IS Kweon | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 25 | 2023 |
Sound source localization is all about cross-modal alignment | A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 12 | 2023 |
Hindi as a second language: Improving visually grounded speech with semantically similar samples | H Ryu, A Senocak, IS Kweon, JS Chung | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Audio-visual fusion layers for event type aware video recognition | A Senocak, J Kim, TH Oh, H Ryu, D Li, IS Kweon | arXiv preprint arXiv:2202.05961, 2022 | 1 | 2022 |
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment | A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung | arXiv preprint arXiv:2407.13676, 2024 | | 2024 |
Speech Guided Masked Image Modeling for Visually Grounded Speech | J Woo, H Ryu, A Senocak, JS Chung | ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |