Learning Sound Localization Better From Semantically Similar Samples A Senocak, H Ryu, J Kim, IS Kweon ICASSP IEEE International Conference on Acoustics, Speech and Signal ¡¦, 2022 | 25 | 2022 |
Less can be more: Sound source localization with a classification model A Senocak, H Ryu, J Kim, IS Kweon Proceedings of the IEEE/CVF Winter Conference on Applications of Computer ¡¦, 2022 | 22 | 2022 |
Generative bias for robust visual question answering JW Cho, DJ Kim, H Ryu, IS Kweon Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern ¡¦, 2023 | 14 | 2023 |
Hindi as a second language: Improving visually grounded speech with semantically similar samples H Ryu, A Senocak, IS Kweon, JS Chung ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and ¡¦, 2023 | 4 | 2023 |
Sound source localization is all about cross-modal alignment A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung Proceedings of the IEEE/CVF International Conference on Computer Vision ¡¦, 2023 | 4 | 2023 |
Audio-visual fusion layers for event type aware video recognition A Senocak, J Kim, TH Oh, H Ryu, D Li, IS Kweon arXiv preprint arXiv:2202.05961, 2022 | 1 | 2022 |
SPEECH GUIDED MASKED IMAGE MODELING FOR VISUALLY GROUNDED SPEECH J Woo, H Ryu, A Senocak, JS Chung | | |