Query-adaptive video summarization via quality-aware relevance estimation AB Vasudevan, M Gygli, A Volokitin, L Van Gool Proceedings of the 25th ACM international conference on Multimedia, 582-590, 2017 | 101 | 2017 |
Object referring in videos with language and human gaze AB Vasudevan, D Dai, L Van Gool Proceedings of the IEEE Conference on Computer Vision and Pattern ¡¦, 2018 | 62 | 2018 |
Talk2nav: Long-range vision-and-language navigation with dual attention and spatial memory AB Vasudevan, D Dai, L Van Gool International Journal of Computer Vision 129, 246-266, 2021 | 43 | 2021 |
Semantic object prediction and spatial sound super-resolution with binaural sounds AB Vasudevan, D Dai, L Van Gool European conference on computer vision, 638-655, 2020 | 42 | 2020 |
Object referring in visual scene with spoken language AB Vasudevan, D Dai, L Van Gool 2018 IEEE winter conference on applications of computer vision (WACV), 1861-1870, 2018 | 21 | 2018 |
Dynamic scene classification using spatial and temporal cues A Vasudevan, S Muralidharan, S Chintapalli, S Raman Proceedings of the IEEE International Conference on Computer Vision ¡¦, 2013 | 21 | 2013 |
Binaural soundnet: predicting semantics, depth and motion with binaural sounds D Dai, AB Vasudevan, J Matas, L Van Gool IEEE transactions on pattern analysis and machine intelligence 45 (1), 123-136, 2022 | 6 | 2022 |
ETH-CVL@ MediaEval 2016: Textual-Visual Embeddings and Video2GIF for Video Interestingness. AB Vasudevan, M Gygli, A Volokitin, L Van Gool MediaEval, 2016 | 5 | 2016 |
Sound and visual representation learning with multiple pretraining tasks AB Vasudevan, D Dai, L Van Gool Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern ¡¦, 2022 | 4 | 2022 |
A novel approach to the extraction of multiple salient objects in an image S Muralidharan, AB Vasudevan, CS Pratheek, S Raman 2015 IEEE International Conference on Signal Processing, Informatics ¡¦, 2015 | 1 | 2015 |
Motion characterization of a dynamic scene AB Vasudevan, S Muralidharan, SP Chintapalli, S Raman 2014 International Conference on Computer Vision Theory and Applications ¡¦, 2014 | 1 | 2014 |
Planning with an Ensemble of World Models AB Vasudevan, N Peri, D Ramanan | | 2023 |
LCA-on-the-Line: Benchmarking Out-of-Distribution Generalization with Class Taxonomies J Shi | | 2023 |
The Un-Kidnappable Robot: Acoustic Localization of Sneaking People M Yang, P Grady, S Brahmbhatt, AB Vasudevan, CC Kemp, J Hays arXiv preprint arXiv:2310.03743, 2023 | | 2023 |
A method for training a neural network to describe an environment on the basis of an audio signal, and the corresponding neural network W Abbeloos, AB VASUDEVAN, DAI Dengxin, L Van Gool US Patent App. 17/792,073, 2023 | | 2023 |
Sound and Visual Representation Learning with Multiple Pretraining Tasks A Balajee Vasudevan, D Dai, L Van Gool arXiv e-prints, arXiv: 2201.01046, 2022 | | 2022 |
Multimodal Semantic Understanding and Navigation in Outdoor Scenes AB Vasudevan ETH Zurich, 2021 | | 2021 |
Supplementary Material: Sound and Visual Representation Learning with Multiple Pretraining Tasks AB Vasudevan, D Dai, L Van Gool | | |
Deep Visual Semantic Embedding for Video Thumbnail Selection AB Vasudevan | | |