Frozen in time: A joint video and image encoder for end-to-end retrieval M Bain, A Nagrani, G Varol, A Zisserman Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021 | 728 | 2021 |
Condensed Movies: Story Based Retrieval with Contextual Embeddings M Bain, A Nagrani, A Brown, A Zisserman Asian Conference on Computer Vision (ACCV), 2020 | 79 | 2020 |
WhisperX: Time-accurate speech transcription of long-form audio M Bain, J Huh, T Han, A Zisserman Interspeech, 2023 | 77 | 2023 |
A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning H Berg, SM Hall, Y Bhalgat, W Yang, HR Kirk, A Shtedritski, M Bain Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the …, 2022 | 63 | 2022 |
The CLIP-Hitchhiker's Guide to Long Video Retrieval M Bain, A Nagrani, G Varol, A Zisserman arXiv preprint arXiv:2205.08508, 2022 | 48 | 2022 |
Automated audiovisual behavior recognition in wild primates M Bain, A Nagrani, D Schofield, S Berdugo, J Bessa, J Owen, ... Science Advances 7 (46), eabi4883, 2021 | 41 | 2021 |
AutoAD: Movie Description in Context T Han*, M Bain*, A Nagrani, G Varol, W Xie, A Zisserman IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 19 | 2023 |
AutoAD II: The sequel - who, when, and what in movie audio description T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 13 | 2023 |
Count, crop and recognise: Fine-grained recognition in the wild M Bain, A Nagrani, D Schofield, A Zisserman Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 10 | 2019 |
Balancing the picture: Debiasing vision-language datasets with synthetic contrast sets B Smith, M Farinha, SM Hall, HR Kirk, A Shtedritski, M Bain arXiv preprint arXiv:2305.15407, 2023 | 8 | 2023 |
Understanding video through the lens of language M Bain University of Oxford, 2023 | 1 | 2023 |
AutoAD III: The Prequel - Back to the Pixels T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman arXiv preprint arXiv:2404.14412, 2024 | | 2024 |
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models A Ormazabal, C Zheng, CM d'Autume, D Yogatama, D Fu, D Ong, E Chen, ... arXiv preprint arXiv:2404.12387, 2024 | | 2024 |
Culture in communication: inter-community variation in buttress drumming by wild chimpanzees J Bessa, M Bain, A Nagrani, A Zisserman, J Di, KH Giovanni, D Biro Chimpanzee Culture in Cantanhez National Park, Guinea-Bissau, 98 | | |