VoxCeleb: a large-scale speaker identification dataset A Nagrani, JS Chung, A Zisserman Interspeech, 2017 | 2399 | 2017 |
VoxCeleb2: Deep Speaker Recognition JS Chung, A Nagrani, A Zisserman Interspeech, 2018 | 2253 | 2018 |
Lip reading sentences in the wild JS Chung, A Senior, O Vinyals, A Zisserman 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3444 …, 2017 | 892* | 2017 |
Lip reading in the wild JS Chung, A Zisserman Computer Vision–ACCV 2016: 13th Asian Conference on Computer Vision, Taipei …, 2017 | 773 | 2017 |
Deep audio-visual speech recognition T Afouras, JS Chung, A Senior, O Vinyals, A Zisserman IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018 | 769 | 2018 |
VoxCeleb: Large-scale Speaker Verification in the Wild A Nagrani, JS Chung, W Xie, A Zisserman Computer Speech & Language, 101027, 2019 | 631 | 2019 |
Out of time: automated lip sync in the wild JS Chung, A Zisserman Computer Vision–ACCV 2016 Workshops: ACCV 2016 International Workshops …, 2017 | 620 | 2017 |
In defence of metric learning for speaker recognition JS Chung, J Huh, S Mun, M Lee, HS Heo, S Choe, C Ham, S Jung, ... Interspeech, 2020 | 452 | 2020 |
The Conversation: Deep Audio-Visual Speech Enhancement T Afouras, JS Chung, A Zisserman Interspeech, 2018 | 392 | 2018 |
Utterance-level Aggregation For Speaker Recognition In The Wild W Xie, A Nagrani, JS Chung, A Zisserman ICASSP, 2019 | 391 | 2019 |
LRS3-TED: a large-scale dataset for visual speech recognition T Afouras, JS Chung, A Zisserman arXiv preprint arXiv:1809.00496, 2018 | 369 | 2018 |
You said that? JS Chung, A Jamaludin, A Zisserman BMVC, 2017 | 245 | 2017 |
Self-Supervised Learning of Audio-Visual Objects from Video T Afouras, A Owens, JS Chung, A Zisserman European Conference on Computer Vision, 2020 | 231 | 2020 |
AASIST: Audio anti-spoofing using integrated spectro-temporal graph attention networks J Jung, HS Heo, H Tak, H Shim, JS Chung, BJ Lee, HJ Yu, N Evans ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 174 | 2022 |
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues S Albanie, G Varol, L Momeni, T Afouras, JS Chung, N Fox, A Zisserman European Conference on Computer Vision, 2020 | 164 | 2020 |
You said that?: Synthesising talking faces from audio A Jamaludin, JS Chung, A Zisserman International Journal of Computer Vision 127, 1767-1779, 2019 | 163 | 2019 |
Lip Reading in Profile JS Chung, A Zisserman BMVC, 2017 | 140 | 2017 |
Spot the conversation: speaker diarisation in the wild JS Chung, J Huh, A Nagrani, T Afouras, A Zisserman Interspeech, 2020 | 135 | 2020 |
ASR is all you need: cross-modal distillation for lip reading T Afouras, JS Chung, A Zisserman ICASSP, 2020 | 131 | 2020 |
Deep Lip Reading: a comparison of models and an online application T Afouras, JS Chung, A Zisserman Interspeech, 2018 | 125 | 2018 |