TCD-TIMIT: An audio-visual corpus of continuous speech N Harte, E Gillen Multimedia, IEEE Transactions on 17 (5), 603-615, 2015 | 266 | 2015 |
ViSQOL: an objective speech quality model A Hines, J Skoglund, AC Kokaram, N Harte EURASIP Journal on Audio, Speech, and Music Processing 2015, 1-18, 2015 | 160 | 2015 |
Speech intelligibility prediction using a neurogram similarity index measure A Hines, N Harte Speech Communication 54 (2), 306-320, 2012 | 105 | 2012 |
Phoneme-to-viseme mapping for visual speech recognition L Cappelletta, N Harte International Conference on Pattern Recognition Applications and Methods 2 ¡¦, 2012 | 95 | 2012 |
Attention-based audio-visual fusion for robust automatic speech recognition G Sterpu, C Saam, N Harte Proceedings of the 20th ACM International conference on Multimodal ¡¦, 2018 | 81 | 2018 |
ViSQOLAudio: An objective audio quality metric for low bitrate codecs A Hines, E Gillen, D Kelly, J Skoglund, A Kokaram, N Harte The Journal of the Acoustical Society of America 137 (6), EL449-EL455, 2015 | 62 | 2015 |
Objective assessment of perceptual audio quality using ViSQOLAudio C Sloan, N Harte, D Kelly, AC Kokaram, A Hines IEEE Transactions on Broadcasting 63 (4), 693-705, 2017 | 61 | 2017 |
Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA A Hines, J Skoglund, A Kokaram, N Harte 2013 IEEE International Conference on Acoustics, Speech and Signal ¡¦, 2013 | 57 | 2013 |
ViSQOL: The virtual speech quality objective listener A Hines, J Skoglund, A Kokaram, N Harte IWAENC 2012; international workshop on acoustic signal enhancement, 1-4, 2012 | 57 | 2012 |
Viseme definitions comparison for visual-only speech recognition L Cappelletta, N Harte 2011 19th European Signal Processing Conference, 2109-2113, 2011 | 50 | 2011 |
Multimodal continuous turn-taking prediction using multiscale rnns M Roddy, G Skantze, N Harte Proceedings of the 20th ACM International Conference on Multimodal ¡¦, 2018 | 48 | 2018 |
Speaker verification in score-ageing-quality classification space F Kelly, A Drygajlo, N Harte Computer Speech & Language 27 (5), 1068-1084, 2013 | 47 | 2013 |
Speaker verification with long-term ageing data F Kelly, A Drygajlo, N Harte 2012 5th IAPR international conference on biometrics (ICB), 478-483, 2012 | 43 | 2012 |
TCD-VoIP, a research database of degraded speech for assessing quality in VoIP applications N Harte, E Gillen, A Hines 2015 Seventh International Workshop on Quality of Multimedia Experience ¡¦, 2015 | 42 | 2015 |
Speech intelligibility from image processing A Hines, N Harte Speech Communication 52 (9), 736-752, 2010 | 41 | 2010 |
Multi-resolution cepstral features for phoneme recognition across speech sub-bands P McCourt, S Vaseght, N Harte Proceedings of the 1998 IEEE International Conference on Acoustics, Speech ¡¦, 1998 | 39 | 1998 |
Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs M Roddy, G Skantze, N Harte Proc. Interspeech 2018, 586-590, 2018 | 37 | 2018 |
The effect of multimodal emotional expression and agent appearance on trust in human-agent interaction I Torre, E Carrigan, R McDonnell, K Domijan, K McCabe, N Harte Proceedings of the 12th ACM SIGGRAPH Conference on Motion, Interaction and ¡¦, 2019 | 36 | 2019 |
Multi-resolution phonetic/segmental features and models for hmm-based speech recognition S Vaseghi, N Harte, B Milner 1997 IEEE International Conference on Acoustics, Speech, and Signal ¡¦, 1997 | 35 | 1997 |
How to teach DNNs to pay attention to the visual modality in speech recognition G Sterpu, C Saam, N Harte IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1052-1064, 2020 | 33 | 2020 |