Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System. C Kim, M Shin, A Garg, D Gowda Interspeech, 739-743, 2019 | 47 | 2019 |
A review of on-device fully neural end-to-end automatic speech recognition algorithms C Kim, D Gowda, D Lee, J Kim, A Kumar, S Kim, A Garg, C Han ACSSC 2020: Asilomar Conference on Signals, Systems, and Computers, 2020 | 37 | 2020 |
end-to-end training of a large vocabulary end-to-end speech recognition system C Kim, S Kim, K Kim, M Kumar, J Kim, K Lee, C Han, A Garg, E Kim, ... ASRU 2019 : IEEE Workshop on Automatic Speech Recognition & Understanding, 2019 | 30 | 2019 |
Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios. A Kumar, S Singh, D Gowda, A Garg, S Singh, C Kim Interspeech 2020, 4357-4361, 2020 | 23 | 2020 |
Improved multi-stage training of online attention-based encoder-decoder models A Garg, D Gowda, A Kumar, K Kim, M Kumar, C Kim 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 70-77, 2019 | 23 | 2019 |
Multi-Task Multi-Resolution Char-to-BPE Cross-Attention Decoder for End-to-End Speech Recognition. D Gowda, A Garg, K Kim, M Kumar, C Kim Interspeech, 2783-2787, 2019 | 22 | 2019 |
Hierarchical Multi-Stage Word-to-Grapheme Named Entity Corrector for Automatic Speech Recognition. A Garg, A Gupta, D Gowda, S Singh, C Kim INTERSPEECH, 1793-1797, 2020 | 20 | 2020 |
Streaming On-Device End-to-End ASR System for Privacy-Sensitive Voice-Typing. A Garg, GP Vadisetti, D Gowda, S Jin, A Jayasimha, Y Han, J Kim, J Park, ... INTERSPEECH, 3371-3375, 2020 | 17 | 2020 |
Streaming end-to-end speech recognition with jointly trained neural feature enhancement C Kim, A Garg, D Gowda, S Mun, C Han ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
Utterance Invariant Training for Hybrid Two-Pass End-to-End Speech Recognition. D Gowda, A Kumar, K Kim, H Yang, A Garg, S Singh, J Kim, M Kumar, ... Interspeech, 2827-2831, 2020 | 7 | 2020 |
Voice recognition device and method C Kim, DN Gowda, S Kim, M Shin, LP Heck, A Garg, KIM Kwangyoun, ... US Patent 11,961,522, 2024 | 5 | 2024 |
A comparison of streaming models and data augmentation methods for robust speech recognition J Kim, M Kumar, D Gowda, A Garg, C Kim 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 5 | 2021 |
Method and device for speech recognition DN Gowda, KIM Kwangyoun, A Garg, C Kim US Patent 11,302,331, 2022 | 3 | 2022 |
Self-supervised accent learning for under-resourced accents using native language data M Kumar, J Kim, D Gowda, A Garg, C Kim ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages J Kim, M Kumar, D Gowda, A Garg, C Kim 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 2 | 2021 |
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech A Garg, J Kim, S Khyalia, C Kim, D Gowda arXiv preprint arXiv:2401.10465, 2024 | | 2024 |
System and method for modifying speech recognition result C Kim, DN Gowda, A Garg, K Lee US Patent 11,521,619, 2022 | | 2022 |
HiTNet: Byte-to-BPE Hierarchical Transcription Network for End-to-End Speech Recognition D Gowda, A Garg, J Kim, M Kumar, S Singh, A Gupta, A Kumar, ... 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | | 2021 |