Yashesh Gaur

Cited by

	All	Since 2019
Citations	1731	1631
h-index	21	21
i10-index	39	35

440

220

110

330

20162017201820192020202120222023202410 19 62 90 103 296 355 438 345

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Jinyu LiPartner Applied Science Manager, MicrosoftVerified email at microsoft.com
Zhong MengGoogleVerified email at google.com
Naoyuki KandaMicrosoftVerified email at microsoft.com
Yifan GongPrincipal Science Manager, Microsoft Corp.Verified email at microsoft.com
Anuroop SriramMeta FAIRVerified email at alumni.cmu.edu
Sanjeev SatheeshStanford UniversityVerified email at stanford.edu
Eric BattenbergGoogle ResearchVerified email at google.com
Adam CoatesPreviously Apple, Khosla Ventures, Baidu SVAIL, Stanford PhDVerified email at cs.stanford.edu
Jeffrey P. BighamCarnegie Mellon University & AppleVerified email at cs.cmu.edu
Florian MetzeCarnegie Mellon University; Meta AIVerified email at andrew.cmu.edu
Yajie MiaoCarnegie Mellon UniversityVerified email at cs.cmu.edu

Yashesh Gaur

Meta AI

Verified email at cs.cmu.edu

Machine Learning Speech & Language


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Exploring neural transducers for end-to-end speech recognition E Battenberg, J Chen, R Child, A Coates, YGY Li, H Liu, S Satheesh, ... 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	281*	2017
On the comparison of popular end-to-end models for large scale speech recognition J Li, Y Wu, Y Gaur, C Wang, R Zhao, S Liu arXiv preprint arXiv:2005.14327, 2020	148	2020
Internal language model estimation for domain-adaptive end-to-end speech recognition Z Meng, S Parthasarathy, E Sun, Y Gaur, N Kanda, L Lu, X Chen, R Zhao, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 243-250, 2021	102	2021
Serialized output training for end-to-end overlapped speech recognition N Kanda, Y Gaur, X Wang, Z Meng, T Yoshioka arXiv preprint arXiv:2003.12687, 2020	101	2020
Joint speaker counting, speech recognition, and speaker identification for overlapped speech of any number of speakers N Kanda, Y Gaur, X Wang, Z Meng, Z Chen, T Zhou, T Yoshioka arXiv preprint arXiv:2006.10930, 2020	74	2020
Robust speech recognition using generative adversarial networks A Sriram, H Jun, Y Gaur, S Satheesh 2018 IEEE international conference on acoustics, speech and signal …, 2018	69	2018
Viola: Unified codec language models for speech recognition, synthesis, and translation T Wang, L Zhou, Z Zhang, Y Wu, S Liu, Y Gaur, Z Chen, J Li, F Wei arXiv preprint arXiv:2305.16107, 2023	60	2023
The effects of automatic speech recognition quality on human transcription latency Y Gaur, WS Lasecki, F Metze, JP Bigham Proceedings of the 13th International Web for All Conference, 1-8, 2016	59	2016
Minimum latency training strategies for streaming sequence-to-sequence ASR H Inaguma, Y Gaur, L Lu, J Li, Y Gong ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	55	2020
Internal language model training for domain-adaptive end-to-end speech recognition Z Meng, N Kanda, Y Gaur, S Parthasarathy, E Sun, L Lu, X Chen, J Li, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	51	2021
Domain adaptation via teacher-student learning for end-to-end speech recognition Z Meng, J Li, Y Gaur, Y Gong 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	51	2019
On decoder-only architecture for speech-to-text and large language model integration J Wu, Y Gaur, Z Chen, L Zhou, Y Zhu, T Wang, J Li, S Liu, B Ren, L Liu, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	46	2023
Streaming multi-talker ASR with token-level serialized output training N Kanda, J Wu, Y Wu, X Xiao, Z Meng, X Wang, Y Gaur, Z Chen, J Li, ... arXiv preprint arXiv:2202.00842, 2022	44	2022
Speaker adaptation for attention-based end-to-end speech recognition Z Meng, Y Gaur, J Li, Y Gong arXiv preprint arXiv:1911.03762, 2019	44	2019
A Federated Approach in Training Acoustic Models. D Dimitriadis, RG Ken'ichi Kumatani, R Gmyr, Y Gaur, SE Eskimez Interspeech, 981-985, 2020	43	2020
Investigation of end-to-end speaker-attributed ASR for continuous multi-talker recordings N Kanda, X Chang, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka 2021 IEEE Spoken Language Technology Workshop (SLT), 809-816, 2021	42	2021
End-to-end speaker-attributed ASR with Transformer N Kanda, G Ye, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2104.02128, 2021	39	2021
Large-scale pre-training of end-to-end multi-talker ASR for meeting transcription with single distant microphone N Kanda, G Ye, Y Wu, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka arXiv preprint arXiv:2103.16776, 2021	35	2021
Transcribe-to-diarize: Neural speaker diarization for unlimited number of speakers using end-to-end speaker-attributed ASR N Kanda, X Xiao, Y Gaur, X Wang, Z Meng, Z Chen, T Yoshioka ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	30	2022
Internal language model adaptation with text-only data for end-to-end speech recognition Z Meng, Y Gaur, N Kanda, J Li, X Chen, Y Wu, Y Gong arXiv preprint arXiv:2110.05354, 2021	26	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors