Yu Zhang

인용

	전체	2019년 이후
서지정보	22987	21231
h-index	57	55
i10-index	111	101

7000

3500

1750

5250

2015201620172018201920202021202220232024101 264 417 865 1431 2341 4031 5143 6435 1796

공개 액세스

모두 보기

자료 7개

자료 0개

공개

비공개

재정 지원 요구사항 기준

공동 저자

Yonghui WuGoogle Braingoogle.com의 이메일 확인됨
Chung-Cheng ChiuAppleapple.com의 이메일 확인됨
Wei Hanillinois.edu의 이메일 확인됨
Ye JiaMetagoogle.com의 이메일 확인됨
Ron J WeissGooglegoogle.com의 이메일 확인됨
William ChanIdeogramideogram.ai의 이메일 확인됨
Heiga ZenPrincipal Scientist (Director), Google DeepMindgoogle.com의 이메일 확인됨
Ruoming Pang (庞若鸣)Apple AI/MLapple.com의 이메일 확인됨
James GlassMIT Computer Science and Artificial Intelligence Laboratorymit.edu의 이메일 확인됨
Bo LiGooglegoogle.com의 이메일 확인됨
James QinGooglegoogle.com의 이메일 확인됨
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA Fellowglobal.tencent.com의 이메일 확인됨
Jonathan ShenGooglegoogle.com의 이메일 확인됨
Quoc V. LeResearch Scientist, Googlestanford.edu의 이메일 확인됨
Daniel S. ParkGoogle Braingoogle.com의 이메일 확인됨
Tara SainathPrincipal Research Scientist, Googlegoogle.com의 이메일 확인됨
Wei-Ning HsuFacebook AI Research (FAIR)csail.mit.edu의 이메일 확인됨
Zhifeng ChenGoogle Inc.google.com의 이메일 확인됨
Yuxuan WangByteDancecse.ohio-state.edu의 이메일 확인됨
Anmol GulatiResearcher, Google Deepmindgoogle.com의 이메일 확인됨

팔로우

Yu Zhang

OpenAI

csail.mit.edu의 이메일 확인됨 - 홈페이지

Speech Recognition Speech Synthesis


제목 서지정보순 정렬 연도순 정렬 제목순 정렬	인용 인용	연도
Specaugment: A simple data augmentation method for automatic speech recognition DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le arXiv preprint arXiv:1904.08779, 2019	3715	2019
Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018	2968	2018
Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020	2563	2020
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems 31, 2018	883	2018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ... International conference on machine learning, 5180-5189, 2018	882	2018
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019	701	2019
Wavegrad: Estimating gradients for waveform generation N Chen, Y Zhang, H Zen, RJ Weiss, M Norouzi, W Chan arXiv preprint arXiv:2009.00713, 2020	588	2020
Very deep convolutional networks for end-to-end speech recognition Y Zhang, W Chan, N Jaitly 2017 IEEE international conference on acoustics, speech and signal …, 2017	546	2017
An introduction to computational networks and the computational network toolkit MS Dong Yu, Adam Eversole, Mike Seltzer, Kaisheng Yao, Zhiheng Huang, Brian ... Tech. Rep. MSR, Microsoft Research, 2014, http://codebox/cntk, 2014	465*	2014
Unsupervised learning of disentangled and interpretable representations from sequential data WN Hsu, Y Zhang, J Glass Advances in neural information processing systems 30, 2017	397	2017
Spoken language understanding using long short-term memory neural networks K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi IEEE SLT, 2014	397	2014
Highway long short-term memory rnns for distant speech recognition Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass 2016 IEEE international conference on acoustics, speech and signal …, 2016	354	2016
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM T Hori, S Watanabe, Y Zhang, W Chan arXiv preprint arXiv:1706.02737, 2017	344	2017
Pushing the limits of semi-supervised learning for automatic speech recognition Y Zhang, J Qin, DS Park, W Han, CC Chiu, R Pang, QV Le, Y Wu arXiv preprint arXiv:2010.10504, 2020	314	2020
Simple recurrent units for highly parallelizable recurrence T Lei, Y Zhang, SI Wang, H Dai, Y Artzi arXiv preprint arXiv:1709.02755, 2017	308	2017
W2v-bert: Combining contrastive learning and masked language modeling for self-supervised speech pre-training YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	297	2021
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language 64, 101114, 2020	291	2020
Contextnet: Improving convolutional neural networks for automatic speech recognition with global context W Han, Z Zhang, Y Zhang, J Yu, CC Chiu, J Qin, A Gulati, R Pang, Y Wu arXiv preprint arXiv:2005.03191, 2020	270	2020
Hierarchical generative modeling for controllable speech synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018	266	2018
Improved noisy student training for automatic speech recognition DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le arXiv preprint arXiv:2005.09629, 2020	235	2020

현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.

학술자료 1–20

연간 인용횟수

중복된 서지정보

병합된 서지정보

공동 저자 추가공동 저자

팔로우

인용

공동 저자