Wangyou Zhang

Cited by

	All	Since 2019
Citations	1626	1626
h-index	14	14
i10-index	16	16

Citations per year
Year	2019	2020	2021	2022	2023	2024
Citations	12	160	389	398	520	144

Public access (based on funding mandates)
Available	16 articles
Not available	1 article

Co-authors

Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu
Yanmin Qian	Professor, Shanghai Jiao Tong University	Verified email at sjtu.edu.cn
Xuankai Chang	Carnegie Mellon University, Student	Verified email at andrew.cmu.edu
Chenda Li	Shanghai Jiao Tong University	Verified email at sjtu.edu.cn
Jing Shi	Institute of Automation Chinese Academy of Sciences	Verified email at ia.ac.cn
Christoph Boeddeker	Paderborn University	Verified email at mail.upb.de
Aswin Shanmugam Subramanian	Microsoft	Verified email at microsoft.com

Wangyou Zhang

Ph.D. candidate, Department of Computer Science and Engineering, Shanghai Jiao Tong University

Verified email at sjtu.edu.cn

Signal Processing, Speech Separation, Speech Enhancement, Robust Speech Recognition


Title	Authors	Venue	Cited by	Year
A comparative study on Transformer vs RNN in speech applications	S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ...	2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	757	2019
Recent Developments on ESPnet Toolkit Boosted by Conformer	P Guo, F Boyer, X Chang, T Hayashi, Y Higuchi, H Inaguma, N Kamo, C Li, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	262	2021
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition	X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe	2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	114	2019
End-To-End Multi-Speaker Speech Recognition With Transformer	X Chang, W Zhang, Y Qian, J Le Roux, S Watanabe	ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	99	2020
ESPnet-SE: End-to-End Speech Enhancement and Separation Toolkit Designed for ASR Integration	C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ...	IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021	68	2021
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans	S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...	2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021	51	2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend	W Zhang, C Boeddeker, S Watanabe, T Nakatani, M Delcroix, K Kinoshita, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	34	2021
Improving End-to-End Single-Channel Multi-Talker Speech Recognition	W Zhang, X Chang, Y Qian, S Watanabe	IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1385-1394, 2020	32	2020
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming	W Zhang, AS Subramanian, X Chang, S Watanabe, Y Qian	Proc. Interspeech 2020, 324-328, 2020	30	2020
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking	W Zhang, Y Zhou, Y Qian	Proc. Interspeech 2019, 2703-2707, 2019	25	2019
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation	C Boeddeker, W Zhang, T Nakatani, K Kinoshita, T Ochiai, M Delcroix, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	24	2021
Towards Low-Distortion Multi-Channel Speech Enhancement: The ESPnet-SE Submission to the L3DAS22 Challenge	YJ Lu, S Cornell, X Chang, W Zhang, C Li, Z Ni, ZQ Wang, S Watanabe	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	21	2022
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions	W Zhang, J Shi, C Li, S Watanabe, Y Qian	2021 IEEE Workshop on Applications of Signal Processing to Audio and …, 2021	20	2021
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding	YJ Lu, X Chang, C Li, W Zhang, S Cornell, Z Ni, Y Masuyama, B Yan, ...	Proc. Interspeech 2022, 5458-5462, 2022	17	2022
End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party	W Zhang, X Chang, C Boeddeker, T Nakatani, S Watanabe, Y Qian	IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 3173-3188, 2022	13	2022
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data	Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ...	2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023	11	2023
The SJTU System For Multimodal Information Based Speech Processing Challenge 2021	W Wang, X Gong, Y Wu, Z Zhou, C Li, W Zhang, B Han, Y Qian	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	7	2022
Separating Long-Form Speech with Group-Wise Permutation Invariant Training	W Zhang, Z Chen, N Kanda, S Liu, J Li, SE Eskimez, T Yoshioka, X Xiao, ...	Proc. Interspeech 2022, 5383-5387, 2022	6	2022
Knowledge Distillation for End-to-End Monaural Multi-Talker ASR System	W Zhang, X Chang, Y Qian	Proc. Interspeech 2019, 2633-2637, 2019	6	2019
Text Adaptive Detection for Customizable Keyword Spotting	Y Xi, T Tan, W Zhang, B Yang, K Yu	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	5	2022
