Follow
Shi-Xiong (Austin) ZHANG
Shi-Xiong (Austin) ZHANG
Microsoft --> Principal Researcher, Tencent AI Lab
Verified email at tencent.com
Title
Cited by
Cited by
Year
End-to-end attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016
1852016
Investigation of Multilingual Deep Neural Networks for Spoken Term Detection
K Knill, MJF Gales, S Rath, P Woodland, SX Zhang
ASRU, 2013
982013
An overview of deep-learning-based audio-visual speech enhancement and separation
D Michelsanti, ZH Tan, SX Zhang, Y Xu, M Yu, D Yu, J Jensen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1368-1396, 2021
932021
SIMPLIFYING LONG SHORT-TERM MEMORY ACOUSTIC MODELS FOR FAST TRAINING AND DECODING
Y Miao, J Li, Y Wang, S Zhang, Y Gong
ICASSP, 2016
872016
A comprehensive study of speech separation: spectrogram vs waveform separation
F Bahmaninezhad, J Wu, R Gu, SX Zhang, Y Xu, M Yu, D Yu
arXiv preprint arXiv:1905.07497, 2019
672019
Time Domain Audio Visual Speech Separation
J Wu, Y Xu, SX Zhang, LW Chen, M Yu, L Xie, D Yu
Automatic Speech Recognition and Understanding Workshop, ASRU 2019,, 2019
632019
End-to-End Multi-Channel Speech Separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
https://arxiv.org/abs/1905.06286, 2019
602019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
572019
ADL-MVDR: All deep learning MVDR beamformer for target speech separation
Z Zhang, Y Xu, M Yu, SX Zhang, L Chen, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
552021
Multi-modal multi-channel target speech separation
R Gu, SX Zhang, Y Xu, L Chen, Y Zou, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 530-541, 2020
552020
New era for robust speech recognition: exploiting deep learning
S Watanabe, M Delcroix, F Metze, JR Hershey, et al.
Springer, 2017
54*2017
Audio-visual Recognition of Overlapped speech for the LRS2 dataset
J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
492020
Structured SVMs for automatic speech recognition
SX Zhang, MJF Gales
IEEE Transactions on Audio, Speech, and Language Processing 21 (3), 544-555, 2012
432012
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
412020
DEEP NEURAL SUPPORT VECTOR MACHINES FOR SPEECH RECOGNITION
SX Zhang, C Liu, K Yao, Y Gong
ICASSP 2015, 2015
402015
Computerized intelligent assistant for conferences
A Diamant, KM Ben-Dor, E Krupka, R Halaly, Y Smolin, I Gurvich, ...
US Patent 10,867,610, 2020
382020
Structured log linear models for noise robust speech recognition
SX Zhang, A Ragni, MJF Gales
IEEE Signal Processing Letters 17 (11), 945-948, 2010
362010
Audio-visual speech separation and dereverberation with a two-stage multimodal network
K Tan, Y Xu, SX Zhang, M Yu, D Yu
IEEE Journal of Selected Topics in Signal Processing 14 (3), 542-553, 2020
342020
Structured Support Vector Machines for Noise Robust Continuous Speech Recognition.
SX Zhang, MJF Gales
INTERSPEECH, 989-990, 2011
312011
Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives
AS Subramanian, C Weng, M Yu, SX Zhang, Y Xu, S Watanabe, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
292020
The system can't perform the operation now. Try again later.
Articles 1–20