Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... arXiv preprint arXiv:1712.05884, 2017 | 1801 | 2017 |
TACOTRON: TOWARDS END-TO-END SPEECH SYN Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 1546* | 2017 |
A leaf recognition algorithm for plant classification using probabilistic neural network SG Wu, FS Bao, EY Xu, YX Wang, YF Chang, QL Xiang Signal Processing and Information Technology, 2007 IEEE International ¡¦, 2007 | 1041 | 2007 |
On training targets for supervised speech separation Y Wang, A Narayanan, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 ¡¦, 2014 | 925 | 2014 |
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis Y Wang, D Stanton, Y Zhang, RJ Skerry-Ryan, E Battenberg, J Shor, ... arXiv preprint arXiv:1803.09017, 2018 | 526 | 2018 |
Complex ratio masking for monaural speech separation DS Williamson, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (3), 483-492, 2016 | 524 | 2016 |
Towards scaling up classification-based speech separation Y Wang, DL Wang IEEE Transactions on Audio, Speech, and Language Processing 21 (7), 1381-1390, 2013 | 461 | 2013 |
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ... arXiv preprint arXiv:1803.09047, 2018 | 418 | 2018 |
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition Q Kong, Y Cao, T Iqbal, Y Wang, W Wang, MD Plumbley arXiv preprint arXiv:1912.10211, 2019 | 335 | 2019 |
Learning spectral mapping for speech dereverberation and denoising K Han, Y Wang, DL Wang, WS Woods, I Merks, T Zhang IEEE Transactions on Audio, Speech, and Language Processing 23 (6), 982-992, 2015 | 239 | 2015 |
An algorithm to improve speech recognition in noise for hearing-impaired listeners EW Healy, SE Yoho, Y Wang, DL Wang The Journal of the Acoustical Society of America 134 (4), 3029-3038, 2013 | 236 | 2013 |
Exploring monaural features for classification-based speech segregation Y Wang, K Han, DL Wang IEEE Transactions on Audio, Speech, and Language Processing 21 (2), 270-279, 2013 | 217 | 2013 |
A feature study for classification-based speech separation at low signal-to-noise ratios J Chen, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 ¡¦, 2014 | 201 | 2014 |
A feature study for classification-based speech separation at low signal-to-noise ratios J Chen, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 ¡¦, 2014 | 201 | 2014 |
Hierarchical Generative Modeling for Controllable Speech Synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018 | 174 | 2018 |
Robust speaker identification in noisy and reverberant conditions X Zhao, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 (4 ¡¦, 2014 | 156 | 2014 |
Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises J Chen, Y Wang, SE Yoho, DL Wang, EW Healy The Journal of the Acoustical Society of America 139 (5), 2604-2612, 2016 | 153 | 2016 |
A deep neural network for time-domain signal reconstruction Y Wang, DL Wang Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International ¡¦, 2015 | 108 | 2015 |
Trainable frontend for robust and far-field keyword spotting Y Wang, P Getreuer, T Hughes, RF Lyon, RA Saurous Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International ¡¦, 2017 | 104 | 2017 |
Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan arXiv preprint arXiv:1808.10128, 2018 | 100 | 2018 |