Follow
Dan Su
Dan Su
Tencent AI Lab
Verified email at tencent.com
Title
Cited by
Cited by
Year
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
1532021
Replay and synthetic speech detection with res2net architecture
X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng
ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021
1232021
Fastdiff: A fast conditional diffusion model for high-quality speech synthesis
R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao
arXiv preprint arXiv:2204.09934, 2022
1112022
Deep extractor network for target speaker recovery from single channel speech mixtures
J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu
arXiv preprint arXiv:1807.08974, 2018
1032018
Durian: Duration informed attention network for multimodal synthesis
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
arXiv preprint arXiv:1909.01700, 2019
992019
DurIAN: Duration Informed Attention Network for Speech Synthesis.
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
Interspeech, 2027-2031, 2020
982020
Component fusion: Learning replaceable language model component for end-to-end speech recognition system
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
972019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
Interspeech, 4290-4294, 2019
892019
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
782019
Investigating end-to-end speech recognition for mandarin-english code-switching
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
782019
Deep Discriminative Embeddings for Duration Robust Speaker Verification.
N Li, D Tuo, D Su, Z Li, D Yu, A Tencent
Interspeech, 2262-2266, 2018
772018
BDDM: Bilateral denoising diffusion models for fast and high-quality speech synthesis
MWY Lam, J Wang, D Su, D Yu
arXiv preprint arXiv:2203.13508, 2022
712022
Enhancing end-to-end multi-channel speech separation via spatial feature learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
622020
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition.
C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu
Interspeech, 761-765, 2018
612018
Speech-XLNet: Unsupervised acoustic model pretraining for self-attention networks
X Song, G Wang, Z Wu, Y Huang, D Su, D Yu, H Meng
arXiv preprint arXiv:1910.10387, 2019
552019
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation
MWY Lam, J Wang, D Su, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
442021
Diffgan-tts: High-fidelity and efficient text-to-speech with denoising diffusion gans
S Liu, D Su, D Yu
arXiv preprint arXiv:2201.11972, 2022
422022
Investigating robustness of adversarial samples detection for automatic speaker verification
X Li, N Li, J Zhong, X Wu, X Liu, D Su, D Yu, H Meng
arXiv preprint arXiv:2006.06186, 2020
412020
Joint training of complex ratio mask based beamformer and acoustic model for noise robust asr
Y Xu, C Weng, L Hui, J Liu, M Yu, D Su, D Yu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
412019
Simple attention module based speaker verification with iterative noisy label detection
X Qin, N Li, C Weng, D Su, M Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
382022
The system can't perform the operation now. Try again later.
Articles 1–20