Follow
Mengxiao Bi
Mengxiao Bi
Fuxi AI Lab, NetEase Inc.
Verified email at corp.netease.com
Title
Cited by
Cited by
Year
Very deep convolutional neural networks for noise robust speech recognition
Y Qian, M Bi, T Tan, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (12 …, 2016
3782016
Opencpop: A high-quality open source chinese popular song corpus for singing voice synthesis
Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi
arXiv preprint arXiv:2201.07429, 2022
592022
Visinger: Variational inference with adversarial learning for end-to-end singing voice synthesis
Y Zhang, J Cong, H Xue, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
532022
Very deep convolutional neural networks for LVCSR.
M Bi, Y Qian, K Yu
Interspeech, 3259-3263, 2015
512015
Deep feed-forward sequential memory networks for speech synthesis
M Bi, H Lu, S Zhang, M Lei, Z Yan
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
162018
One-shot voice conversion for style transfer based on speaker adaptation
Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
132022
Learn2sing 2.0: Diffusion and mutual information-based target speaker SVS by learning from singing teacher
H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi
arXiv preprint arXiv:2203.16408, 2022
82022
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features
Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
72023
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding
Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi
arXiv preprint arXiv:2305.12425, 2023
22023
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion
Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
S Tan, B Ji, M Bi, Y Pan
arXiv preprint arXiv:2404.01647, 2024
2024
Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models
H Xue, S Guo, P Zhu, M Bi
arXiv preprint arXiv:2308.10428, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–12