Phonetic posteriorgrams for many-to-one voice conversion without parallel data training L Sun, K Li, H Wang, S Kang, H Meng 2016 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2016 | 356 | 2016 |
Voice conversion using deep bidirectional long short-term memory based recurrent neural networks L Sun, S Kang, K Li, H Meng 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 326 | 2015 |
Personalized, Cross-Lingual TTS Using Phonetic Posteriorgrams. L Sun, H Wang, S Kang, K Li, HM Meng Interspeech, 322-326, 2016 | 73 | 2016 |
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance. S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng Interspeech, 496-500, 2018 | 65 | 2018 |
Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization D Wang, J Yu, X Wu, L Sun, X Liu, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 37 | 2021 |
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 37 | 2020 |
End-to-end accent conversion without using native utterances S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 36 | 2020 |
Phonetic posteriorgrams for many-to-one voice conversion L Sun, K Li, H Wang, S Kang, MLH Meng US Patent 10,176,819, 2019 | 14 | 2019 |
Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams. S Liu, Y Cao, X Wu, L Sun, X Liu, H Meng INTERSPEECH, 714-718, 2019 | 13 | 2019 |
Feature based adaptation for speaking style synthesis X Wu, L Sun, S Kang, S Liu, Z Wu, X Liu, H Meng 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 10 | 2018 |
The HCCL-CUHK System for the Voice Conversion Challenge 2018. S Liu, L Sun, X Wu, X Liu, H Meng Odyssey, 248-254, 2018 | 8 | 2018 |
Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion. R Li, Z Wu, Y Ning, L Sun, H Meng, L Cai INTERSPEECH, 3409-3413, 2017 | 7 | 2017 |
Learning explicit prosody models and deep speaker embeddings for atypical voice conversion D Wang, S Liu, L Sun, X Wu, X Liu, H Meng arXiv preprint arXiv:2011.01678, 2020 | 6 | 2020 |
Attention-Based recurrent generator with gaussian tolerance for statistical parametric speech synthesis X Wu, S Kang, L Sun, Y Ning, Z Wu, H Meng Workshop on Affective Social Multimedia Computing, 2017 | 4 | 2017 |
Speaker identity preservation in dysarthric speech reconstruction by adversarial speaker adaptation D Wang, S Liu, X Wu, H Lu, L Sun, X Liu, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 3 | 2022 |
FastFoley: Non-autoregressive Foley Sound Generation Based on Visual Semantics S Li, L Zhang, C Dong, H Xue, Z Wu, L Sun, K Li, H Meng National Conference on Man-Machine Speech Communication, 252-263, 2022 | 1 | 2022 |
Speech Recognition and Text-to-Speech Synthesis L Sun, S Kang, X Liu, H Meng Chinese Language Resources: Data Collection, Linguistic Analysis, Annotation …, 2023 | | 2023 |
Posteriorgram-to-Acoustic Modeling for Unconstrained Voice Conversion with Deep Learning L Sun PQDT-Global, 2017 | | 2017 |
IEEE ICME’16 Best Paper: Phonetic Posteriorgrams for Many-to-One Voice Conversion without Parallel Data Training L Sun, K Li, H Wang, S Kang, H Meng MMTC Communications–Review, 2016 | | 2016 |