Title | Authors | Venue | Cited by | Year |
Expressive Text-to-Speech using Style Tag | M Kim, SJ Cheon, BJ Choi, JJ Kim, NS Kim | arXiv preprint arXiv:2104.00436, 2021 | 43 | 2021 |
Transfer learning framework for low-resource text-to-speech using a large-scale unlabeled speech corpus | M Kim, M Jeong, BJ Choi, S Ahn, JY Lee, NS Kim | arXiv preprint arXiv:2203.15447, 2022 | 20 | 2022 |
Disentangled speaker representation learning via mutual information minimization | SH Mun, MH Han, M Kim, D Lee, NS Kim | 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022 | 7 | 2022 |
Transduce and speak: Neural transducer for text-to-speech with semantic token prediction | M Kim, M Jeong, BJ Choi, D Lee, NS Kim | 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 4 | 2023 |
Adversarial speaker-consistency learning using untranscribed speech data for zero-shot multi-speaker text-to-speech | BJ Choi, M Jeong, M Kim, SH Mun, NS Kim | 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022 | 4 | 2022 |
Fully unsupervised training of few-shot keyword spotting | D Lee, M Kim, SH Mun, MH Han, NS Kim | 2022 IEEE Spoken Language Technology Workshop (SLT), 266-272, 2023 | 3 | 2023 |
Efficient parallel audio generation using group masked language modeling | M Jeong, M Kim, JY Lee, NS Kim | arXiv preprint arXiv:2401.01099, 2024 | 1 | 2024 |
Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech | BJ Choi, M Jeong, M Kim, NS Kim | IEEE Signal Processing Letters, 2024 | | 2024 |
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech | M Jeong, M Kim, BJ Choi, J Yoon, W Jang, NS Kim | IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | | 2024 |
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction | M Kim, M Jeong, BJ Choi, S Kim, JY Lee, NS Kim | arXiv preprint arXiv:2401.01498, 2024 | | 2024 |
EM-network: oracle guided self-distillation for sequence learning | JW Yoon, S Ahn, H Lee, M Kim, SM Kim, NS Kim | International Conference on Machine Learning, 40111-40128, 2023 | | 2023 |
Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison | MH Han, SH Mun, M Kim, M Jeong, SH Ahn, NS Kim | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |
EM-Network: Learning Better Latent Variable for Sequence-to-Sequence Models | JW Yoon, SH Ahn, H Lee, M Kim, SM Kim, NS Kim | | | 2022 |
A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization | SJ Cheon, BJ Choi, M Kim, H Lee, NS Kim | IEEE Signal Processing Letters 29, 55-59, 2021 | | 2021 |