Minchan Kim
Title
Cited by
Year
Expressive Text-to-Speech using Style Tag
M Kim, SJ Cheon, BJ Choi, JJ Kim, NS Kim
arXiv preprint arXiv:2104.00436, 2021
Cited by 43, 2021
Transfer learning framework for low-resource text-to-speech using a large-scale unlabeled speech corpus
M Kim, M Jeong, BJ Choi, S Ahn, JY Lee, NS Kim
arXiv preprint arXiv:2203.15447, 2022
Cited by 20, 2022
Disentangled speaker representation learning via mutual information minimization
SH Mun, MH Han, M Kim, D Lee, NS Kim
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
Cited by 7, 2022
Transduce and speak: Neural transducer for text-to-speech with semantic token prediction
M Kim, M Jeong, BJ Choi, D Lee, NS Kim
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
Cited by 4, 2023
Adversarial speaker-consistency learning using untranscribed speech data for zero-shot multi-speaker text-to-speech
BJ Choi, M Jeong, M Kim, SH Mun, NS Kim
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
Cited by 4, 2022
Fully unsupervised training of few-shot keyword spotting
D Lee, M Kim, SH Mun, MH Han, NS Kim
2022 IEEE Spoken Language Technology Workshop (SLT), 266-272, 2023
Cited by 3, 2023
Efficient parallel audio generation using group masked language modeling
M Jeong, M Kim, JY Lee, NS Kim
arXiv preprint arXiv:2401.01099, 2024
Cited by 1, 2024
Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech
BJ Choi, M Jeong, M Kim, NS Kim
IEEE Signal Processing Letters, 2024
2024
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech
M Jeong, M Kim, BJ Choi, J Yoon, W Jang, NS Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
2024
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction
M Kim, M Jeong, BJ Choi, S Kim, JY Lee, NS Kim
arXiv preprint arXiv:2401.01498, 2024
2024
EM-network: oracle guided self-distillation for sequence learning
JW Yoon, S Ahn, H Lee, M Kim, SM Kim, NS Kim
International Conference on Machine Learning, 40111-40128, 2023
2023
Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison
MH Han, SH Mun, M Kim, M Jeong, SH Ahn, NS Kim
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
EM-Network: Learning Better Latent Variable for Sequence-to-Sequence Models
JW Yoon, SH Ahn, H Lee, M Kim, SM Kim, NS Kim
2022
A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization
SJ Cheon, BJ Choi, M Kim, H Lee, NS Kim
IEEE Signal Processing Letters 29, 55-59, 2021
2021