Minchan Kim - Google Scholar

Cited by

	All	Since 2019
Citations	82	82
h-index	4	4
i10-index	2	2

Citations per year
Year	2021	2022	2023	2024
Citations	6	13	36	25

Minchan Kim

Seoul National University

Verified email at hi.snu.ac.kr

speech synthesis, machine learning, deep learning, generative model


Title	Cited by	Year
Expressive Text-to-Speech using Style Tag M Kim, SJ Cheon, BJ Choi, JJ Kim, NS Kim arXiv preprint arXiv:2104.00436, 2021	43	2021
Transfer learning framework for low-resource text-to-speech using a large-scale unlabeled speech corpus M Kim, M Jeong, BJ Choi, S Ahn, JY Lee, NS Kim arXiv preprint arXiv:2203.15447, 2022	20	2022
Disentangled speaker representation learning via mutual information minimization SH Mun, MH Han, M Kim, D Lee, NS Kim 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022	7	2022
Transduce and speak: Neural transducer for text-to-speech with semantic token prediction M Kim, M Jeong, BJ Choi, D Lee, NS Kim 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023	4	2023
Adversarial speaker-consistency learning using untranscribed speech data for zero-shot multi-speaker text-to-speech BJ Choi, M Jeong, M Kim, SH Mun, NS Kim 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022	4	2022
Fully unsupervised training of few-shot keyword spotting D Lee, M Kim, SH Mun, MH Han, NS Kim 2022 IEEE Spoken Language Technology Workshop (SLT), 266-272, 2023	3	2023
Efficient parallel audio generation using group masked language modeling M Jeong, M Kim, JY Lee, NS Kim arXiv preprint arXiv:2401.01099, 2024	1	2024
Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech BJ Choi, M Jeong, M Kim, NS Kim IEEE Signal Processing Letters, 2024		2024
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech M Jeong, M Kim, BJ Choi, J Yoon, W Jang, NS Kim IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024		2024
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction M Kim, M Jeong, BJ Choi, S Kim, JY Lee, NS Kim arXiv preprint arXiv:2401.01498, 2024		2024
EM-network: oracle guided self-distillation for sequence learning JW Yoon, S Ahn, H Lee, M Kim, SM Kim, NS Kim International Conference on Machine Learning, 40111-40128, 2023		2023
Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison MH Han, SH Mun, M Kim, M Jeong, SH Ahn, NS Kim ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023		2023
EM-Network: Learning Better Latent Variable for Sequence-to-Sequence Models JW Yoon, SH Ahn, H Lee, M Kim, SM Kim, NS Kim		2022
A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization SJ Cheon, BJ Choi, M Kim, H Lee, NS Kim IEEE Signal Processing Letters 29, 55-59, 2021		2021

Articles 1–14