Taejin Park
Title
Cited by
Year
A review of speaker diarization: Recent advances with deep learning
TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan
Computer Speech & Language 72, 101317, 2022
Cited by: 287; Year: 2022
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap
TJ Park, KJ Han, M Kumar, S Narayanan
IEEE Signal Processing Letters 27, 381-385, 2019
Cited by: 117; Year: 2019
TitaNet: Neural model for speaker representation with 1D depth-wise separable convolutions and global context
NR Koluguri, T Park, B Ginsburg
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 69; Year: 2022
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 9,319,819, 2016
Cited by: 49; Year: 2016
Musical instrument sound classification with deep convolutional neural network using feature fusion approach
T Park, T Lee
arXiv preprint arXiv:1512.07370, 2015
Cited by: 46; Year: 2015
Multimodal speaker segmentation and diarization using lexical and acoustic cues via sequence to sequence neural networks
TJ Park, P Georgiou
arXiv preprint arXiv:1805.10731, 2018
Cited by: 37; Year: 2018
Speaker diarization with lexical information
TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan
arXiv preprint arXiv:2004.06756, 2020
Cited by: 35; Year: 2020
Speaker diarization using latent space clustering in generative adversarial network
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 23; Year: 2020
Meta-learning with latent space clustering in generative adversarial network for speaker diarization
M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan
IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021
Cited by: 21; Year: 2021
Multi-scale speaker diarization with dynamic scale weighting
TJ Park, NR Koluguri, J Balam, B Ginsburg
arXiv preprint arXiv:2203.15974, 2022
Cited by: 14; Year: 2022
Tackling dynamics in federated incremental learning with variational embedding rehearsal
TJ Park, K Kumatani, D Dimitriadis
arXiv preprint arXiv:2110.09695, 2021
Cited by: 14; Year: 2021
Multi-scale speaker diarization with neural affinity score fusion
TJ Park, M Kumar, S Narayanan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 14; Year: 2021
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations
SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 14; Year: 2020
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech.
A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ...
Interspeech, 2463-2467, 2019
Cited by: 14; Year: 2019
The Second DIHARD Challenge: System Description for USC-SAIL Team.
TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ...
INTERSPEECH, 998-1002, 2019
Cited by: 11; Year: 2019
Encoding/decoding apparatus for processing channel signal and method therefor
JI Seo, SK Beack, DY Jang, KO Kang, TJ Park, YJ Lee, KW Choi, JW Kim
US Patent 10,068,579, 2018
Cited by: 11; Year: 2018
Robust multi-channel speech recognition using frequency aligned network
T Park, K Kumatani, M Wu, S Sundaram
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 8; Year: 2020
Apparatus for processing audio signal for sound bar and method therefor
JI Seo, DY Jang, TJ Park, KW Choi, KO Kang, JW Kim
US Patent App. 14/760,770, 2015
Cited by: 8; Year: 2015
Binaural rendering method and apparatus for decoding multi channel audio
YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ...
US Patent 10,199,045, 2019
Cited by: 7; Year: 2019
Apparatus and method for transmitting watermark robust to acoustic channel distortion
SK Beack, TJ Park, JM Sung, YJ Lee, TJ Lee, KO Kang
US Patent App. 14/881,375, 2016
Cited by: 6; Year: 2016