Cong Han
PhD Student, Columbia University
Verified email at columbia.edu
Title
Cited by
Year
FaSNet: Low-latency adaptive beamforming for multi-microphone audio processing
Y Luo, C Han, N Mesgarani, E Ceolini, SC Liu
2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019
Cited by: 144; Year: 2019
Speaker-independent auditory attention decoding without access to clean speech sources
C Han, J O’Sullivan, Y Luo, J Herrero, AD Mehta, N Mesgarani
Science advances 5 (5), eaav6134, 2019
Cited by: 85; Year: 2019
Real-time binaural speech separation with preserved spatial cues
C Han, Y Luo, N Mesgarani
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 42; Year: 2020
Ultra-lightweight speech separation via group communication
Y Luo, C Han, N Mesgarani
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 35; Year: 2021
Online deep attractor network for real-time single-channel speech separation
C Han, Y Luo, N Mesgarani
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 29; Year: 2019
Group communication with context codec for lightweight source separation
Y Luo, C Han, N Mesgarani
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1752-1761, 2021
Cited by: 27*; Year: 2021
Dual-path RNN for long recording speech separation
C Li, Y Luo, C Han, J Li, T Yoshioka, T Zhou, M Delcroix, K Kinoshita, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 865-872, 2021
Cited by: 24; Year: 2021
Continuous speech separation using speaker inventory for long multi-talker recording
C Han, Y Luo, C Li, T Zhou, K Kinoshita, S Watanabe, M Delcroix, ...
Interspeech, 3036-3040, 2021
Cited by: 21; Year: 2021
StyleTTS: A style-based generative model for natural and diverse text-to-speech synthesis
YA Li, C Han, N Mesgarani
arXiv preprint arXiv:2205.15439, 2022
Cited by: 20; Year: 2022
Improving conversational recommendation systems' quality with context-aware item meta information
B Yang, C Han, Y Li, L Zuo, Z Yu
Findings of NAACL 2022, 38–48, 2022
Cited by: 19; Year: 2022
Distortion-controlled training for end-to-end reverberant speech separation with auxiliary autoencoding loss
Y Luo, C Han, N Mesgarani
2021 IEEE Spoken Language Technology Workshop (SLT), 825-832, 2021
Cited by: 13; Year: 2021
StyleTTS 2: Towards human-level text-to-speech through style diffusion and adversarial training with large speech language models
YA Li, C Han, V Raghavan, G Mischler, N Mesgarani
Advances in Neural Information Processing Systems 36, 2024
Cited by: 12; Year: 2024
Dual-path modeling for long recording speech separation in meetings
C Li, Z Chen, Y Luo, C Han, T Zhou, K Kinoshita, M Delcroix, S Watanabe, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 11; Year: 2021
Learning speech production and perception through sensorimotor interactions
S Shamma, P Patel, S Mukherjee, G Marion, B Khalighinejad, C Han, ...
Cerebral cortex communications 2 (1), tgaa091, 2021
Cited by: 11; Year: 2021
StyleTTS-VC: One-shot voice conversion by knowledge transfer from style-based TTS models
YA Li, C Han, N Mesgarani
2022 IEEE Spoken Language Technology Workshop (SLT), 920-927, 2023
Cited by: 9; Year: 2023
Rethinking the separation layers in speech separation networks
Y Luo, Z Chen, C Han, C Li, T Zhou, N Mesgarani
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 8; Year: 2021
Phoneme-level BERT for enhanced prosody of text-to-speech with grapheme predictions
YA Li, C Han, X Jiang, N Mesgarani
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 7; Year: 2023
Binaural speech separation of moving speakers with preserved spatial cues
C Han, Y Luo, N Mesgarani
Interspeech, 3505-3509, 2021
Cited by: 4; Year: 2021
Online binaural speech separation of moving speakers with a Wavesplit network
C Han, N Mesgarani
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 3; Year: 2023
Multi-channel speech denoising for machine ears
C Han, EM Kaya, K Hoefer, M Slaney, S Carlile
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 3; Year: 2022