FaSNet: Low-latency adaptive beamforming for multi-microphone audio processing Y Luo, C Han, N Mesgarani, E Ceolini, SC Liu 2019 IEEE automatic speech recognition and understanding workshop (ASRU …, 2019 | 85 | 2019 |
Speaker-independent auditory attention decoding without access to clean speech sources C Han, J O’Sullivan, Y Luo, J Herrero, AD Mehta, N Mesgarani Science advances 5 (5), eaav6134, 2019 | 54 | 2019 |
Ultra-lightweight speech separation via group communication Y Luo, C Han, N Mesgarani ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 24 | 2021 |
Real-time binaural speech separation with preserved spatial cues C Han, Y Luo, N Mesgarani ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 24 | 2020 |
Online deep attractor network for real-time single-channel speech separation C Han, Y Luo, N Mesgarani ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 19 | 2019 |
Dual-path RNN for long recording speech separation C Li, Y Luo, C Han, J Li, T Yoshioka, T Zhou, M Delcroix, K Kinoshita, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 865-872, 2021 | 16 | 2021 |
Group communication with context codec for lightweight source separation Y Luo, C Han, N Mesgarani IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1752-1761, 2021 | 14 | 2021 |
Continuous speech separation using speaker inventory for long multi-talker recording C Han, Y Luo, C Li, T Zhou, K Kinoshita, S Watanabe, M Delcroix, ... arXiv preprint arXiv:2012.09727, 2020 | 12 | 2020 |
Distortion-controlled training for end-to-end reverberant speech separation with auxiliary autoencoding loss Y Luo, C Han, N Mesgarani 2021 IEEE Spoken Language Technology Workshop (SLT), 825-832, 2021 | 9 | 2021 |
Dual-path modeling for long recording speech separation in meetings C Li, Z Chen, Y Luo, C Han, T Zhou, K Kinoshita, M Delcroix, S Watanabe, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 7 | 2021 |
Learning speech production and perception through sensorimotor interactions S Shamma, P Patel, S Mukherjee, G Marion, B Khalighinejad, C Han, ... Cerebral cortex communications 2 (1), tgaa091, 2021 | 7 | 2021 |
Rethinking the separation layers in speech separation networks Y Luo, Z Chen, C Han, C Li, T Zhou, N Mesgarani ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 5 | 2021 |
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis YA Li, C Han, N Mesgarani arXiv preprint arXiv:2205.15439, 2022 | 3 | 2022 |
Empirical Analysis of Generalized Iterative Speech Separation Networks. Y Luo, C Han, N Mesgarani Interspeech, 3485-3489, 2021 | 3 | 2021 |
Improving Conversational Recommendation Systems' Quality with Context-Aware Item Meta Information B Yang, C Han, Y Li, L Zuo, Z Yu arXiv preprint arXiv:2112.08140, 2021 | 2 | 2021 |
Group communication with context codec for ultra-lightweight source separation Y Luo, C Han, N Mesgarani arXiv preprint arXiv:2012.07291, 2020 | 2 | 2020 |
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models YA Li, C Han, N Mesgarani arXiv preprint arXiv:2212.14227, 2022 | 1 | 2022 |
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions YA Li, C Han, X Jiang, N Mesgarani arXiv preprint arXiv:2301.08810, 2023 | | 2023 |
Multi-Channel Speech Denoising for Machine Ears C Han, EM Kaya, K Hoefer, M Slaney, S Carlile ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | | 2022 |
Binaural Speech Separation of Moving Speakers With Preserved Spatial Cues. C Han, Y Luo, N Mesgarani Interspeech, 3505-3509, 2021 | | 2021 |