Spoken question answering and speech continuation using spectrogram-powered llm E Nachmani, A Levkovitch, R Hirsch, J Salazar, C Asawaroengchai, ... arXiv preprint arXiv:2305.15255, 2023 | 36 | 2023 |
Zero-shot voice conditioning for denoising diffusion tts models A Levkovitch, E Nachmani, L Wolf arXiv preprint arXiv:2206.02246, 2022 | 27 | 2022 |
Lms with a voice: Spoken language modeling beyond speech tokens E Nachmani, A Levkovitch, J Salazar, C Asawaroengchai, S Mariooryad, ... arXiv preprint arXiv:2305.15255, 2023 | 15 | 2023 |
Translatotron 3: Speech to speech translation with monolingual data E Nachmani, A Levkovitch, Y Ding, C Asawaroengchai, H Zen, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 13 | 2024 |
Zero-Shot Mono-to-Binaural Speech Synthesis A Levkovitch, J Salazar, S Mariooryad, RJ Skerry-Ryan, N Bar, B Kleijn, ... arXiv preprint arXiv:2412.08356, 2024 | | 2024 |
LANGUAGE MODELS USING SPOKEN LANGUAGE MODELING MD Tadmor, E Nachmani, A Levkovitch, J Salazar, C Asawaroengchai, ... US Patent App. 18/662,442, 2024 | | 2024 |
Speech-to-speech translation with monolingual data MT Ramanovich, E Nachmani, A Levkovitch, B Chun, D Yifan, N Bar, ... US Patent App. 18/589,358, 2024 | | 2024 |
Towards Universal Mono-to-Binaural Speech Synthesis A Levkovitch, J Salazar, S Mariooryad, RJ Skerry-Ryan, N Bar, WB Kleijn, ... | | |