Zero-shot voice conditioning for denoising diffusion TTS models A Levkovitch, E Nachmani, L Wolf arXiv preprint arXiv:2206.02246, 2022 | 21 | 2022 |
LMs with a voice: Spoken language modeling beyond speech tokens E Nachmani, A Levkovitch, J Salazar, C Asawaroengchai, S Mariooryad, ... arXiv preprint arXiv:2305.15255, 2023 | 8 | 2023 |
Translatotron 3: Speech to speech translation with monolingual data E Nachmani, A Levkovitch, Y Ding, C Asawaroengchai, H Zen, ... ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM E Nachmani, A Levkovitch, R Hirsch, J Salazar, C Asawaroengchai, ... The Twelfth International Conference on Learning Representations, 2023 | | 2023 |