Sri Karlapati
Amazon Research, Cambridge, United Kingdom.
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech
S Karlapati, A Moinet, A Joly, V Klimkov, D Sáez-Trigueros, T Drugman
arXiv preprint arXiv:2004.14617, 2020
CAMP: a Two-Stage Approach to Modelling Prosody in Context
Z Hodari, A Moinet, S Karlapati, J Lorenzo-Trueba, T Merritt, A Joly, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech
S Karlapati, A Abbas, Z Hodari, A Moinet, A Joly, P Karanasou, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
M Łajszczak, G Cámbara, Y Li, F Beyhan, A van Korlaar, F Yang, A Joly, ...
arXiv preprint arXiv:2402.08093, 2024
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody
P Makarov, A Abbas, M Łajszczak, A Joly, S Karlapati, A Moinet, ...
arXiv preprint arXiv:2206.14643, 2022
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer
S Karlapati, P Karanasou, M Lajszczak, A Abbas, A Moinet, P Makarov, ...
arXiv preprint arXiv:2206.13443, 2022
Expressive, Variable, and Controllable Duration Modelling in TTS
A Abbas, T Merritt, A Moinet, S Karlapati, E Muszynska, S Slangen, E Gatti, ...
arXiv preprint arXiv:2206.14165, 2022
A learned conditional prior for the VAE acoustic space of a TTS system
P Karanasou, S Karlapati, A Moinet, A Joly, A Abbas, S Slangen, ...
Predicting deformation mechanisms in architected metamaterials using GNN
PP Indurkar, S Karlapati, AJD Shaikeea, VS Deshpande
arXiv preprint arXiv:2202.09427, 2022
Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments
A Mottini, J Lorenzo-Trueba, SVK Karlapati, T Drugman
arXiv preprint arXiv:2106.08873, 2021
Multi-scale spectrogram text-to-speech
SA Abbas, B Bollepalli, AP Moinet, TR Drugman, AVPY Joly, P Karanasou, ...
US Patent 11,694,674, 2023
eCat: An end-to-end model for multi-speaker TTS & many-to-many fine-grained prosody transfer
A Abbas, S Karlapati, B Schnell, P Karanasou, MG Moya, A Nagaraj, ...
arXiv preprint arXiv:2306.11327, 2023
Hash based frequent pattern mining approach to text compression
C Oswald, S Srinidhi, KS Vishnu, TV Vishal, B Sivaselvan
First EAI International Conference on Computer Science and Engineering, 228-238, 2017
A Comparative Analysis of Pretrained Language Models for Text-to-Speech
MG Moya, P Karanasou, S Karlapati, B Schnell, N Peinelt, A Moinet, ...
12th Speech Synthesis Workshop (SSW) 2023, 2023
Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials
I Grega, I Batatia, G Csányi, S Karlapati, VS Deshpande
arXiv preprint arXiv:2401.16914, 2024
Learned condition text-to-speech synthesis
P Karanasou, SVK Karlapati, AP Moinet, AVPY Joly, SA Abbas, ...
US Patent 11,830,476, 2023
Synthetic speech processing
JL Trueba, ARM D'Oliveira, TR Drugman, SVK Karlapati
US Patent 11,735,156, 2023
Synthetic speech processing
JL Trueba, ARM D'Oliveira, TR Drugman, SVK Karlapati
US Patent App. 18/305,456, 2023
Synthetic speech processing by representing text by phonemes exhibiting predicted volume and pitch using neural networks
A Joly, S Slangen, AP Moinet, TR Drugman, P Karanasou, SA Abbas, ...
US Patent 11,978,431, 2024
Synthetic speech processing
AVPY Joly, P Karanasou, APJ Moinet, TR Drugman, SVK Karlapati, ...
US Patent 11,574,624, 2023