Yinghao Aaron Li

Cited by

	All	Since 2019
Citations	198	198
h-index	6	6
i10-index	4	4

100

2019202020212022202320241 8 21 35 96 37

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Nima MesgaraniAssociate Professor, Columbia UniversityVerified email at ee.columbia.edu
Cong HanPhD Student, Columbia UniversityVerified email at columbia.edu
Robert KimMD/PhD Student, UCSD, Salk InstituteVerified email at ucsd.edu
Terrence SejnowskiFrancis Crick Professor, Salk Institute, Distingished Professor, UC San DiegoVerified email at salk.edu
Ali ZareColumbia UniversityVerified email at columbia.edu
Xilin JiangPhD student, Columbia UniversityVerified email at columbia.edu
Gavin MischlerPhD Student at Columbia UniversityVerified email at columbia.edu
Vinay S RaghavanPostdoctoral Fellow, The City College of New YorkVerified email at ccny.cuny.edu
Shuai TangAmazon Web ServicesVerified email at amazon.com
Virginia de SaProfessor of Cognitive Science, Associate Director of the Halicioglu Data Science Institute, UCSDVerified email at ucsd.edu
Vishal ChoudhariElectrical Engineering Ph.D. Student, Columbia UniversityVerified email at columbia.edu

Yinghao Aaron Li

PhD Student, Columbia University

Verified email at columbia.edu

Computational Neuroscience Voice Conversion Speech Synthesis


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Starganv2-vc: A diverse, unsupervised, non-parallel framework for natural-sounding voice conversion YA Li, A Zare, N Mesgarani arXiv preprint arXiv:2107.10394, 2021	71	2021
Simple framework for constructing functional spiking recurrent neural networks R Kim, Y Li, TJ Sejnowski Proceedings of the national academy of sciences 116 (45), 22811-22820, 2019	68	2019
Styletts: A style-based generative model for natural and diverse text-to-speech synthesis YA Li, C Han, N Mesgarani arXiv preprint arXiv:2205.15439, 2022	20	2022
Styletts 2: Towards human-level text-to-speech through style diffusion and adversarial training with large speech language models YA Li, C Han, V Raghavan, G Mischler, N Mesgarani Advances in Neural Information Processing Systems 36, 2024	13	2024
Styletts-vc: One-shot voice conversion by knowledge transfer from style-based tts models YA Li, C Han, N Mesgarani 2022 IEEE Spoken Language Technology Workshop (SLT), 920-927, 2023	9	2023
Phoneme-level bert for enhanced prosody of text-to-speech with grapheme predictions YA Li, C Han, X Jiang, N Mesgarani ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	7	2023
Learning the synaptic and intrinsic membrane dynamics underlying working memory in spiking neural network models Y Li, R Kim, TJ Sejnowski Neural Computation 33 (12), 3264-3287, 2021	4	2021
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs YA Li, C Han, N Mesgarani 2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023	2	2023
Improved decoding of attentional selection in multi-talker environments with self-supervised learned speech representation C Han, V Choudhari, YA Li, N Mesgarani 2023 45th Annual International Conference of the IEEE Engineering in …, 2023	2	2023
Supervised spike sorting using deep convolutional siamese network and hierarchical clustering Y Li, S Tang, VR de Sa unpublished thesis, 2019	2	2019
Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation X Jiang, C Han, YA Li, N Mesgarani ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience X Jiang, C Han, YA Li, N Mesgarani arXiv preprint arXiv:2402.03710, 2024		2024
Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain G Mischler, YA Li, S Bickel, AD Mehta, N Mesgarani arXiv preprint arXiv:2401.17671, 2024		2024
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform YA Li, C Han, X Jiang, N Mesgarani arXiv preprint arXiv:2309.09493, 2023		2023
DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes X Jiang, YA Li, N Mesgarani arXiv preprint arXiv:2305.18441, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–15

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors