Xinsheng Wang

Cited by

	All	Since 2019
Citations	354	344
h-index	11	11
i10-index	13	13

140

105

201720182019202020212022202320242 8 19 16 39 71 135 62

Public access

View all

4 articles

8 articles

available

not available

Based on funding mandates

Co-authors

Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
jihua zhuSchool of Software Engineering, Xi'an Jiaotong UniversityVerified email at mail.xjtu.edu.cn
Odette ScharenborgAssociate Professor, Delft University of Technology, The NetherlandsVerified email at tudelft.nl
Tao LiAudio, Speech and Language Processing Group (ASLP@NPU), School of Computer ScienceVerified email at npu-aslp.org
Pengcheng ZhuFuxi AI Lab, NetEase Inc.Verified email at corp.netease.com
Mengxiao BiFuxi AI Lab, NetEase Inc.Verified email at corp.netease.com
yi leiVerified email at nwpu.edu.cn
Tingting QiaoZhejiang University, The university of Sydney, Delft University of TechnologyVerified email at zju.edu.cn
Siyuan FengByteDanceVerified email at tudelft.nl

Xinsheng Wang

Xi'an Jiaotong University

Verified email at stu.xjtu.edu.cn - Homepage

speech synthesis singing voice synthesis voice conversion


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Opencpop: A high-quality open source chinese popular song corpus for singing voice synthesis Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi arXiv preprint arXiv:2201.07429, 2022	59	2022
Msemotts: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis Y Lei, S Yang, X Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022	52	2022
Experimental study on the relation between internal flow and flashing spray characteristics of R134a using straight tube nozzles XS Wang, B Chen, R Wang, H Xin, ZF Zhou International Journal of Heat and Mass Transfer 115, 524-536, 2017	39	2017
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022	23	2022
Numerical simulation of cryogen spray cooling by a three-dimensional hybrid vortex method R Wang, B Chen, XS Wang Applied Thermal Engineering 119, 319-330, 2017	18	2017
Atomization and surface heat transfer characteristics of cryogen spray cooling with expansion-chambered nozzles XS Wang, B Chen, ZF Zhou International Journal of Heat and Mass Transfer 121, 15-27, 2018	17	2018
Generating images from spoken descriptions X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 850-865, 2021	16	2021
S2IGAN: Speech-to-Image Generation via Adversarial Learning X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg Proc. Interspeech 2020, 2292--2296, 2020	15	2020
Visual space optimization for zero-shot learning X Wang, S Pang, J Zhu, Z Li, Z Tian, Y Li arXiv preprint arXiv:1907.00330, 2019	15	2019
Anyonenet: Synchronized speech and talking head generation for arbitrary persons X Wang, Q Xie, J Zhu, L Xie, O Scharenborg IEEE Transactions on Multimedia, 2022	12	2022
ALIGN OR ATTEND? TOWARD MORE EFFICIENT AND ACCURATE SPOKEN WORD DISCOVERY USING SPEECH-TO-IMAGE RETRIEVAL L Wang, X Wang, M Hasegawa-Johnson, O Scharenborg, N Dehak IEEE International Conference on Acoustics, Speech and Signal Processing …, 2020	12	2020
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022	11	2022
Cross-speaker emotion transfer based on prosody compensation for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie arXiv preprint arXiv:2207.01198, 2022	10	2022
Learn2sing 2.0: Diffusion and mutual information-based target speaker SVS by learning from singing teacher H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi arXiv preprint arXiv:2203.16408, 2022	8	2022
Controllable crossspeaker emotion transfer for end-to-end speech synthesis T Li, X Wang, Q Xie, Z Wang, L Xie arXiv preprint arXiv:2109.06733, 2021	8	2021
Show and speak: Directly synthesize spoken description of images X Wang, S Feng, J Zhu, M Hasegawa-Johnson, O Scharenborg IEEE International Conference on Acoustics, Speech and Signal Processing, 2020	7	2020
Synthesizing spoken descriptions of images X Wang, J Van Der Hout, J Zhu, M Hasegawa-Johnson, O Scharenborg IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3242-3254, 2021	6	2021
Domain segmentation and adjustment for generalized zero-shot learning X Wang, S Pang, J Zhu arXiv preprint arXiv:2002.00226, 2020	6	2020
Adaptive deep feature aggregation using Fourier transform and low-pass filtering for robust object retrieval Z Zhou, X Wang, C Li, M Zeng, Z Li Journal of Visual Communication and Image Representation 72, 102860, 2020	5	2020
Learning fine-grained semantics in spoken language using visual grounding X Wang, T Tian, J Zhu, O Scharenborg 2021 IEEE International Symposium on Circuits and Systems (ISCAS), 1-5, 2021	4	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors