Xinsheng Wang
Title
Cited by
Year
Opencpop: A high-quality open source Chinese popular song corpus for singing voice synthesis
Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi
arXiv preprint arXiv:2201.07429, 2022
Cited by: 59; Year: 2022
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis
Y Lei, S Yang, X Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022
Cited by: 52; Year: 2022
Experimental study on the relation between internal flow and flashing spray characteristics of R134a using straight tube nozzles
XS Wang, B Chen, R Wang, H Xin, ZF Zhou
International Journal of Heat and Mass Transfer 115, 524-536, 2017
Cited by: 39; Year: 2017
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1448-1460, 2022
Cited by: 23; Year: 2022
Numerical simulation of cryogen spray cooling by a three-dimensional hybrid vortex method
R Wang, B Chen, XS Wang
Applied Thermal Engineering 119, 319-330, 2017
Cited by: 18; Year: 2017
Atomization and surface heat transfer characteristics of cryogen spray cooling with expansion-chambered nozzles
XS Wang, B Chen, ZF Zhou
International Journal of Heat and Mass Transfer 121, 15-27, 2018
Cited by: 17; Year: 2018
Generating images from spoken descriptions
X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 850-865, 2021
Cited by: 16; Year: 2021
S2IGAN: Speech-to-Image Generation via Adversarial Learning
X Wang, T Qiao, J Zhu, A Hanjalic, O Scharenborg
Proc. Interspeech 2020, 2292-2296, 2020
Cited by: 15; Year: 2020
Visual space optimization for zero-shot learning
X Wang, S Pang, J Zhu, Z Li, Z Tian, Y Li
arXiv preprint arXiv:1907.00330, 2019
Cited by: 15; Year: 2019
AnyoneNet: Synchronized speech and talking head generation for arbitrary persons
X Wang, Q Xie, J Zhu, L Xie, O Scharenborg
IEEE Transactions on Multimedia, 2022
Cited by: 12; Year: 2022
Align or attend? Toward more efficient and accurate spoken word discovery using speech-to-image retrieval
L Wang, X Wang, M Hasegawa-Johnson, O Scharenborg, N Dehak
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2020
Cited by: 12; Year: 2020
Multi-speaker multi-style text-to-speech synthesis with single-speaker single-style training data scenarios
Q Xie, T Li, X Wang, Z Wang, L Xie, G Yu, G Wan
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
Cited by: 11; Year: 2022
Cross-speaker emotion transfer based on prosody compensation for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, M Jiang, L Xie
arXiv preprint arXiv:2207.01198, 2022
Cited by: 10; Year: 2022
Learn2Sing 2.0: Diffusion and mutual information-based target speaker SVS by learning from singing teacher
H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi
arXiv preprint arXiv:2203.16408, 2022
Cited by: 8; Year: 2022
Controllable cross-speaker emotion transfer for end-to-end speech synthesis
T Li, X Wang, Q Xie, Z Wang, L Xie
arXiv preprint arXiv:2109.06733, 2021
Cited by: 8; Year: 2021
Show and speak: Directly synthesize spoken description of images
X Wang, S Feng, J Zhu, M Hasegawa-Johnson, O Scharenborg
IEEE International Conference on Acoustics, Speech and Signal Processing, 2020
Cited by: 7; Year: 2020
Synthesizing spoken descriptions of images
X Wang, J Van Der Hout, J Zhu, M Hasegawa-Johnson, O Scharenborg
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3242-3254, 2021
Cited by: 6; Year: 2021
Domain segmentation and adjustment for generalized zero-shot learning
X Wang, S Pang, J Zhu
arXiv preprint arXiv:2002.00226, 2020
Cited by: 6; Year: 2020
Adaptive deep feature aggregation using Fourier transform and low-pass filtering for robust object retrieval
Z Zhou, X Wang, C Li, M Zeng, Z Li
Journal of Visual Communication and Image Representation 72, 102860, 2020
Cited by: 5; Year: 2020
Learning fine-grained semantics in spoken language using visual grounding
X Wang, T Tian, J Zhu, O Scharenborg
2021 IEEE International Symposium on Circuits and Systems (ISCAS), 1-5, 2021
Cited by: 4; Year: 2021