Follow
Xinhao Mei
Title
Cited by
Cited by
Year
Audioldm: Text-to-audio generation with latent diffusion models
H Liu, Z Chen, Y Yuan, X Mei, X Liu, D Mandic, W Wang, MD Plumbley
arXiv preprint arXiv:2301.12503, 2023
2092023
Wavcaps: A chatgpt-assisted weakly-labelled audio captioning dataset for audio-language multimodal research
X Mei, C Meng, H Liu, Q Kong, T Ko, C Zhao, MD Plumbley, Y Zou, ...
arXiv preprint arXiv:2303.17395, 2023
77*2023
Audio captioning transformer
X Mei, X Liu, Q Huang, MD Plumbley, W Wang
arXiv preprint arXiv:2107.09817, 2021
662021
On metric learning for audio-text cross-modal retrieval
X Mei, X Liu, J Sun, MD Plumbley, W Wang
arXiv preprint arXiv:2203.15537, 2022
462022
An encoder-decoder based audio captioning system with transfer and reinforcement learning
X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ...
arXiv preprint arXiv:2108.02752, 2021
452021
AudioLDM 2: Learning holistic audio generation with self-supervised pretraining
H Liu, Q Tian, Y Yuan, X Liu, X Mei, Q Kong, Y Wang, W Wang, Y Wang, ...
arXiv preprint arXiv:2308.05734, 2023
36*2023
Automated audio captioning: an overview of recent progress and new challenges
X Mei, X Liu, MD Plumbley, W Wang
EURASIP journal on audio, speech, and music processing 2022 (1), 26, 2022
312022
Leveraging pre-trained bert for audio captioning
X Liu, X Mei, Q Huang, J Sun, J Zhao, H Liu, MD Plumbley, V Kilic, ...
2022 30th European Signal Processing Conference (EUSIPCO), 1145-1149, 2022
272022
Separate what you describe: Language-queried audio source separation
X Liu, H Liu, Q Kong, X Mei, J Zhao, Q Huang, MD Plumbley, W Wang
arXiv preprint arXiv:2203.15147, 2022
262022
CL4AC: A contrastive loss for audio captioning
X Liu, Q Huang, X Mei, T Ko, HL Tang, MD Plumbley, W Wang
arXiv preprint arXiv:2107.09990, 2021
242021
Diverse audio captioning via adversarial training
X Mei, X Liu, J Sun, MD Plumbley, W Wang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
232022
Language-based audio retrieval with pre-trained models
X Mei, X Liu, H Liu, J Sun, MD Plumbley, W Wang
DCASE 2022 Challenge, Tech. Rep., 2022
192022
An encoder-decoder based audio captioning system with transfer and reinforcement learning for DCASE challenge 2021 task 6
X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ...
DCASE2021 Challenge, Tech. Rep, Tech. Rep, 2021
152021
Visually-aware audio captioning with adaptive audio-visual attention
X Liu, Q Huang, X Mei, H Liu, Q Kong, J Sun, S Li, T Ko, Y Zhang, ...
arXiv preprint arXiv:2210.16428, 2022
122022
Surrey system for dcase 2022 task 5: Few-shot bioacoustic event detection with segment-level metric learning
H Liu, X Liu, X Mei, Q Kong, W Wang, MD Plumbley
arXiv preprint arXiv:2207.10547, 2022
112022
Simple pooling front-ends for efficient audio classification
X Liu, H Liu, Q Kong, X Mei, MD Plumbley, W Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
102023
Automated audio captioning with keywords guidance
X Mei, X Liu, H Liu, J Sun, MD Plumbley, W Wang
Proc. Detection and Classification of Acoustic Scenes and Events, 2022
92022
Deep neural decision forest for acoustic scene classification
J Sun, X Liu, X Mei, J Zhao, MD Plumbley, V Kılı็, W Wang
2022 30th European Signal Processing Conference (EUSIPCO), 772-776, 2022
82022
Segment-level metric learning for few-shot bioacoustic event detection
H Liu, X Liu, X Mei, Q Kong, W Wang, MD Plumbley
arXiv preprint arXiv:2207.07773, 2022
82022
First-shot anomalous sound detection with GMM clustering and finetuned attribute classification using audio pretrained model
J Tian, H Zhang, Q Zhu, F Xiao, H Liu, X Mei, Y Liu, W Wang, J Guan
Technical report, DCASE 2023 Challenge, 2023
52023
The system can't perform the operation now. Try again later.
Articles 1–20