Follow
Junyi Ao
Junyi Ao
Other names敖君逸
The Chinese University of Hong Kong, Shenzhen
Verified email at link.cuhk.edu.cn - Homepage
Title
Cited by
Cited by
Year
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu, S Liu, T Ko, Q Li, Y Zhang, ...
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
1812022
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
R Wang, Q Bai, J Ao, L Zhou, Z Xiong, Z Wei, Y Zhang, T Ko, H Li
INTERSPEECH 2022, 2022
552022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Z Zhang, L Zhou, J Ao, S Liu, L Dai, J Li, F Wei
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
442022
Multi-View Self-Attention Based Transformer for Speaker Recognition
R Wang, J Ao, L Zhou, S Liu, Z Wei, T Ko, Q Li, Y Zhang
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022
422022
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data
J Ao, Z Zhang, L Zhou, S Liu, H Li, T Ko, L Dai, J Li, Y Qian, F Wei
INTERSPEECH 2022, 2022
172022
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
C Meng, J Ao, T Ko, M Wang, H Li
INTERSPEECH 2023, 2022
92022
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text
X Yue, J Ao, X Gao, H Li
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023
62023
The YiTrans speech translation system for IWSLT 2022 offline shared task
Z Zhang, J Ao
Proceedings of the 19th International Conference on Spoken Language …, 2022
52022
Self-Supervised Acoustic Word Embedding Learning via Correspondence Transformer Encoder
J Lin, X Yue, J Ao, H Li
INTERSPEECH 2023, 2023
32023
The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task
Z Zhang, J Ao, L Zhou, S Liu, F Wei, J Li
arXiv preprint arXiv:2206.05777, 2022
32022
USED: Universal Speaker Extraction and Diarization
J Ao, MS Yıldırım, R Tao, M Ge, S Wang, Y Qian, H Li
arXiv preprint arXiv:2309.10674, 2023
12023
SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
J Lin, M Ge, J Ao, L Deng, H Li
INTERSPEECH 2024, 2024
2024
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
J Ao, Y Wang, X Tian, D Chen, J Zhang, L Lu, Y Wang, H Li, Z Wu
arXiv preprint arXiv:2406.13340, 2024
2024
Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks
D Ma, X Yue, J Ao, X Gao, H Li
arXiv preprint arXiv:2402.15725, 2024
2024
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge
M Ge, Y Peng, Y Jiang, J Lin, J Ao, MS Yildirim, S Wang, H Li, M Feng
arXiv preprint arXiv:2312.16002, 2023
2023
Improving Attention-based End-to-end ASR by Incorporating an N-gram Neural Network
J Ao, T Ko
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–16