Follow
Ruchao Fan
Title
Cited by
Cited by
Year
An online attention-based model for speech recognition
R Fan, P Zhou, W Chen, J Jia, G Liu
Proc. Interspeech 2019, 4390--4394, 2019
572019
CASS-NAT: CTC alignment-based single step non-autoregressive transformer for speech recognition
R Fan, W Chu, P Chang, J Xiao
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
392021
Improving generalization of transformer for speech recognition with parallel schedule sampling and relative positional embedding
P Zhou, R Fan, W Chen, J Jia
arXiv preprint arXiv:1911.00203, 2019
302019
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR
R Fan, A Alwan
Proc. Interspeech 2022, 4900--4904, 2022
262022
Fundamental frequency feature normalization and data augmentation for child speech recognition
G Yeung, R Fan, A Alwan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
222021
Towards better domain adaptation for self-supervised models: A case study of child asr
R Fan, Y Zhu, J Wang, A Alwan
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1242-1252, 2022
192022
An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
R Fan, W Chu, P Chang, J Xiao, A Alwan
Proc. Interspeech 2021, 3715--3719, 2021
182021
Bi-apc: Bidirectional autoregressive predictive coding for unsupervised pre-training and its application to children’s asr
R Fan, A Afshan, A Alwan
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
142021
Exploring the use of an unsupervised autoregressive model as a shared encoder for text-dependent speaker verification
V Ravi, R Fan, A Afshan, H Lu, A Alwan
Proc. Interspeech 2020, 766--770, 2020
142020
LPC augment: an LPC-based ASR data augmentation algorithm for low and zero-resource children’s dialects
A Johnson, R Fan, R Morris, A Alwan
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
92022
Fundamental frequency feature warping for frequency normalization and data augmentation in child automatic speech recognition
G Yeung, R Fan, A Alwan
Speech Communication 135, 1-10, 2021
92021
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System
J Wang, Y Zhu, R Fan, W Chu, A Alwan
Proc. Interspeech 2021, 1279--1283, 2021
92021
Cnn-based audio front end processing on speech recognition
R Fan, G Liu
2018 International Conference on Audio, Language and Image Processing …, 2018
82018
A ctc alignment-based non-autoregressive transformer for end-to-end automatic speech recognition
R Fan, W Chu, P Chang, A Alwan
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1436-1448, 2023
72023
CTCBERT: Advancing hidden-unit BERT with CTC objectives
R Fan, Y Wang, Y Gaur, J Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
52023
Towards better meta-initialization with task augmentation for kindergarten-aged speech recognition
Y Zhu, R Fan, A Alwan
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
42022
Acoustic-aware non-autoregressive spell correction with mask sample decoding
R Fan, G Ye, Y Gaur, J Li
arXiv preprint arXiv:2210.08665, 2022
22022
Research on end-to-end speech recognition [D]
R Fan
Beijing University of Posts and Telecommunications, 2-5, 2019
2*2019
UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models
R Fan, NB Shankar, A Alwan
IEEE Signal Processing Letters, 2024
12024
Improving the Accuracy and Inference Efficiency for Low-resource Automatic Speech Recognition
R Fan
UCLA, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20