Æȷοì
ShiLiang Zhang
ShiLiang Zhang
SpeechLab£¬Alibaba
mail.ustc.edu.cnÀÇ À̸ÞÀÏ È®ÀεÊ
Á¦¸ñ
Àοë
Àοë
¿¬µµ
Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition
J Zhang, J Du, S Zhang, D Liu, Y Hu, J Hu, S Wei, L Dai
Pattern Recognition 71, 196-206, 2017
2472017
Qwen-audio: Advancing universal audio understanding via unified large-scale audio-language models
Y Chu, J Xu, X Zhou, Q Yang, S Zhang, Z Yan, C Zhou, J Zhou
arXiv preprint arXiv:2311.07919, 2023
1632023
Deep-FSMN for large vocabulary continuous speech recognition
S Zhang, M Lei, Z Yan, L Dai
2018 IEEE International Conference on Acoustics, Speech and Signal ¡¦, 2018
1302018
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge
F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and ¡¦, 2022
922022
Feedforward sequential memory networks: A new structure to learn long-term dependency
S Zhang, C Liu, H Jiang, S Wei, L Dai, Y Hu
arXiv preprint arXiv:1512.08301, 2015
902015
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models
S Zhang, H Jiang, M Xu, J Hou, L Dai
ACL2015, 495, 2015
882015
Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition
Z Gao, S Zhang, I McLoughlin, Z Yan
arXiv preprint arXiv:2206.08317, 2022
832022
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge
HB Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng ...
ICASSP, 2022
59*2022
Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition.
S Zhang, M Lei, Z Yan
Interspeech, 2180-2184, 2019
59*2019
LauraGPT: Listen, attend, understand, and regenerate audio with GPT
J Wang, Z Du, Q Chen, Y Chu, Z Gao, Z Li, K Hu, X Zhou, J Xu, Z Ma, ...
54*2023
MDERank: A masked document embedding rank approach for unsupervised keyphrase extraction
L Zhang, Q Chen, W Wang, C Deng, SL Zhang, B Li, W Wang, X Cao
arXiv preprint arXiv:2110.06651, 2021
522021
emotion2vec: Self-supervised pre-training for speech emotion representation
Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen
arXiv preprint arXiv:2312.15185, 2023
472023
Simplified self-attention for transformer-based end-to-end speech recognition
H Luo, S Zhang, M Lei, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 75-81, 2021
472021
Robust audio-visual speech recognition using bimodal DFSMN with multi-condition training and dropout regularization
S Zhang, M Lei, B Ma, L Xie
ICASSP 2019-2019 IEEE international conference on acoustics, speech and ¡¦, 2019
422019
Prosospeech: Enhancing prosody with quantized vector pre-training in text-to-speech
Y Ren, M Lei, Z Huang, S Zhang, Q Chen, Z Yan, Z Zhao
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and ¡¦, 2022
402022
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition.
J Hou, S Zhang, LR Dai
Interspeech, 3692-3696, 2017
402017
Improving deep neural networks for LVCSR using dropout and shrinking structure
S Zhang, Y Bao, P Zhou, H Jiang, L Dai
2014 IEEE International Conference on Acoustics, Speech and Signal ¡¦, 2014
402014
Funcodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec
Z Du, S Zhang, K Hu, S Zheng
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and ¡¦, 2024
392024
San-m: Memory equipped self-attention for end-to-end speech recognition
Z Gao, S Zhang, M Lei, I McLoughlin
arXiv preprint arXiv:2006.01713, 2020
372020
Investigation of modeling units for mandarin speech recognition using dfsmn-ctc-smbr
S Zhang, M Lei, Y Liu, W Li
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and ¡¦, 2019
362019
ÇöÀç ½Ã½ºÅÛÀÌ ÀÛµ¿µÇÁö ¾Ê½À´Ï´Ù. ³ªÁß¿¡ ´Ù½Ã ½ÃµµÇØ ÁÖ¼¼¿ä.
ÇмúÀÚ·á 1–20