Follow
Alexander H. Liu
Alexander H. Liu
Verified email at mit.edu - Homepage
Title
Cited by
Cited by
Year
A unified feature disentangler for multi-domain image translation and manipulation
AH Liu, YC Liu, YY Yeh, YCF Wang
Advances in neural information processing systems 31, 2018
3812018
Towards scene understanding: Unsupervised monocular depth estimation with semantic-aware representation
PY Chen, AH Liu, YC Liu, YCF Wang
Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2019
2442019
Non-autoregressive predictive coding for learning speech representations from local dependencies
AH Liu, YA Chung, J Glass
arXiv preprint arXiv:2011.00406, 2020
832020
Contrastive audio-visual masked autoencoder
Y Gong, A Rouditchenko, AH Liu, D Harwath, L Karlinsky, H Kuehne, ...
arXiv preprint arXiv:2210.07839, 2022
642022
Towards end-to-end unsupervised speech recognition
AH Liu, WN Hsu, M Auli, A Baevski
2022 IEEE Spoken Language Technology Workshop (SLT), 221-228, 2023
592023
Parp: Prune, adjust and re-prune for self-supervised speech recognition
CIJ Lai, Y Zhang, AH Liu, S Chang, YL Liao, YS Chuang, K Qian, ...
Advances in Neural Information Processing Systems 34, 21256-21272, 2021
532021
Spoken moments: Learning joint audio-visual representations from video descriptions
M Monfort, SY Jin, A Liu, D Harwath, R Feris, J Glass, A Oliva
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
522021
Towards unsupervised speech recognition and synthesis with quantized speech representation learning
AH Liu, T Tu, H Lee, L Lee
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
522020
Adversarial training of end-to-end speech recognition using a criticizing language model
AH Liu, H Lee, L Lee
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
502019
Listen, think, and understand
Y Gong, H Luo, AH Liu, L Karlinsky, J Glass
arXiv preprint arXiv:2305.10790, 2023
372023
Cross-modal discrete representation learning
AH Liu, SY Jin, CIJ Lai, A Rouditchenko, A Oliva, J Glass
arXiv preprint arXiv:2106.05438, 2021
352021
Simple and effective unsupervised speech synthesis
AH Liu, CIJ Lai, WN Hsu, M Auli, A Baevski, J Glass
arXiv preprint arXiv:2204.02524, 2022
172022
Worse wer, but better bleu? leveraging word embedding as intermediate in multitask end-to-end speech translation
SP Chuang, TW Sung, AH Liu, H Lee
arXiv preprint arXiv:2005.10678, 2020
162020
Improving automatic speech recognition and speech translation via word embedding prediction
SP Chuang, AH Liu, TW Sung, H Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 93-105, 2020
142020
Sequence-to-sequence automatic speech recognition with word embedding regularization and fused decoding
AH Liu, TW Sung, SP Chuang, H Lee, L Lee
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
112020
Uavm: Towards unifying audio and visual models
Y Gong, AH Liu, A Rouditchenko, J Glass
IEEE Signal Processing Letters 29, 2437-2441, 2022
102022
End-to-end whispered speech recognition with frequency-weighted approaches and pseudo whisper pre-training
HJ Chang, AH Liu, H Lee, L Lee
2021 IEEE Spoken Language Technology Workshop (SLT), 186-193, 2021
102021
Semi-supervised learning for multi-speaker text-to-speech synthesis using discrete speech representation
T Tu, YJ Chen, AH Liu, H Lee
arXiv preprint arXiv:2005.08024, 2020
102020
Dinosr: Self-distillation and online clustering for self-supervised speech representation learning
AH Liu, HJ Chang, M Auli, WN Hsu, J Glass
Advances in Neural Information Processing Systems 36, 2024
72024
Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering
HJ Chang, AH Liu, J Glass
arXiv preprint arXiv:2305.11072, 2023
62023
The system can't perform the operation now. Try again later.
Articles 1–20