Hubert: Self-supervised speech representation learning by masked prediction of hidden units WN Hsu, B Bolte, YHH Tsai, K Lakhotia, R Salakhutdinov, A Mohamed IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3451-3460, 2021 | 1365 | 2021 |
Multimodal transformer for unaligned multimodal language sequences YHH Tsai, S Bai, PP Liang, JZ Kolter, LP Morency, R Salakhutdinov Proceedings of the conference. Association for Computational Linguistics …, 2019 | 910 | 2019 |
Learning factorized multimodal representations YHH Tsai, PP Liang, A Zadeh, LP Morency, R Salakhutdinov arXiv preprint arXiv:1806.06176, 2018 | 339 | 2018 |
Learning cross-domain landmarks for heterogeneous domain adaptation YHH Tsai, YR Yeh, YCF Wang Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 195 | 2016 |
Learning cross-domain landmarks for heterogeneous domain adaptation YHH Tsai, YR Yeh, YCF Wang Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 195 | 2016 |
Learning robust visual-semantic embeddings YH Hubert Tsai, LK Huang, R Salakhutdinov Proceedings of the IEEE International conference on Computer Vision, 3571-3580, 2017 | 183 | 2017 |
Transformer dissection: a unified understanding of transformer's attention via the lens of kernel YHH Tsai, S Bai, M Yamada, LP Morency, R Salakhutdinov arXiv preprint arXiv:1908.11775, 2019 | 177 | 2019 |
Self-supervised learning from a multi-view perspective YHH Tsai, Y Wu, R Salakhutdinov, LP Morency arXiv preprint arXiv:2006.05576, 2020 | 149 | 2020 |
HuBERT: How much can a bad teacher benefit ASR pre-training? WN Hsu, YHH Tsai, B Bolte, R Salakhutdinov, A Mohamed ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 125 | 2021 |
Unsupervised domain adaptation with label and structural consistency CA Hou, YHH Tsai, YR Yeh, YCF Wang IEEE Transactions on Image Processing 25 (12), 5552-5562, 2016 | 109 | 2016 |
Video relationship reasoning using gated spatio-temporal energy graph YHH Tsai, S Divvala, LP Morency, R Salakhutdinov, A Farhadi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 106 | 2019 |
Capsules with inverted dot-product attention routing YHH Tsai, N Srivastava, H Goh, R Salakhutdinov arXiv preprint arXiv:2002.04764, 2020 | 80 | 2020 |
Unsupervised domain adaptation with imbalanced cross-domain data TMH Hsu, WY Chen, CA Hou, YHH Tsai, YR Yeh, YCF Wang Proceedings of the IEEE International Conference on Computer Vision, 4121-4129, 2015 | 76 | 2015 |
Learning representations from imperfect time series data via tensor rank regularization PP Liang, Z Liu, YHH Tsai, Q Zhao, R Salakhutdinov, LP Morency arXiv preprint arXiv:1907.01011, 2019 | 67 | 2019 |
Multimodal routing: Improving local and global interpretability of multimodal language analysis YHH Tsai, MQ Ma, M Yang, R Salakhutdinov, LP Morency Proceedings of the Conference on Empirical Methods in Natural Language …, 2020 | 65 | 2020 |
Transfer neural trees for heterogeneous domain adaptation WY Chen, TMH Hsu, YHH Tsai, YCF Wang, MS Chen Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016 | 63 | 2016 |
Improving one-shot learning through fusing side information YHH Tsai, R Salakhutdinov arXiv preprint arXiv:1710.08347, 2017 | 51 | 2017 |
Strong and simple baselines for multimodal utterance embeddings PP Liang, YC Lim, YHH Tsai, R Salakhutdinov, LP Morency arXiv preprint arXiv:1906.02125, 2019 | 32 | 2019 |
Recognizing heterogeneous cross-domain data via generalized joint distribution adaptation YT Hsieh, SY Tao, YHH Tsai, YR Yeh, YCF Wang 2016 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2016 | 31 | 2016 |
A note on connecting barlow twins with negative-sample-free contrastive learning YHH Tsai, S Bai, LP Morency, R Salakhutdinov arXiv preprint arXiv:2104.13712, 2021 | 29 | 2021 |