Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks JG Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang ... European Conference on Computer Vision (ECCV), 2020 | 644* | 2020 |
Large Scale Incremental Learning YF Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 | 471* | 2019 |
Refining of segmental boundaries in speech waveforms using contextual-dependent models Y Zhao, M Chu, JL Zhou, L Wang US Patent 7,496,512, 2009 | 321 | 2009 |
VinVL: Making Visual Representations Matter in Vision-Language Models P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao CVPR2021, 2021 | 274* | 2021 |
Handwriting-based user interface for correction of speech recognition errors L Wang, FKP Soong US Patent App. 12/042,344, 2009 | 259 | 2009 |
Unnatural prosody detection in speech synthesis Y Zhao, FKP Soong, M Chu, L Wang US Patent 8,583,438, 2013 | 246 | 2013 |
Rethinking Classification and Localization for Object Detection YF Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern ¡¦, 2020 | 210 | 2020 |
Real-time Animation for an Expressive Avatar N Xu, L Wang, FKP Soong, X Liang, Q Luo, YQ Xu, X Zou US Patent App. 12/950,801, 2012 | 163 | 2012 |
Speech and text driven HMM-based body animation synthesis L Wang, L Ma, FKP Soong US Patent 8,224,652, 2012 | 150 | 2012 |
End-to-End Human Pose and Mesh Reconstruction with Transformers K Lin, L Wang, Z Liu CVPR2021, 2020 | 143 | 2020 |
PHOTO-REAL TALKING HEAD WITH DEEP BIDIRECTIONAL LSTM B Fan, L Wang, FK Soong, L Xie ICASSP, 2015 | 116 | 2015 |
Incremental classifier learning with generative adversarial networks Y Wu, Y Chen, L Wang, Y Ye, Z Liu, Y Guo, Z Zhang, Y Fu arXiv preprint arXiv:1802.00853, 2018 | 93 | 2018 |
End-to-end semi-supervised object detection with soft teacher M Xu, Z Zhang, H Hu, J Wang, L Wang, F Wei, X Bai, Z Liu Proceedings of the IEEE/CVF International Conference on Computer Vision ¡¦, 2021 | 73 | 2021 |
SEED: Self-supervised Distillation For Visual Representation Z Fang, J Wang, L Wang, L Zhang, Y Yang, Z Liu ICLR 2021, 2021 | 60 | 2021 |
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 58 | 2021 |
A deep bidirectional LSTM approach for video-realistic talking head B Fan, L Xie, S Yang, L Wang, FK Soong Multimedia Tools and Applications 75 (9), 5287-5309, 2016 | 55 | 2016 |
Synthesizing photo-real talking head via trajectory-guided sample selection L Wang, X Qian, W Han, FK Soong Eleventh Annual Conference of the International Speech Communication Association, 2010 | 52 | 2010 |
Refining segmental boundaries for TTS database using fine contextual-dependent boundary models L Wang, Y Zhao, M Chu, J Zhou, Z Cao 2004 IEEE International Conference on Acoustics, Speech, and Signal ¡¦, 2004 | 49 | 2004 |
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption Z Yang, Y Lu, J Wang, X Yin, D Florencio, L Wang, C Zhang, L Zhang, ... CVPR2021, 2020 | 48 | 2020 |
Cross-Domain Complementary Learning Using Pose for Multi-Person Part Segmentation MTS Kevin Lin, Lijuan Wang, Kun Luo, Yinpeng Chen, Zicheng Liu IEEE Transactions on Circuits and Systems for Video Technology, 2020 | 48 | 2020 |