Destruction and construction learning for fine-grained image recognition Y Chen, Y Bai, W Zhang, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 538 | 2019 |
Visualizing and comparing AlexNet and VGG using deconvolutional layers W Yu, K Yang, Y Bai, T Xiao, H Yao, Y Rui Proceedings of the 33 rd International Conference on Machine Learning, 2016 | 260* | 2016 |
Rc-net: A general framework for incorporating knowledge into word representations C Xu, Y Bai, J Bian, B Gao, G Wang, X Liu, TY Liu Proceedings of the 23rd ACM international conference on conference on …, 2014 | 252 | 2014 |
Look-into-object: Self-supervised structure modeling for object recognition M Zhou, Y Bai, W Zhang, T Zhao, T Mei Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 94 | 2020 |
Vrr-vg: Refocusing visually-relevant relationships Y Liang, Y Bai, W Zhang, X Qian, L Zhu, T Mei Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 88 | 2019 |
Deep Attention Neural Tensor Network for Visual Question Answering Y Bai, J Fu, T Zhao, T Mei Proceedings of the European Conference on Computer Vision (ECCV), 20-35, 2018 | 77 | 2018 |
Bag-of-words based deep neural network for image retrieval Y Bai, W Yu, T Xiao, C Xu, K Yang, WY Ma, T Zhao Proceedings of the 22nd ACM international conference on Multimedia, 229-232, 2014 | 55 | 2014 |
Responsive listening head generation: a benchmark dataset and baseline M Zhou, Y Bai, W Zhang, T Yao, T Zhao, T Mei European Conference on Computer Vision, 124-142, 2022 | 45 | 2022 |
Products-10k: A large-scale product recognition dataset Y Bai, Y Chen, W Yu, L Wang, W Zhang arXiv preprint arXiv:2008.10545, 2020 | 45 | 2020 |
Automatic image dataset construction from click-through logs using deep neural network Y Bai, K Yang, W Yu, C Xu, WY Ma, T Zhao Proceedings of the 23rd ACM international conference on Multimedia, 441-450, 2015 | 33 | 2015 |
Visualizing and understanding patch interactions in vision transformer J Ma, Y Bai, B Zhong, W Zhang, T Yao, T Mei IEEE Transactions on Neural Networks and Learning Systems, 2023 | 32 | 2023 |
Directional self-supervised learning for heavy image augmentations Y Bai, Y Yang, W Zhang, T Mei Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 28* | 2022 |
Freeform body motion generation from speech J Xu, W Zhang, Y Bai, Q Sun, T Mei arXiv preprint arXiv:2203.02291, 2022 | 19 | 2022 |
DNN Flow: DNN Feature Pyramid based Image Matching. W Yu, K Yang, Y Bai, H Yao, Y Rui BMVC, 2014 | 17 | 2014 |
Learning high-level image representation for image retrieval via multi-task dnn using clickthrough data Y Bai, K Yang, W Yu, WY Ma, T Zhao arXiv preprint arXiv:1312.4740, 2013 | 17 | 2013 |
Exploiting relationship for complex-scene image generation T Hua, H Zheng, Y Bai, W Zhang, XP Zhang, T Mei Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 1584-1592, 2021 | 16 | 2021 |
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations X Ma, M Zhou, T Liang, Y Bai, T Zhao, H Chen, Y Jin arXiv preprint arXiv:2406.10797, 2024 | 15 | 2024 |
Dynamic Prompt Optimizing for Text-to-Image Generation W Mo, T Zhang, Y Bai, B Su, JR Wen, Q Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 9 | 2024 |
Interactive conversational head generation M Zhou, Y Bai, W Zhang, T Yao, T Zhao arXiv preprint arXiv:2307.02090, 2023 | 9 | 2023 |
Learning cross space mapping via DNN using large scale click-through logs W Yu, K Yang, Y Bai, H Yao, Y Rui IEEE Transactions on Multimedia 17 (11), 2000-2007, 2015 | 8 | 2015 |