Consensus graph representation learning for better grounded image captioning W Zhang, H Shi, S Tang, J Xiao, Q Yu, Y Zhuang Proceedings of the AAAI Conference on Artificial Intelligence 35 (4), 3394-3402, 2021 | 54 | 2021 |
Semi-supervised active learning for semi-supervised models: Exploit adversarial examples with graph-based virtual labels J Guo, H Shi, Y Kang, K Kuang, S Tang, Z Jiang, C Sun, F Wu, Y Zhuang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 45 | 2021 |
Magic: Multimodal relational graph adversarial inference for diverse and unpaired text-based image captioning W Zhang, H Shi, J Guo, S Zhang, Q Cai, J Li, S Luo, Y Zhuang Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 3335-3343, 2022 | 42 | 2022 |
Relational graph learning for grounded video description generation W Zhang, XE Wang, S Tang, H Shi, H Shi, J Xiao, Y Zhuang, WY Wang Proceedings of the 28th ACM International Conference on Multimedia, 3807-3828, 2020 | 35 | 2020 |
Adaptive hierarchical graph reasoning with semantic coherence for video-and-language inference J Li, S Tang, L Zhu, H Shi, X Huang, F Wu, Y Yang, Y Zhuang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 28 | 2021 |
Empower distantly supervised relation extraction with collaborative adversarial training T Chen, H Shi, L Liu, S Tang, J Shao, Z Chen, Y Zhuang Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 12675 …, 2021 | 21 | 2021 |
Alleviate dataset shift problem in fine-grained entity typing with virtual adversarial training H Shi, S Tang, X Gu, B Chen, Z Chen, J Shao, X Ren Proceedings of the Twenty-Ninth International Conference on International …, 2021 | 12 | 2021 |
Dilated context integrated network with cross-modal consensus for temporal emotion localization in videos J Li, J Xie, L Zhu, L Qian, S Tang, W Zhang, H Shi, S Zhang, L Wei, Q Tian, ... Proceedings of the 30th ACM International Conference on Multimedia, 5083-5092, 2022 | 10 | 2022 |
Boss: Bottom-up cross-modal semantic composition with hybrid counterfactual training for robust content-based image retrieval W Zhang, J Guo, M Li, H Shi, S Zhang, J Li, S Tang, Y Zhuang arXiv preprint arXiv:2207.04211, 2022 | 9 | 2022 |
Deciphering digital detectives: Understanding llm behaviors and capabilities in multi-agent mystery games D Wu, H Shi, Z Sun, B Liu arXiv preprint arXiv:2312.00746, 2023 | 6 | 2023 |
OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following H Shi, Z Sun, X Yuan, MA Côté, B Liu arXiv preprint arXiv:2403.03017, 2024 | 4 | 2024 |
TradExpert: Revolutionizing Trading with Mixture of Expert LLMs Q Ding, H Shi, B Liu arXiv preprint arXiv:2411.00782, 2024 | 1 | 2024 |
Enhancing agent learning through world dynamics modeling Z Sun, H Shi, MA Côté, G Berseth, X Yuan, B Liu arXiv preprint arXiv:2407.17695, 2024 | 1 | 2024 |
OPEx: A Large Language Model-Powered Framework for Embodied Instruction Following H Shi, Z Sun, X Yuan, MA Côté, B Liu Proceedings of the 23rd International Conference on Autonomous Agents and …, 2024 | 1 | 2024 |
RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection B Miao, W Zhang, J Li, S Tang, Z Li, H Shi, J Xiao, Y Zhuang arXiv preprint arXiv:2410.01737, 2024 | | 2024 |
Reasoning Makes Good Annotators: An Automatic Task-specific Rules Distilling Framework for Low-resource Relation Extraction Y Lu, J Li, X Wang, H Shi, T Chen, S Tang Findings of the Association for Computational Linguistics: EMNLP 2023, 7447-7457, 2023 | | 2023 |
The ZJU-EDL System for Entity Discovery and Linking at TAC KBP 2019. Y Hu, H Shi, T Chen, S Tang, Q Liu, Z Chen, X Ren, F Wu, Y Zhuang TAC, 2019 | | 2019 |