OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion H Wang, P Ren, Z Jie, X Dong, C Feng, Y Qian, L Ma, D Jiang, Y Wang, ... arXiv preprint arXiv:2407.07844, 2024 | 1 | 2024 |
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition J Liu, H Wang, W Wang, X He, J Liu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and ¡¦, 2023 | 1 | 2023 |