Sicong Leng
Nanyang Technological University & Alibaba DAMO Academy
Verified email at e.ntu.edu.sg
Title / Cited by / Year
Mitigating object hallucinations in large vision-language models through visual contrastive decoding
S Leng, H Zhang, G Chen, X Li, S Lu, C Miao, L Bing
CVPR 2024, 2024
Cited by: 177; Year: 2024
VideoLLaMA 2: Advancing spatial-temporal modeling and audio understanding in video-llms
Z Cheng, S Leng, H Zhang, Y Xin, X Li, G Chen, Y Zhu, W Zhang, Z Luo, ...
arXiv preprint arXiv:2406.07476, 2024
Cited by: 175; Year: 2024
Interventional video grounding with dual contrastive learning
G Nan, R Qiao, Y Xiao, J Liu, S Leng, H Zhang, W Lu
CVPR 2021, 2021
Cited by: 175; Year: 2021
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
XT Hang Du, Sicheng Zhang, Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui ...
CVPR 2024, 2024
Cited by: 16*; Year: 2024
AGLA: Mitigating object hallucinations in large vision-language models with assembly of global and local attention
W An, F Tian, S Leng, J Nie, H Lin, QY Wang, G Dai, P Chen, S Lu
CVPR 2025, 2024
Cited by: 13; Year: 2024
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
S Leng, Y Zhou, MH Dupty, WS Lee, SC Joyce, W Lu
ACL 2023, Area Chair Award, 2023
Cited by: 9; Year: 2023
Speaker-oriented latent structures for dialogue-based relation extraction
G Nan, G Luo, S Leng, Y Xiao, W Lu
arXiv preprint arXiv:2109.05182, 2021
Cited by: 9; Year: 2021
The curse of multi-modalities: Evaluating hallucinations of large multimodal models across language, visual, and audio
S Leng, Y Xing, Z Cheng, Y Zhou, H Zhang, X Li, D Zhao, S Lu, C Miao, ...
arXiv preprint arXiv:2410.12787, 2024
Cited by: 6; Year: 2024
Constrained Layout Generation with Factor Graphs
MH Dupty, Y Dong, S Leng, G Fu, YL Goh, W Lu, WS Lee
CVPR 2024, 2024
Cited by: 4; Year: 2024
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
B Zhang, K Li, Z Cheng, Z Hu, Y Yuan, G Chen, S Leng, Y Jiang, H Zhang, ...
arXiv preprint arXiv:2501.13106, 2025
Cited by: 1; Year: 2025
MMR1: Advancing the Frontiers of Multimodal Reasoning
Sicong Leng, Jing Wang, Jiaxi Li, Hao Zhang, Zhiqiang Hu, Boqiang Zhang ...
GitHub, 2025
Year: 2025
Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions
J Xu, G Nan, S Guan, S Leng, Y Liu, Z Wang, Y Ma, Z Zhou, Y Hou, X Tao
arXiv preprint arXiv:2502.08657, 2025
Year: 2025
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
Z Cheng, H Zhang, K Li, S Leng, Z Hu, F Wu, D Zhao, X Li, L Bing
CVPR 2025, 2024
Year: 2024
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Y Zhou, T Faith, Y Xu, S Leng, X Xu, Y Liu, RSM Goh
NeurIPS 2024, 2024
Year: 2024