Jie Lei 雷杰

Cited by

	All	Since 2019
Citations	3046	3042
h-index	19	19
i10-index	20	20

1400

700

350

1050

20192020202120222023202445 116 353 793 1326 407

Public access

View all

9 articles

1 article

available

not available

Based on funding mandates

Co-authors

Mohit BansalParker Distinguished Professor, Computer Science, UNC Chapel HillVerified email at cs.unc.edu
Tamara L BergAssociate Professor, Computer Science, UNC Chapel HillVerified email at cs.unc.edu
Licheng Yu 虞立成Research Scientist and Manager, Facebook AIVerified email at fb.com
Linjie (Lindsey) LiSenior Researcher, MicrosoftVerified email at microsoft.com
Zhe GanResearch Scientist, AppleVerified email at apple.com
Luowei ZhouResearch Scientist, Google DeepmindVerified email at google.com
Hao TanAdobe ResearchVerified email at adobe.com
Jaemin ChoPhD Student at UNC Chapel HillVerified email at cs.unc.edu
Gedas BertasiusAssistant Professor, University of North Carolina at Chapel HillVerified email at cs.unc.edu
Yelong ShenMicrosoftVerified email at microsoft.com
Liwei WangAssistant Professor at The Chinese University of Hong KongVerified email at cse.cuhk.edu.hk
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Zineng TangUC BerkeleyVerified email at cs.unc.edu
Thomas WolfCo-founder at HuggingFaceVerified email at polytechnique.edu
Yang WangComputer Science, Concordia UniversityVerified email at concordia.ca

Jie Lei 雷杰

Research Scientist, Meta AI

Verified email at fb.com - Homepage

Computer Vision Natural Language Processing Vision and Language


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
TVQA: Localized, compositional video question answering J Lei, L Yu, M Bansal, TL Berg EMNLP 2018, 2018	594	2018
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling J Lei, L Li, L Zhou, Z Gan, TL Berg, M Bansal, J Liu CVPR 2021, Best Student Paper Honorable Mention, 2021	584	2021
Unifying vision-and-language tasks via text generation J Cho, J Lei, H Tan, M Bansal ICML 2021, 2021	440	2021
Tvr: A large-scale dataset for video-subtitle moment retrieval J Lei, L Yu, TL Berg, M Bansal ECCV 2020, 2020	228	2020
TVQA+: Spatio-temporal grounding for video question answering J Lei, L Yu, TL Berg, M Bansal ACL 2020, 2020	219	2020
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning J Lei, L Wang, Y Shen, D Yu, TL Berg, M Bansal ACL 2020, 2020	176	2020
QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries J Lei, TL Berg, M Bansal NeurIPS 2021, 2021	123*	2021
VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation L Li, J Lei, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ... NeurIPS 2021 Datasets and Benchmarks Track, 2021	96	2021
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners Z Wang, M Li, R Xu, L Zhou, J Lei, X Lin, S Wang, Z Yang, C Zhu, ... NeurIPS 2022, 2022	87	2022
Revealing single frame bias for video-and-language learning J Lei, TL Berg, M Bansal ACL 2023, 2022	83	2022
VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning H Tan, J Lei, T Wolf, M Bansal CVPR 2022 workshop on Transformers for Vision, 2021	60	2021
Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models L Li, J Lei, Z Gan, J Liu ICCV 2021, 2021	58	2021
DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization Z Tang, J Lei, M Bansal NAACL 2021, 2021	58	2021
What is More Likely to Happen Next? Video-and-Language Future Event Prediction J Lei, L Yu, TL Berg, M Bansal EMNLP 2020, 2020	57	2020
VindLU: A Recipe for Effective Video-and-Language Pretraining F Cheng, X Wang, J Lei, D Crandall, M Bansal, G Bertasius CVPR 2023, 2022	46	2022
Vision Transformers are Parameter-Efficient Audio-Visual Learners YB Lin, YL Sung, J Lei, M Bansal, G Bertasius CVPR 2023, 2022	32	2022
RESIN-11: Schema-guided event prediction for 11 newsworthy scenarios X Du, Z Zhang, S Li, P Yu, H Wang, T Lai, X Lin, Z Wang, I Liu, B Zhou, ... Proceedings of the 2022 Conference of the North American Chapter of the …, 2022	28	2022
ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound YB Lin, J Lei, M Bansal, G Bertasius ECCV 2022 Oral, 2022	28	2022
Weakly supervised image classification with coarse and fine labels J Lei, Z Guo, Y Wang 2017 14th conference on computer and robot vision (crv), 240-247, 2017	22	2017
mtvr: Multilingual moment retrieval in videos J Lei, TL Berg, M Bansal ACL 2021, 2021	11	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors