Follow
Jaemin Cho
Title
Cited by
Cited by
Year
Unifying Vision-and-Language Tasks via Text Generation
J Cho, J Lei, H Tan, M Bansal
ICML, 2021
1862021
A Hierarchical Latent Structure for Variational Conversation Modeling
Y Park, J Cho, G Kim
NAACL, 2018
1112018
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi
EMNLP, 2020
652020
Mixture Content Selection for Diverse Sequence Generation
J Cho, M Seo, H Hajishirzi
EMNLP, 2019
482019
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
YL Sung, J Cho, M Bansal
CVPR, 2022
472022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers
J Cho, A Zala, M Bansal
arXiv preprint arXiv:2202.04053, 2022
302022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
YL Sung, J Cho, M Bansal
NeurIPS, 2022
132022
Fine-grained Image Captioning with CLIP Reward
J Cho, S Yoon, A Kale, F Dernoncourt, T Bui, M Bansal
Findings of NAACL, 2022
122022
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Z Tang, J Cho, H Tan, M Bansal
NeurIPS, 2021
122021
TVLT: Textless Vision-Language Transformer
Z Tang, J Cho, Y Nie, M Bansal
NeurIPS, 2022
32022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
RG Reddy, X Rui, M Li, X Lin, H Wen, J Cho, L Huang, M Bansal, A Sil, ...
AAAI, 2022
32022
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention
Z Tang, J Cho, J Lei, M Bansal
WACV, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–12