Æȷοì
Abhay Zala
Á¦¸ñ
Àοë
Àοë
¿¬µµ
Dall-eval: Probing the reasoning skills and social biases of text-to-image generative transformers
J Cho, A Zala, M Bansal
arXiv preprint arXiv:2202.04053 1 (2), 3, 2022
932022
Dall-eval: Probing the reasoning skills and social biases of text-to-image generation models
J Cho, A Zala, M Bansal
Proceedings of the IEEE/CVF International Conference on Computer Vision ¡¦, 2023
472023
Visual programming for text-to-image generation and evaluation
J Cho, A Zala, M Bansal
arXiv preprint arXiv:2305.15328, 2023
212023
Videodirectorgpt: Consistent multi-scene video generation via llm-guided planning
H Lin, A Zala, J Cho, M Bansal
arXiv preprint arXiv:2309.15091, 2023
152023
Arramon: A joint navigation-assembly instruction interpretation task in dynamic environments
H Kim, A Zala, G Burri, H Tan, M Bansal
arXiv preprint arXiv:2011.07660, 2020
152020
Fixmypose: Pose correctional captioning and retrieval
H Kim, A Zala, G Burri, M Bansal
Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13161 ¡¦, 2021
132021
Hierarchical video-moment retrieval and step-captioning
A Zala, J Cho, S Kottur, X Chen, B Oguz, Y Mehdad, M Bansal
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern ¡¦, 2023
92023
CoSIm: commonsense reasoning for counterfactual scene imagination
H Kim, A Zala, M Bansal
arXiv preprint arXiv:2207.03961, 2022
32022
EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
A Zala, J Cho, H Lin, J Yoon, M Bansal
arXiv preprint arXiv:2403.12014, 2024
2024
Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation
J Cho, A Zala, M Bansal
Advances in Neural Information Processing Systems 36, 2024
2024
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning
A Zala, H Lin, J Cho, M Bansal
arXiv preprint arXiv:2310.12128, 2023
2023
Supplementary Materials for DALL-EVAL: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
J Cho, A Zala, M Bansal
Supplementary Material for Hierarchical Video-Moment Retrieval and Step-Captioning
A Zala, J Cho, S Kottur, X Chen, B Oguz, Y Mehdad, M Bansal, UNCC Hill
Health 5, 13, 0
ÇöÀç ½Ã½ºÅÛÀÌ ÀÛµ¿µÇÁö ¾Ê½À´Ï´Ù. ³ªÁß¿¡ ´Ù½Ã ½ÃµµÇØ ÁÖ¼¼¿ä.
ÇмúÀÚ·á 1–13