Jaemin Cho
Title
Cited by
Year
Unifying Vision-and-Language Tasks via Text Generation
J Cho, J Lei, H Tan, M Bansal
ICML, 2021
Cited by 331, 2021
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
YL Sung, J Cho, M Bansal
CVPR, 2022
Cited by 138, 2022
A Hierarchical Latent Structure for Variational Conversation Modeling
Y Park, J Cho, G Kim
NAACL, 2018
Cited by 120, 2018
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi
EMNLP, 2020
Cited by 89, 2020
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
J Cho, A Zala, M Bansal
ICCV, 2023
Cited by 77*, 2023
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
YL Sung, J Cho, M Bansal
NeurIPS, 2022
Cited by 54, 2022
Mixture Content Selection for Diverse Sequence Generation
J Cho, M Seo, H Hajishirzi
EMNLP, 2019
Cited by 53, 2019
Fine-grained Image Captioning with CLIP Reward
J Cho, S Yoon, A Kale, F Dernoncourt, T Bui, M Bansal
Findings of NAACL, 2022
Cited by 35, 2022
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Z Tang, J Cho, H Tan, M Bansal
NeurIPS, 2021
Cited by 19, 2021
TVLT: Textless Vision-Language Transformer
Z Tang, J Cho, Y Nie, M Bansal
NeurIPS, 2022
Cited by 12, 2022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
RG Reddy, X Rui, M Li, X Lin, H Wen, J Cho, L Huang, M Bansal, A Sil, ...
AAAI, 2022
Cited by 5, 2022
Self-Chained Image-Language Model for Video Localization and Question Answering
S Yu, J Cho, P Yadav, M Bansal
NeurIPS, 2023
Cited by 3, 2023
Visual Programming for Text-to-Image Generation and Evaluation
J Cho, A Zala, M Bansal
NeurIPS, 2023
Cited by 2, 2023
Paxion: Patching Action Knowledge in Video-Language Foundation Models
Z Wang, A Blume, S Li, G Liu, J Cho, Z Tang, M Bansal, H Ji
NeurIPS, 2023
Cited by 2, 2023
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention
Z Tang, J Cho, J Lei, M Bansal
WACV, 2023
Cited by 2, 2023
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
J Cho, L Li, Z Yang, Z Gan, L Wang, M Bansal
arXiv preprint arXiv:2304.06671, 2023
Cited by 1, 2023
Hierarchical Video-Moment Retrieval and Step-Captioning
A Zala, J Cho, S Kottur, X Chen, B Oğuz, Y Mehdad, M Bansal
CVPR, 2023
Cited by 1, 2023
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
H Lin, A Zala, J Cho, M Bansal
arXiv preprint arXiv:2309.15091, 2023
2023
Image captioning
J Cho, S Yoon, AG Kale, TH Bui, F Dernoncourt
US Patent App. 17/455,533, 2023
2023
Supplementary Materials for DALL-EVAL: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
J Cho, A Zala, M Bansal