Jaemin Cho

Cited by

	All	Since 2019
Citations	1495	1486
h-index	14	14
i10-index	16	16

740

370

185

555

20182019202020212022202320247 32 45 93 272 729 315

Public access

View all

12 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Mohit BansalParker Distinguished Professor, Computer Science, UNC Chapel HillVerified email at cs.unc.edu
Abhay ZalaUniversity of North Carolina at Chapel HillVerified email at cs.unc.edu
Yi-Lin SungUNC Chapel HillVerified email at cs.unc.edu
Hao TanAdobe ResearchVerified email at adobe.com
Jie Lei 雷杰Research Scientist, Meta AIVerified email at fb.com
Zineng TangUC BerkeleyVerified email at cs.unc.edu
Hannaneh HajishirziUniversity of Washington; Allen AIVerified email at cs.washington.edu
Gunhee KimProfessor, Seoul National UniversityVerified email at snu.ac.kr
Yookoon ParkColumbia UniversityVerified email at columbia.edu
Seunghyun YoonAdobe ResearchVerified email at adobe.com
Trung H. BuiSenior Research Scientist & Research Manager, Adobe ResearchVerified email at adobe.com
Han LinPhD Student, UNC NLP GroupVerified email at cs.unc.edu
Jiasen LuSenior Research Scientist, Allen Institute of Artificial IntelligenceVerified email at allenai.org
Aniruddha KembhaviSenior Director of Computer Vision, Allen Institute of Artificial IntelligenceVerified email at allenai.org
Minjoon SeoKAIST; Twelve LabsVerified email at kaist.ac.kr
Heng JiProfessor of Computer Science, University of Illinois Urbana-Champaign, Amazon ScholarVerified email at illinois.edu
Shoubin YuUNC, Chapel HillVerified email at cs.unc.edu
Prateek YadavPhD, University of North Carolina Chapel HillVerified email at cs.unc.edu
Yixin NieMeta, UNC Chapel HillVerified email at meta.com
Peter AndersonSenior AI Researcher, Balyasny Asset ManagementVerified email at bamfunds.com

Jaemin Cho

PhD Student at UNC Chapel Hill

Verified email at cs.unc.edu - Homepage

Multimodal Learning Natural Language Processing Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Unifying Vision-and-Language Tasks via Text Generation J Cho, J Lei, H Tan, M Bansal ICML, 2021	440	2021
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks YL Sung, J Cho, M Bansal CVPR, 2022	239	2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models J Cho, A Zala, M Bansal ICCV, 2023	149*	2023
A Hierarchical Latent Structure for Variational Conversation Modeling Y Park, J Cho, G Kim NAACL, 2018	128	2018
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning YL Sung, J Cho, M Bansal NeurIPS, 2022	124	2022
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi EMNLP, 2020	102	2020
Mixture Content Selection for Diverse Sequence Generation J Cho, M Seo, H Hajishirzi EMNLP, 2019	65	2019
Fine-grained Image Captioning with CLIP Reward J Cho, S Yoon, A Kale, F Dernoncourt, T Bui, M Bansal Findings of NAACL, 2022	56	2022
Self-Chained Image-Language Model for Video Localization and Question Answering S Yu, J Cho, P Yadav, M Bansal NeurIPS, 2023	48	2023
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer Z Tang, J Cho, H Tan, M Bansal NeurIPS, 2021	26	2021
Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation J Cho, A Zala, M Bansal NeurIPS, 2023	25*	2023
TVLT: Textless Vision-Language Transformer Z Tang, J Cho, Y Nie, M Bansal NeurIPS, 2022	20	2022
VideoDirectorGPT: Consistent Multi-Scene Video Generation via LLM-Guided Planning H Lin, A Zala, J Cho, M Bansal arXiv preprint arXiv:2309.15091, 2023	16	2023
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding RG Reddy, X Rui, M Li, X Lin, H Wen, J Cho, L Huang, M Bansal, A Sil, ... AAAI, 2022	16	2022
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation J Cho, Y Hu, R Garg, P Anderson, R Krishna, J Baldridge, M Bansal, ... ICLR, 2024	14	2024
Hierarchical Video-Moment Retrieval and Step-Captioning A Zala, J Cho, S Kottur, X Chen, B Oğuz, Y Mehdad, M Bansal CVPR, 2023	10	2023
Paxion: Patching Action Knowledge in Video-Language Foundation Models Z Wang, A Blume, S Li, G Liu, J Cho, Z Tang, M Bansal, H Ji NeurIPS, 2023	8	2023
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention Z Tang, J Cho, J Lei, M Bansal WACV, 2023	6	2023
Diagnostic benchmark and iterative inpainting for layout-guided image generation J Cho, L Li, Z Yang, Z Gan, L Wang, M Bansal arXiv preprint arXiv:2304.06671, 2023	2	2023
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning A Zala, H Lin, J Cho, M Bansal arXiv preprint arXiv:2310.12128, 2023	1	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors