YoungJae Yu

Cited by

	All	Since 2019
Citations	3078	2944
h-index	24	24
i10-index	29	28

1100

550

275

825

2017201820192020202120222023202422 103 150 168 311 567 1043 701

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Gunhee KimProfessor, Seoul National UniversityVerified email at snu.ac.kr
Yejin ChoiUniversity of Washington / Allen Institute for Artificial IntelligenceVerified email at cs.washington.edu
Jack Hesselsamaya.aiVerified email at samaya.ai
Ximing LuUniversity of WashingtonVerified email at cs.washington.edu
Yale SongFAIR, MetaVerified email at csail.mit.edu
Rowan ZellersOpenAIVerified email at cs.washington.edu
Sangho LeePostdoctoral Researcher at the Allen Institute for AIVerified email at allenai.org
jongseok kimML Research Scientist, Twelve LabsVerified email at vision.snu.ac.kr
Jae Sung (James) ParkUniversity of WashingtonVerified email at cs.washington.edu
Ali FarhadiProfessor, Computer Science and Engineering, University of WashingtonVerified email at cs.uw.edu
Heeseung YunSeoul National UniversityVerified email at vision.snu.ac.kr
Jongwook ChoiUniversity of MichiganVerified email at umich.edu
Youngjin KimUnknotVerified email at vision.snu.ac.kr
Yunseok JangUniversity of Michigan, Ann ArborVerified email at umich.edu
Jinyoung SungKorea Advanced Institute of Science and Technology (KAIST)Verified email at kaist.ac.kr
Sang-Hun LeeSeoul National UniversityVerified email at snu.ac.kr
Joonil NaSeoul National University Graduate StudentVerified email at vision.snu.ac.kr
Hongryul AhnAssistant Professor, Division of Data Science, The University of SuwonVerified email at suwon.ac.kr
Sun KimProfessor, Seoul National University; CEO and CTO, AIGENDRUG Co. Ltd.Verified email at snu.ac.kr
Kyuri JoChungbuk National UniversityVerified email at chungbuk.ac.kr

YoungJae Yu

Allen Institute for AI, Yonsei University

Verified email at yonsei.ac.kr - Homepage

Machine Learning Computer Vision NLP


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Tgif-qa: Toward spatio-temporal reasoning in visual question answering Y Jang, Y Song, Y Yu, Y Kim, G Kim Proceedings of the IEEE conference on computer vision and pattern …, 2017	541	2017
A joint sequence fusion model for video question answering and retrieval Y Yu, J Kim, G Kim Proceedings of the European conference on computer vision (ECCV), 471-487, 2018	373	2018
Merlot: Multimodal neural script knowledge models R Zellers, X Lu, J Hessel, Y Yu, JS Park, J Cao, A Farhadi, Y Choi Advances in neural information processing systems 34, 23634-23651, 2021	351	2021
End-to-end concept word detection for video captioning, retrieval, and question answering Y Yu, H Ko, J Choi, G Kim Proceedings of the IEEE conference on computer vision and pattern …, 2017	296*	2017
Merlot reserve: Neural script knowledge through vision and language and sound R Zellers, J Lu, X Lu, Y Yu, Y Zhao, M Salehi, A Kusupati, J Hessel, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	215	2022
Neurologic a* esque decoding: Constrained text generation with lookahead heuristics X Lu, S Welleck, P West, L Jiang, J Kasai, D Khashabi, RL Bras, L Qin, ... arXiv preprint arXiv:2112.08726, 2021	133	2021
Multimodal c4: An open, billion-scale corpus of images interleaved with text W Zhu, J Hessel, A Awadalla, SY Gadre, J Dodge, A Fang, Y Yu, ... Advances in Neural Information Processing Systems 36, 2024	108	2024
Parameter efficient multimodal transformers for video representation learning S Lee, Y Yu, G Kim, T Breuel, J Kautz, Y Song arXiv preprint arXiv:2012.04124, 2020	89	2020
Supervising neural attention models for video captioning by human gaze data Y Yu, J Choi, Y Kim, K Yoo, SH Lee, G Kim Proceedings of the IEEE conference on computer vision and pattern …, 2017	86	2017
Soda: Million-scale dialogue distillation with social commonsense contextualization H Kim, J Hessel, L Jiang, P West, X Lu, Y Yu, P Zhou, RL Bras, M Alikhani, ... arXiv preprint arXiv:2212.10465, 2022	84	2022
Prosocialdialog: A prosocial backbone for conversational agents H Kim, Y Yu, L Jiang, X Lu, D Khashabi, G Kim, Y Choi, M Sap arXiv preprint arXiv:2205.12688, 2022	80	2022
Pano-avqa: Grounded audio-visual question answering on 360deg videos H Yun, Y Yu, W Yang, K Lee, G Kim Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	75	2021
Symbolic chain-of-thought distillation: Small models can also" think" step-by-step LH Li, J Hessel, Y Yu, X Ren, KW Chang, Y Choi arXiv preprint arXiv:2306.14050, 2023	73	2023
Dual compositional learning in interactive image retrieval J Kim, Y Yu, H Kim, G Kim Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 1771-1779, 2021	73	2021
A memory network approach for story-based temporal summarization of 360 videos S Lee, J Sung, Y Yu, G Kim Proceedings of the IEEE conference on computer vision and pattern …, 2018	71	2018
Video question answering with spatio-temporal reasoning Y Jang, Y Song, CD Kim, Y Yu, Y Kim, G Kim International Journal of Computer Vision 127, 1385-1412, 2019	52	2019
A deep ranking model for spatio-temporal highlight detection from a 360◦ video Y Yu, S Lee, J Na, J Kang, G Kim Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	48	2018
TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes I Jung, K Jo, H Kang, H Ahn, Y Yu, S Kim Bioinformatics 33 (23), 3827-3835, 2017	39	2017
Acav100m: Automatic curation of large-scale datasets for audio-visual video representation learning S Lee, J Chung, Y Yu, G Kim, T Breuel, G Chechik, Y Song Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	37	2021
Augmenting data for sarcasm detection with unlabeled conversation context H Lee, Y Yu, G Kim arXiv preprint arXiv:2006.06259, 2020	34	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors