Jae Sung (James) Park
Verified email at cs.washington.edu
Title
Cited by
Year
MERLOT: Multimodal neural script knowledge models
R Zellers, X Lu, J Hessel, Y Yu, JS Park, J Cao, A Farhadi, Y Choi
Advances in Neural Information Processing Systems 34, 23634-23651, 2021
Cited by: 216; Year: 2021
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
JS Park, C Bhagavatula, R Mottaghi, A Farhadi, Y Choi
arXiv preprint arXiv:2004.10796, 2020
Cited by: 86; Year: 2020
Adversarial inference for multi-sentence video description
JS Park, M Rohrbach, T Darrell, A Rohrbach
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 83; Year: 2019
Natural language rationales with full-stack visual reasoning: From pixels to semantic frames to commonsense graphs
A Marasović, C Bhagavatula, JS Park, RL Bras, NA Smith, Y Choi
arXiv preprint arXiv:2010.07526, 2020
Cited by: 42; Year: 2020
Identity-aware multi-sentence video description
JS Park, T Darrell, A Rohrbach
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Cited by: 17; Year: 2020
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
J Hessel, JD Hwang, JS Park, R Zellers, C Bhagavatula, A Rohrbach, ...
European Conference on Computer Vision, 558-575, 2022
Cited by: 12; Year: 2022
Multimodal knowledge alignment with reinforcement learning
Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, P Ammanabrolu, R Zellers, ...
arXiv preprint arXiv:2205.12630, 2022
Cited by: 10; Year: 2022
LLC: Accurate, multi-purpose learnt low-dimensional binary codes
A Kusupati, M Wallingford, V Ramanujan, R Somani, JS Park, K Pillutla, ...
Advances in Neural Information Processing Systems 34, 23900-23913, 2021
Cited by: 8; Year: 2021
Exposing the limits of video-text models through contrast sets
JS Park, S Shen, A Farhadi, T Darrell, Y Choi, A Rohrbach
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
Cited by: 7; Year: 2022