Licheng Yu 虞立成
Research Scientist and Manager, Facebook AI
Verified email at fb.com
Title
Cited by
Year
UNITER: UNiversal Image-TExt Representation Learning
YC Chen*, L Li*, L Yu*, A El Kholy, F Ahmed, Z Gan, Y Cheng, J Liu
ECCV, 2020
Cited by: 2142*; Year: 2020
Modeling context in referring expressions
L Yu, P Poirson, S Yang, AC Berg, TL Berg
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
Cited by: 921; Year: 2016
MAttNet: Modular attention network for referring expression comprehension
L Yu, Z Lin, X Shen, J Yang, X Lu, M Bansal, TL Berg
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 739; Year: 2018
TVQA: Localized, compositional video question answering
J Lei, L Yu, M Bansal, TL Berg
arXiv preprint arXiv:1809.01696, 2018
Cited by: 570; Year: 2018
HERO: Hierarchical encoder for video+language omni-representation pre-training
L Li, YC Chen, Y Cheng, Z Gan, L Yu, J Liu
arXiv preprint arXiv:2005.00200, 2020
Cited by: 438; Year: 2020
Visual madlibs: Fill in the blank description generation and question answering
L Yu, E Park, AC Berg, TL Berg
Proceedings of the IEEE International Conference on Computer Vision, 2461-2469, 2015
Cited by: 297*; Year: 2015
A joint speaker-listener-reinforcer model for referring expressions
L Yu, H Tan, M Bansal, TL Berg
Proceedings of the IEEE conference on computer vision and pattern …, 2017
Cited by: 280; Year: 2017
Learning to navigate unseen environments: Back translation with environmental dropout
H Tan, L Yu, M Bansal
arXiv preprint arXiv:1904.04195, 2019
Cited by: 278; Year: 2019
TVR: A large-scale dataset for video-subtitle moment retrieval
J Lei, L Yu, TL Berg, M Bansal
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Cited by: 215; Year: 2020
TVQA+: Spatio-temporal grounding for video question answering
J Lei, L Yu, TL Berg, M Bansal
arXiv preprint arXiv:1904.11574, 2019
Cited by: 215; Year: 2019
Vector sparse representation of color image using quaternion matrix analysis
Y Xu, L Yu, H Xu, H Zhang, T Nguyen
IEEE Transactions on Image Processing 24 (4), 1315-1329, 2015
Cited by: 152; Year: 2015
Physics-inspired garment recovery from a single-view image
S Yang, Z Pan, T Amert, K Wang, L Yu, T Berg, MC Lin
ACM Transactions on Graphics (TOG) 37 (5), 1-14, 2018
Cited by: 146*; Year: 2018
Behind the scene: Revealing the secrets of pre-trained vision-and-language models
J Cao, Z Gan, Y Cheng, L Yu, YC Chen, J Liu
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Cited by: 126; Year: 2020
VALUE: A multi-task benchmark for video-and-language understanding evaluation
L Li, J Lei, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ...
arXiv preprint arXiv:2106.04632, 2021
Cited by: 90; Year: 2021
Multi-target embodied question answering
L Yu, X Chen, G Gkioxari, M Bansal, TL Berg, D Batra
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 82; Year: 2019
Hierarchically-attentive RNN for album summarization and storytelling
L Yu, M Bansal, TL Berg
arXiv preprint arXiv:1708.02977, 2017
Cited by: 76; Year: 2017
VIOLIN: A large-scale dataset for video-and-language inference
J Liu, W Chen, Y Cheng, Z Gan, L Yu, Y Yang, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 66; Year: 2020
What is more likely to happen next? video-and-language future event prediction
J Lei, L Yu, TL Berg, M Bansal
arXiv preprint arXiv:2010.07999, 2020
Cited by: 58; Year: 2020
BachGAN: High-resolution image synthesis from salient object layout
Y Li, Y Cheng, Z Gan, L Yu, L Wang, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 45; Year: 2020
Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment
M Zhou*, L Yu*, A Singh, M Wang, Z Yu, N Zhang
arXiv preprint arXiv:2203.00242, 2022
Cited by: 30; Year: 2022
Articles 1–20