Seuraa
Jiasen Lu
Jiasen Lu
Research Scientist, Allen Institute of Artificial Intelligence
Vahvistettu sähköpostiosoite verkkotunnuksessa allenai.org - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Vqa: Visual question answering
A Agrawal*, J Lu*, S Antol*, M Mitchell, CL Zitnick, D Parikh, D Batra
International Journal of Computer Vision 123 (1), 4-31, 2017
3807*2017
Vqa: Visual question answering
S Antol, A Agrawal, J Lu, M Mitchell, D Batra, C Lawrence Zitnick, ...
Proceedings of the IEEE International Conference on Computer Vision, 2425-2433, 2015
38002015
Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks
J Lu, D Batra, D Parikh, S Lee
Advances in neural information processing systems, 2019
14882019
Hierarchical question-image co-attention for visual question answering
J Lu, J Yang, D Batra, D Parikh
Advances in neural information processing systems 29, 2016
14572016
Knowing when to look: Adaptive attention via a visual sentinel for image captioning
J Lu*, C Xiong*, D Parikh, R Socher
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
12252017
Graph R-CNN for Scene Graph Generation
J Yang*, J Lu*, S Lee, D Batra, D Parikh
arXiv preprint arXiv:1808.00191, 2018
5902018
Neural Baby Talk
J Lu*, J Yang*, D Batra, D Parikh
In Proceedings of the IEEE conference on computer vision and pattern …, 2018
3942018
Parlai: A dialog research software platform
AH Miller, W Feng, A Fisch, J Lu, D Batra, A Bordes, D Parikh, J Weston
arXiv preprint arXiv:1705.06476, 2017
2762017
12-in-1: Multi-Task Vision and Language Representation Learning
J Lu*, V Goswami*, M Rohrbach, D Parikh, S Lee
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
2662019
Self-monitoring navigation agent via auxiliary progress estimation
CY Ma, J Lu, Z Wu, G AlRegib, Z Kira, R Socher, C Xiong
arXiv preprint arXiv:1901.03035, 2019
1682019
Best of both worlds: Transferring knowledge from discriminative learning to a generative visual dialog model
J Lu, A Kannan, J Yang, D Parikh, D Batra
Advances in Neural Information Processing Systems 30, 2017
1242017
A Faster Pytorch Implementation of Faster R-CNN
J Yang*, J Lu*, D Batra, D Parikh
https://github.com/jwyang/faster-rcnn.pytorch, 2018
952018
Deeper lstm and normalized cnn visual question answering model
J Lu, X Lin, D Batra, D Parikh
GitHub repository 6, 2015
742015
Sentinel gate for modulating auxiliary information in a long short-term memory (lstm) neural network
LU Jiasen, C Xiong, R Socher
US Patent 10,565,306, 2020
732020
Human action segmentation with hierarchical supervoxel consistency
J Lu, R Xu, JJ Corso
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2015
662015
X-lxmert: Paint, caption and answer questions with multi-modal transformers
J Cho, J Lu, D Schwenk, H Hajishirzi, A Kembhavi
arXiv preprint arXiv:2009.11278, 2020
462020
Spatially aware multimodal transformers for textvqa
Y Kant, D Batra, P Anderson, A Schwing, D Parikh, J Lu, H Agrawal
European Conference on Computer Vision, 715-732, 2020
392020
Emergence of compositional language with deep generational transmission
M Cogswell, J Lu, S Lee, D Parikh, D Batra
arXiv preprint arXiv:1904.09067, 2019
382019
Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition
J Yang*, J Lu*, S Lee, D Batra, D Parikh
arXiv preprint arXiv:1810.00912, 2018
322018
Container: Context aggregation network
P Gao, J Lu, H Li, R Mottaghi, A Kembhavi
arXiv preprint arXiv:2106.01401, 2021
292021
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20