Dhruv Batra
Georgia Tech and Facebook AI Research
Verified email at gatech.edu
Title
Cited by
Year
Grad-CAM: Visual explanations from deep networks via gradient-based localization
RR Selvaraju, M Cogswell, A Das, R Vedantam, D Parikh, D Batra
Proceedings of the IEEE international conference on computer vision, 618-626, 2017
Cited by: 4594, Year: 2017
VQA: Visual Question Answering
S Antol, A Agrawal, J Lu, M Mitchell, D Batra, C Lawrence Zitnick, ...
Proceedings of the IEEE International Conference on Computer Vision, 2425-2433, 2015
Cited by: 2613, Year: 2015
Hierarchical question-image co-attention for visual question answering
J Lu, J Yang, D Batra, D Parikh
arXiv preprint arXiv:1606.00061, 2016
Cited by: 1029, Year: 2016
Making the V in VQA matter: Elevating the role of image understanding in visual question answering
Y Goyal, T Khot, D Summers-Stay, D Batra, D Parikh
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
Cited by: 828, Year: 2017
Visual dialog
A Das, S Kottur, K Gupta, A Singh, D Yadav, JMF Moura, D Parikh, ...
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
Cited by: 561, Year: 2017
iCoseg: Interactive co-segmentation with intelligent scribble guidance
D Batra, A Kowdle, D Parikh, J Luo, T Chen
2010 IEEE Computer Society Conference on Computer Vision and Pattern …, 2010
Cited by: 527, Year: 2010
Joint unsupervised learning of deep representations and image clusters
J Yang, D Parikh, D Batra
Proceedings of the IEEE conference on computer vision and pattern …, 2016
Cited by: 457, Year: 2016
ViLBERT: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks
J Lu, D Batra, D Parikh, S Lee
arXiv preprint arXiv:1908.02265, 2019
Cited by: 422, Year: 2019
A comparative study of modern inference techniques for structured discrete energy minimization problems
JH Kappes, B Andres, FA Hamprecht, C Schnörr, S Nowozin, D Batra, ...
International Journal of Computer Vision 115 (2), 155-184, 2015
Cited by: 373*, Year: 2015
Learning cooperative visual dialog agents with deep reinforcement learning
A Das, S Kottur, JMF Moura, S Lee, D Batra
Proceedings of the IEEE international conference on computer vision, 2951-2960, 2017
Cited by: 319, Year: 2017
Diverse M-best solutions in Markov random fields
D Batra, P Yadollahpour, A Guzman-Rivera, G Shakhnarovich
European Conference on Computer Vision, 1-16, 2012
Cited by: 305, Year: 2012
Graph R-CNN for scene graph generation
J Yang, J Lu, S Lee, D Batra, D Parikh
Proceedings of the European conference on computer vision (ECCV), 670-685, 2018
Cited by: 304, Year: 2018
Human attention in visual question answering: Do humans and deep networks look at the same regions?
A Das, H Agrawal, L Zitnick, D Parikh, D Batra
Computer Vision and Image Understanding 163, 90-100, 2017
Cited by: 300, Year: 2017
Embodied question answering
A Das, S Datta, G Gkioxari, S Lee, D Parikh, D Batra
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by: 288, Year: 2018
Reducing overfitting in deep networks by decorrelating representations
M Cogswell, F Ahmed, R Girshick, L Zitnick, D Batra
arXiv preprint arXiv:1511.06068, 2015
Cited by: 266, Year: 2015
A corpus and cloze evaluation for deeper understanding of commonsense stories
N Mostafazadeh, N Chambers, X He, D Parikh, D Batra, L Vanderwende, ...
Proceedings of the 2016 Conference of the North American Chapter of the …, 2016
Cited by: 256, Year: 2016
Neural baby talk
J Lu, J Yang, D Batra, D Parikh
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 243, Year: 2018
Habitat: A platform for embodied AI research
M Savva, A Kadian, O Maksymets, Y Zhao, E Wijmans, B Jain, J Straub, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 228, Year: 2019
Don't just assume; look and answer: Overcoming priors for visual question answering
A Agrawal, D Batra, D Parikh, A Kembhavi
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by: 223, Year: 2018
Visual Storytelling
THK Huang, F Ferraro, N Mostafazadeh, I Misra, A Agrawal, J Devlin, ...
NAACL, 2016
Cited by: 221, Year: 2016