Video paragraph captioning using hierarchical recurrent neural networks H Yu, J Wang, Z Huang, Y Yang, W Xu Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 475 | 2016 |
Recognize human activities from partially observed videos Y Cao, D Barrett, A Barbu, S Narayanaswamy, H Yu, A Michaux, Y Lin, ... Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013 | 187 | 2013 |
Grounded Language Learning from Video Described with Sentences H Yu, JM Siskind The annual meeting of the Association for Computational Linguistics (ACL), 2013 | 150 | 2013 |
One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizers AS Morcos, H Yu, M Paganini, Y Tian arXiv preprint arXiv:1906.02773, 2019 | 58 | 2019 |
Interactive grounded language acquisition and generalization in a 2D world H Yu, H Zhang, W Xu arXiv preprint arXiv:1802.01433, 2018 | 39 | 2018 |
Automatic interesting object extraction from images using complementary saliency maps H Yu, J Li, Y Tian, T Huang Proceedings of the 18th ACM international conference on Multimedia, 891-894, 2010 | 39 | 2010 |
Resource-efficient neural architect Y Zhou, S Ebrahimi, SÖ Arık, H Yu, H Liu, G Diamos arXiv preprint arXiv:1806.07912, 2018 | 38 | 2018 |
A compositional framework for grounding language inference, generation, and acquisition in video H Yu, N Siddharth, A Barbu, JM Siskind Journal of Artificial Intelligence Research 52, 601-713, 2015 | 38 | 2015 |
A deep compositional framework for human-like language acquisition in virtual environment H Yu, H Zhang, W Xu arXiv preprint arXiv:1703.09831, 2017 | 30 | 2017 |
Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP H Yu, S Edunov, Y Tian, AS Morcos arXiv preprint arXiv:1906.02768, 2019 | 29 | 2019 |
Salient region detection and segmentation for general object recognition and image understanding TJ Huang, YH Tian, J Li, HN Yu Science China Information Sciences 54 (12), 2461-2470, 2011 | 28 | 2011 |
Systems and methods for video paragraph captioning using hierarchical recurrent neural networks H Yu, J Wang, Z Huang, Y Yang, W Xu US Patent 10,395,118, 2019 | 25 | 2019 |
Correlating videos and sentences JM Siskind, A Barbu, S Narayanaswamy, H Yu US Patent 9,183,466, 2015 | 18 | 2015 |
Driving under the influence (of language) DP Barrett, SA Bronikowski, H Yu, JM Siskind IEEE Transactions on Neural Networks and Learning Systems 29 (7), 2668-2683, 2017 | 11 | 2017 |
Robot language learning, generation, and comprehension DP Barrett, SA Bronikowski, H Yu, JM Siskind arXiv preprint arXiv:1508.06161, 2015 | 11 | 2015 |
Guided feature transformation (GFT): A neural language grounding module for embodied agents H Yu, X Lian, H Zhang, W Xu Conference on Robot Learning, 81-98, 2018 | 9 | 2018 |
Listen, interact and talk: Learning to speak via interaction H Zhang, H Yu, W Xu arXiv preprint arXiv:1705.09906, 2017 | 9 | 2017 |
CraftAssist: A framework for dialogue-enabled interactive agents J Gray, K Srinet, Y Jernite, H Yu, Z Chen, D Guo, S Goyal, CL Zitnick, ... arXiv preprint arXiv:1907.08584, 2019 | 8 | 2019 |
Sentence directed video object codiscovery H Yu, JM Siskind International Journal of Computer Vision 124 (3), 312-334, 2017 | 8 | 2017 |
Learning to Describe Video with Weak Supervision by Exploiting Negative Sentential Information H Yu, JM Siskind AAAI Conference on Artificial Intelligence, 2015 | 8 | 2015 |