Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks X Li, X Yin, C Li, P Zhang, X Hu, L Zhang, L Wang, H Hu, L Dong, F Wei, ... Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 2124 | 2020 |
Stacked cross attention for image-text matching KH Lee, X Chen, G Hua, H Hu, X He Proceedings of the European conference on computer vision (ECCV), 201-216, 2018 | 1394 | 2018 |
Unified vision-language pre-training for image captioning and vqa L Zhou, H Palangi, L Zhang, H Hu, J Corso, J Gao Proceedings of the AAAI conference on artificial intelligence 34 (07), 13041 …, 2020 | 993 | 2020 |
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 882 | 2021 |
Elevater: A benchmark and toolkit for evaluating language-augmented visual models C Li, H Liu, L Li, P Zhang, J Aneja, J Yang, P Jin, H Hu, Z Liu, YJ Lee, ... Advances in Neural Information Processing Systems 35, 9287-9301, 2022 | 136 | 2022 |
Web-scale responsive visual search at bing H Hu, Y Wang, L Yang, P Komlev, L Huang, X Chen, J Huang, Y Wu, ... Proceedings of the 24th ACM SIGKDD international conference on knowledge …, 2018 | 73 | 2018 |
Plasmonic dark field microscopy H Hu, C Ma, Z Liu Applied Physics Letters 96 (11), 2010 | 67 | 2010 |
Florence-2: Advancing a unified representation for a variety of vision tasks B Xiao, H Wu, W Xu, X Dai, H Hu, Y Lu, M Zeng, C Liu, L Yuan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 50 | 2024 |
Product identification in image with multiple products H Hu, L Huang US Patent 10,902,051, 2021 | 49 | 2021 |
Learning visual relation priors for image-text matching and image captioning with neural scene graph generators KH Lee, H Palangi, X Chen, H Hu, J Gao arXiv preprint arXiv:1909.09953, 2019 | 42 | 2019 |
System and method for attribute-based visual search over a computer communication network L Huang, M Merchant, H Hu, A Sacheti US Patent 11,120,070, 2021 | 36 | 2021 |
Image scene graph generation (sgg) benchmark X Han, J Yang, H Hu, L Zhang, J Gao, P Zhang arXiv preprint arXiv:2107.12604, 2021 | 32 | 2021 |
Mmptrack: Large-scale densely annotated multi-camera multiple people tracking benchmark X Han, Q You, C Wang, Z Zhang, P Chu, H Hu, J Wang, Z Liu Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 23 | 2023 |
Transforming audio content into images L Huang, H Hu, C Su US Patent 10,891,969, 2021 | 20 | 2021 |
Training small multimodal models to bridge biomedical competency gap: A case study in radiology imaging JMZ Chaves, SC Huang, Y Xu, H Xu, N Usuyama, S Zhang, F Wang, ... CoRR, 2024 | 17 | 2024 |
Generating and applying an object-level relational index for images K Wu, S Yiran, H Hu, S Sreepada, A Sacheti, MD Gupta, RR Gandhi, ... US Patent 11,182,408, 2021 | 16 | 2021 |
An universal image attractiveness ranking framework N Ma, A Volkov, A Livshits, P Pietrusinski, H Hu, M Bolin 2019 IEEE winter conference on applications of computer vision (WACV), 657-665, 2019 | 16 | 2019 |
Stacked cross-modal matching KH Lee, G Hua, X Chen, H Hu, H Xiaodong US Patent 11,093,560, 2021 | 15 | 2021 |
Florence: A new foundation model for computer vision. arXiv 2021 L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 14 | 2021 |
Object detection from image content A Sacheti, X Chen, H Hu, L Huang, J Huang, M Merchant US Patent App. 15/900,606, 2019 | 14 | 2019 |