FILIP: fine-grained interactive language-image pre-training L Yao*, R Huang*, L Hou*, G Lu, M Niu, H Xu, X Liang, Z Li, X Jiang, C Xu arXiv preprint arXiv:2111.07783, 2021 | 396 | 2021 |
Wukong: 100 million large-scale chinese cross-modal pre-training dataset and a foundation framework HX Jiaxi Gu, Xiaojun Meng, Guansong Lu, Lu Hou, Minzhe Niu, Xiaodan Liang ... arXiv preprint arXiv:2202.06767, 2022 | 68* | 2022 |
Deep Feature Fusion with Multiple Granularity for Vehicle Re-identification. P Huang, R Huang, J Huang, R Yangchen, Z He, X Li, J Chen CVPR workshops, 80-88, 2019 | 24 | 2019 |
Nlip: Noise-robust language-image pre-training R Huang, Y Long, J Han, H Xu, X Liang, C Xu, X Liang Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 926-934, 2023 | 12 | 2023 |
Fine-Grained Visual–Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection Y Long, J Han, R Huang, H Xu, Y Zhu, C Xu, X Liang IEEE Transactions on Neural Networks and Learning Systems, 2023 | 7 | 2023 |
Boosting visual-language models by exploiting hard samples H Wang, M Huang, R Huang, L Hong, H Xu, T Hu, X Liang, Z Li arXiv preprint arXiv:2305.05208, 2023 | 4 | 2023 |
Growclip: Data-aware automatic model growing for large-scale contrastive language-image pre-training X Deng, H Shi, R Huang, C Li, H Xu, J Han, J Kwok, S Zhao, W Zhang, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 2 | 2023 |
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model R Huang, K Cai, J Han, X Liang, R Pei, G Lu, S Xu, W Zhang, H Xu arXiv preprint arXiv:2403.11929, 2024 | | 2024 |
SYSTEM AND METHOD FOR CROSS-MODAL INTERACTION BASED ON PRE-TRAINED MODEL H XU, L Hou, G LU, M Niu, Z LI, R Huang, L Yao, C XU, X Liang US Patent App. 17/900,592, 2024 | | 2024 |
UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning X Dong, R Huang, X Wei, Z Jie, J Yu, J Yin, X Liang arXiv preprint arXiv:2306.00813, 2023 | | 2023 |
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability R Huang, J Han, G Lu, X Liang, Y Zeng, W Zhang, H Xu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | | 2023 |