Geolocalized modeling for dish recognition R Xu, L Herranz, S Jiang, S Wang, X Song, R Jain IEEE Transactions on Multimedia 17 (8), 1187-1199, 2015 | 69 | 2015 |
Depth CNNs for RGB-D scene recognition: Learning from scratch better than transferring from RGB-CNNs X Song, L Herranz, S Jiang Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017 | 46 | 2017 |
Multi-scale multi-feature context modeling for scene recognition in the semantic manifold X Song, S Jiang, L Herranz IEEE Transactions on Image Processing 26 (6), 2721-2735, 2017 | 38 | 2017 |
Joint multi-feature spatial context for scene recognition on the semantic manifold X Song, S Jiang, L Herranz Proceedings of the IEEE conference on computer vision and pattern …, 2015 | 31 | 2015 |
Learning effective RGB-D representations for scene recognition X Song, S Jiang, L Herranz, C Chen IEEE Transactions on Image Processing 28 (2), 980-993, 2018 | 18 | 2018 |
Combining Models from Multiple Sources for RGB-D Scene Recognition X Song, S Jiang, L Herranz IJCAI, 4523-4529, 2017 | 18 | 2017 |
Image captioning with both object and scene information X Li, X Song, L Herranz, Y Zhu, S Jiang Proceedings of the 24th ACM International Conference on Multimedia, 1107-1110, 2016 | 17 | 2016 |
Category co-occurrence modeling for large scale scene recognition X Song, S Jiang, L Herranz, Y Kong, K Zheng Pattern Recognition 59, 98-111, 2016 | 14 | 2016 |
Relative image similarity learning with contextual information for Internet cross-media retrieval S Jiang, X Song, Q Huang Multimedia Systems 20 (6), 645-657, 2014 | 12 | 2014 |
RGB-D scene recognition with object-to-object relation X Song, C Chen, S Jiang Proceedings of the 25th ACM International Conference on Multimedia, 600-608, 2017 | 10 | 2017 |
Image captioning via semantic element embedding X Zhang, S He, X Song, RWH Lau, J Jiao, Q Ye Neurocomputing 395, 212-221, 2020 | 7 | 2020 |
Rich image description based on regions X Zhang, X Song, X Lv, S Jiang, Q Ye, J Jiao Proceedings of the 23rd ACM International Conference on Multimedia, 1315-1318, 2015 | 6 | 2015 |
Image representations with spatial object-to-object relations for RGB-D scene recognition X Song, S Jiang, B Wang, C Chen, G Chen IEEE Transactions on Image Processing 29, 525-537, 2019 | 5 | 2019 |
Deep patch representations with shared codebook for scene classification S Jiang, G Chen, X Song, L Liu ACM Transactions on Multimedia Computing, Communications, and Applications …, 2019 | 5 | 2019 |
Keyword-driven image captioning via Context-dependent Bilateral LSTM X Zhang, S He, X Song, P Wei, S Jiang, Q Ye, J Jiao, RWH Lau 2017 IEEE International Conference on Multimedia and Expo (ICME), 781-786, 2017 | 5 | 2017 |
Scene recognition with prototype-agnostic scene layout G Chen, X Song, H Zeng, S Jiang IEEE Transactions on Image Processing 29, 5877-5888, 2020 | 4 | 2020 |
Joint Learning of CNN and LSTM for Image Captioning Y Zhu, X Li, X Li, J Sun, X Song, S Jiang CLEF (Working Notes), 421-427, 2016 | 4 | 2016 |
MIAR ICT Participation at Robot Vision 2013 R Xu, S Jiang, X Song, S Wang, Y Xie, F Wang, X Lv CLEF (Working Notes), 2013 | 4 | 2013 |
Semantic features for food image recognition with geo-constraints X Song, S Jiang, R Xu, L Herranz 2014 IEEE International Conference on Data Mining Workshop, 1020-1025, 2014 | 3 | 2014 |
Generalized Zero-shot Learning with Multi-source Semantic Embeddings for Scene Recognition X Song, H Zeng, S Zhang, L Herranz, S Jiang Proceedings of the 28th ACM International Conference on Multimedia, 3976-3985, 2020 | 2 | 2020 |