Luowei Zhou
Luowei Zhou
Senior Researcher, Microsoft
Vahvistettu sähköpostiosoite verkkotunnuksessa - Kotisivu
Unified vision-language pre-training for image captioning and vqa
L Zhou, H Palangi, L Zhang, H Hu, J Corso, J Gao
Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 13041 …, 2020
End-to-end dense video captioning with masked transformer
L Zhou, Y Zhou, JJ Corso, R Socher, C Xiong
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Towards automatic learning of procedures from web instructional videos
L Zhou, C Xu, JJ Corso
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Grounded video description
L Zhou, Y Kalantidis, X Chen, JJ Corso, M Rohrbach
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Less is more: Clipbert for video-and-language learning via sparse sampling
J Lei, L Li, L Zhou, Z Gan, TL Berg, M Bansal, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Watch what you just said: Image captioning with text-conditional attention
L Zhou, C Xu, P Koch, JJ Corso
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, 305-313, 2017
Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction
L Zhou, N Louis, JJ Corso
British Machine Vision Conference, 2018
Multiagent reinforcement learning with sparse interactions by negotiation and knowledge transfer
L Zhou, P Yang, C Chen, Y Gao
IEEE transactions on cybernetics 47 (5), 1238-1250, 2016
Image caption generation with text-conditional semantic attention
L Zhou, C Xu, P Koch, JJ Corso
arXiv preprint arXiv:1606.04621 2, 2016
Dense video captioning
Y Zhou, L Zhou, C Xiong, R Socher
US Patent 10,542,270, 2020
Uc2: Universal cross-lingual cross-modal vision-and-language pre-training
M Zhou, L Zhou, S Wang, Y Cheng, L Li, Z Yu, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Florence: A New Foundation Model for Computer Vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
A balanced heuristic mechanism for multirobot task allocation of intelligent warehouses
L Zhou, Y Shi, J Wang, P Yang
Mathematical Problems in Engineering 2014, 2014
Value: A multi-task benchmark for video-and-language understanding evaluation
L Li, J Lei, Z Gan, L Yu, YC Chen, R Pillai, Y Cheng, L Zhou, XE Wang, ...
arXiv preprint arXiv:2106.04632, 2021
Procnets: Learning to segment procedures in untrimmed and unconstrained videos
L Zhou, C Xu, JJ Corso
arXiv preprint arXiv:1703.09788 2 (6), 7, 2017
Cluster-Former: Clustering-based Sparse Transformer for Question Answering
S Wang, L Zhou, Z Gan, YC Chen, Y Fang, S Sun, Y Cheng, J Liu
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
Dynamic graph modules for modeling object-object interactions in activity recognition
H Huang, L Zhou, W Zhang, JJ Corso, C Xu
arXiv preprint arXiv:1812.05637, 2018
Bevt: Bert pretraining of video transformers
R Wang, D Chen, Z Wu, Y Chen, X Dai, M Liu, YG Jiang, L Zhou, L Yuan
arXiv preprint arXiv:2112.01529, 2021
Cupid: Adaptive curation of pre-training data for video-and-language representation learning
L Zhou, J Liu, Y Cheng, Z Gan, L Zhang
arXiv preprint arXiv:2104.00285, 2021
CLIP-Event: Connecting Text and Images with Event Structures
M Li, R Xu, S Wang, L Zhou, X Lin, C Zhu, M Zeng, H Ji, SF Chang
arXiv preprint arXiv:2201.05078, 2022
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20