Seuraa
Yining Li
Yining Li
Shanghai AI Laboratory
Vahvistettu sähköpostiosoite verkkotunnuksessa pjlab.org.cn - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Learning deep representation for imbalanced classification
C Huang, Y Li, CC Loy, X Tang
Proceedings of the IEEE conference on computer vision and pattern …, 2016
13312016
Deep imbalanced learning for face recognition and attribute prediction
C Huang, Y Li, CC Loy, X Tang
IEEE transactions on pattern analysis and machine intelligence 42 (11), 2781 …, 2019
4132019
Openmmlab pose estimation toolbox and benchmark
MMP Contributors
3932020
Internlm2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
2852024
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
2592024
Dense intrinsic appearance flow for human pose transfer
Y Li, C Huang, CC Loy
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
2212019
Human attribute recognition by deep hierarchical contexts
Y Li, C Huang, CC Loy, X Tang
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
2102016
Rtmpose: Real-time multi-person pose estimation based on mmpose
T Jiang, P Lu, L Zhang, N Ma, R Han, C Lyu, Y Li, K Chen
arXiv preprint arXiv:2303.07399, 2023
1912023
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
Advances in Neural Information Processing Systems 37, 42566-42592, 2024
1232024
Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output
P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ...
arXiv preprint arXiv:2407.03320, 2024
992024
Omg-seg: Is one model good enough for all segmentation?
X Li, H Yuan, W Li, H Ding, S Wu, W Zhang, Y Li, K Chen, CC Loy
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2024
552024
Open-vocabulary SAM: Segment and recognize twenty-thousand classes interactively
H Yuan, X Li, C Zhou, Y Li, K Chen, CC Loy
European Conference on Computer Vision, 419-437, 2024
412024
Mmbench-video: A long-form multi-shot benchmark for holistic video understanding
X Fang, K Mao, H Duan, X Zhao, Y Li, D Lin, K Chen
Advances in Neural Information Processing Systems 37, 89098-89124, 2024
402024
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation
P Lu, T Jiang, Y Li, X Li, K Chen, W Yang
Proceedings of the IEEE conference on computer vision and pattern recognition, 2024
382024
Learning to disambiguate by asking discriminative questions
Y Li, C Huang, X Tang, C Change Loy
Proceedings of the IEEE International Conference on Computer Vision, 3419-3428, 2017
312017
An open and comprehensive pipeline for unified object grounding and detection
X Zhao, Y Chen, S Xu, X Li, X Wang, Y Li, H Huang
arXiv preprint arXiv:2401.02361, 2024
292024
Motionbooth: Motion-aware customized text-to-video generation
J Wu, X Li, Y Zeng, J Zhang, Q Zhou, Y Li, Y Tong, K Chen
arXiv preprint arXiv:2406.17758, 2024
232024
Towards language-driven video inpainting via multimodal large language models
J Wu, X Li, C Si, S Zhou, J Yang, J Zhang, Y Li, K Chen, Y Tong, Z Liu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
202024
Dst-det: Simple dynamic self-training for open-vocabulary object detection
S Xu, X Li, S Wu, W Zhang, Y Tong, CC Loy
arXiv preprint arXiv:2310.01393, 2023
132023
Mg-llava: Towards multi-granularity visual instruction tuning
X Zhao, X Li, H Duan, H Huang, Y Li, K Chen, H Yang
arXiv preprint arXiv:2406.17770, 2024
102024
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20