Dacheng Li
Verified email at berkeley.edu
Title
Cited by
Year
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ...
Advances in Neural Information Processing Systems 36, 46595-46623, 2023
Cited by: 1961*; Year: 2023
Chatbot Arena: An open platform for evaluating LLMs by human preference
WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, H Zhang, ...
ICML 2024, 2024
Cited by: 163; Year: 2024
How Long Can Context Length of Open-Source LLMs truly Promise?
D Li, R Shao, A Xie, Y Sheng, L Zheng, J Gonzalez, I Stoica, X Ma, ...
NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023
Cited by: 109*; Year: 2023
Dual contradistinctive generative autoencoder
G Parmar, D Li, K Lee, Z Tu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 92; Year: 2021
MPCFormer: fast, performant and private Transformer inference with MPC
D Li, R Shao, H Wang, H Guo, EP Xing, H Zhang
The Eleventh International Conference on Learning Representations, 2022
Cited by: 54; Year: 2022
SLoRA: Scalable Serving of Thousands of LoRA Adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
Proceedings of Machine Learning and Systems 6, 296-311, 2024
Cited by: 50*; Year: 2024
Fairness in serving large language models
Y Sheng, S Cao, D Li, B Zhu, Z Li, D Zhuo, JE Gonzalez, I Stoica
18th USENIX Symposium on Operating Systems Design and Implementation (OSDI’24), 2023
Cited by: 20; Year: 2023
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training
D Li, R Shao, A Xie, EP Xing, X Ma, I Stoica, JE Gonzalez, H Zhang
First Conference on Language Modeling, 2024
Cited by: 15*; Year: 2024
AMP: Automatically finding model parallel strategies with heterogeneity awareness
D Li, H Wang, E Xing, H Zhang
Advances in Neural Information Processing Systems 35, 6630-6639, 2022
Cited by: 14; Year: 2022
SORRY-Bench: Systematically evaluating large language model safety refusal behaviors
T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ...
arXiv preprint arXiv:2406.14598, 2024
Cited by: 8; Year: 2024
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
F Xue, Y Chen, D Li, Q Hu, L Zhu, X Li, Y Fang, H Tang, S Yang, Z Liu, ...
arXiv preprint arXiv:2408.10188, 2024
Cited by: 3; Year: 2024
Does compressing activations help model parallel training?
S Bian, D Li, H Wang, E Xing, S Venkataraman
Proceedings of Machine Learning and Systems 6, 239-252, 2024
Cited by: 2; Year: 2024
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Y Wu, Z Zhang, J Chen, H Tang, D Li, Y Fang, L Zhu, E Xie, H Yin, L Yi, ...
arXiv preprint arXiv:2409.04429, 2024
Cited by: 1; Year: 2024
MPC-Minimized Secure LLM Inference
D Rathee, D Li, I Stoica, H Zhang, R Popa
arXiv preprint arXiv:2408.03561, 2024
Year: 2024