Qingpeng Cai
Kuaishou Technology
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems
L Pan, Q Cai, Z Fang, P Tang, L Huang
Cited by: 173; Year: 2020
Reinforcement Mechanism Design for e-commerce
Q Cai, A Filos-Ratsikas, P Tang, Y Zhang
Proceedings of the 2018 World Wide Web Conference, 1339-1348, 2018
Cited by: 80; Year: 2018
Softmax Deep Double Deterministic Policy Gradients
L Pan, Q Cai, L Huang
Cited by: 77; Year: 2020
Reinforcement learning with dynamic boltzmann softmax updates
L Pan, Q Cai, Q Meng, W Chen, L Huang
Cited by: 42; Year: 2019
Facility location with minimax envy
Q Cai, A Filos-Ratsikas, P Tang
IJCAI 2016, 137-143, 2016
Cited by: 42; Year: 2016
Reinforcement mechanism design for fraudulent behaviour in e-commerce
Q Cai, A Filos-Ratsikas, P Tang, Y Zhang
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 39; Year: 2018
Policy gradients for contextual recommendations
F Pan, Q Cai, P Tang, F Zhuang, Q He
The World Wide Web Conference, 1421-1431, 2019
Cited by: 36; Year: 2019
Reinforcement Learning Driven Heuristic Optimization
Q Cai, W Hang, A Mirhoseini, G Tucker, J Wang, W Wei
Cited by: 35; Year: 2019
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ...
Cited by: 23*; Year: 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Q Cai, S Liu, X Wang, T Zuo, W Xie, B Yang, D Zheng, P Jiang, K Gai
Cited by: 22; Year: 2023
Multi-Task Recommendations with Reinforcement Learning
Z Liu, J Tian, Q Cai, X Zhao, J Gao, S Liu, D Chen, T He, D Zheng, P Jiang, ...
Cited by: 17; Year: 2023
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement
W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, K Gai, B An
Cited by: 17*; Year: 2023
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
W Xue, Q Cai, R Zhan, D Zheng, P Jiang, B An
Cited by: 16; Year: 2023
Policy optimization with model-based explorations
F Pan, Q Cai, AX Zeng, CX Pan, Q Da, H He, Q He, P Tang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4675-4682, 2019
Cited by: 15; Year: 2019
Mechanism design for personalized recommender systems
Q Cai, A Filos-Ratsikas, C Liu, P Tang
Proceedings of the 10th ACM Conference on Recommender Systems, 159-166, 2016
Cited by: 13; Year: 2016
Exploration and Regularization of the Latent Action Space in Recommendation
S Liu, Q Cai, B Sun, Y Wang, J Jiang, D Zheng, K Gai, P Jiang, X Zhao, ...
Cited by: 12; Year: 2023
A large language model enhanced conversational recommender system
Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun
arXiv preprint arXiv:2308.06212, 2023
Cited by: 10; Year: 2023
Generator and critic: A deep reinforcement learning approach for slate re-ranking in e-commerce
J Wei, A Zeng, Y Wu, P Guo, Q Hua, Q Cai
arXiv preprint arXiv:2005.12206, 2020
Cited by: 9; Year: 2020
Generative Flow Network for Listwise Recommendation
S Liu, Q Cai, Z He, B Sun, J McAuley, D Zheng, P Jiang, K Gai
Cited by: 7; Year: 2023
KuaiSim: A Comprehensive Simulator for Recommender Systems
K Zhao, S Liu, Q Cai, X Zhao, Z Liu, D Zheng, P Jiang, K Gai
Cited by: 6; Year: 2023