Seuraa
Zhenghai Xue
Zhenghai Xue
Vahvistettu sähköpostiosoite verkkotunnuksessa e.ntu.edu.sg - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Metadrive: Composing diverse driving scenarios for generalizable reinforcement learning
Q Li, Z Peng, L Feng, Q Zhang, Z Xue, B Zhou
IEEE transactions on pattern analysis and machine intelligence 45 (3), 3461-3475, 2022
1332022
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning
XH Liu, Z Xue, J Pang, S Jiang, F Xu, Y Yu
Advances in Neural Information Processing Systems 34, 17604-17615, 2021
292021
PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement
W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, B An
Proceedings of the 29th ACM SIGKDD International Conference on Knowledge …, 2023
17*2023
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ...
The Web Conference 2023 Research Track, 2023
162023
A Large Language Model Enhanced Conversational Recommender System
Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun
arXiv preprint arXiv:2308.06212, 2023
102023
State Regularized Policy Optimization on Data with Dynamics Shift
Z Xue, Q Cai, S Liu, D Zheng, P Jiang, K Gai, B An
Advances in Neural Information Processing Systems 36, 32926--32937, 2023
42023
Guarded Policy Optimization with Imperfect Online Demonstrations
Z Xue, Z Peng, Q Li, Z Liu, B Zhou
The Eleventh International Conference on Learning Representations, 2023
32023
AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement
Z Xue, Q Cai, T Zuo, B Yang, L Hu, P Jiang, K Gai, B An
arXiv preprint arXiv:2310.03984, 2023
22023
AgentStudio: A Toolkit for Building General Virtual Agents
L Zheng, Z Huang, Z Xue, X Wang, B An, S Yan
arXiv preprint arXiv:2403.17918, 2024
2024
: Energy-Based Reinforcement Learning with Stein Soft Actor Critc
S Messaoud, B Mokeddem, Z Xue, B An, H Chen, S Chawla
The Twelfth International Conference on Learning Representations, 2024
2024
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–10