Follow
Xiuyuan Lu
Xiuyuan Lu
Google DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Ensemble sampling
X Lu, B Van Roy
Advances in Neural Information Processing Systems 31, 2017
1372017
Epistemic neural networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Advances in Neural Information Processing Systems 37, 2023
772023
Reinforcement learning, bit by bit
X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen
Foundations and TrendsŪ in Machine Learning 16 (6), 733-865, 2023
622023
Information-theoretic confidence bounds for reinforcement learning
X Lu, B Van Roy
Advances in Neural Information Processing Systems 33, 2019
532019
Hypermodels for exploration
V Dwaracherla, X Lu, M Ibrahimi, I Osband, Z Wen, B Van Roy
International Conference on Learning Representations, 2020
432020
Efficient online recommendation via low-rank ensemble sampling
X Lu, Z Wen, B Kveton
Proceedings of the 12th ACM Conference on Recommender Systems, 460-464, 2018
202018
An analysis of ensemble sampling
C Qin, Z Wen, X Lu, B Van Roy
Advances in Neural Information Processing Systems 36, 2022
172022
The neural testbed: Evaluating joint predictions
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ...
Advances in Neural Information Processing Systems 36, 12554-12565, 2022
152022
Ensembles for uncertainty estimation: Benefits of prior functions and bootstrapping
V Dwaracherla, Z Wen, I Osband, X Lu, SM Asghari, B Van Roy
Transactions on Machine Learning Research, 2023
112023
From predictions to decisions: The importance of joint predictive distributions
Z Wen, I Osband, C Qin, X Lu, M Ibrahimi, V Dwaracherla, M Asghari, ...
arXiv preprint arXiv:2107.09224, 2021
82021
Approximate Thompson Sampling via Epistemic Neural Networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence, 2023
72023
Evaluating High-Order Predictive Distributions in Deep Learning
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, B Van Roy
Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, 2022
62022
Information-directed sampling for reinforcement learning
X Lu
Stanford University, 2020
42020
Robustness of epinets against distributional shifts
X Lu, I Osband, SM Asghari, S Gowal, V Dwaracherla, Z Wen, B Van Roy
arXiv preprint arXiv:2207.00137, 2022
12022
RLHF and IIA: Perverse Incentives
W Xu, S Dong, X Lu, G Lam, Z Wen, B Van Roy
arXiv e-prints, arXiv: 2312.01057, 2023
2023
Exploration using hyper-models
B Van Roy, X Lu, VR Dwaracherla, Z Wen, M Ibrahimi, IDM Osband
US Patent App. 17/639,504, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–16