Seuraa
Yangchen Pan
Yangchen Pan
Vahvistettu sähköpostiosoite verkkotunnuksessa ualberta.ca - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Q Lan, Y Pan, A Fyshe, M White
International Conference on Learning Representations (ICLR), 2020., 2020
672020
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
Y Pan, K Banman, W Martha
International Conference on Learning Representations (ICLR), 2021., 2019
52*2019
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
International Joint Conference on Artificial Intelligence (IJCAI), 2019., 2019
432019
Accelerated gradient temporal difference learning
Y Pan, A White, M White
Thirty-First AAAI Conference on Artificial Intelligence, 2017
242017
Incremental truncated LSTD
C Gehring, Y Pan, M White
Proceedings of the Twenty-Fifth International Joint Conference on Artificial …, 2015
142015
Hill climbing on value estimates for search-control in dyna
Y Pan, H Yao, A Farahmand, M White
International Joint Conference on Artificial Intelligence (IJCAI), 2019., 2019
122019
Frequency-based Search-control in Dyna
Y Pan, J Mei, A Farahmand
International Conference on Learning Representations (ICLR), 2020., 2020
92020
Effective sketching methods for value function approximation
Y Pan, ES Azer, M White
Uncertainty in Artificial Intelligence (UAI), 2017., 2017
92017
Actor-expert: A framework for using action-value methods in continuous action spaces
S Lim, A Joseph, L Le, Y Pan, M White
arXiv preprint arXiv:1810.09103 22, 2018
82018
Adapting kernel representations online using submodular maximization
M Schlegel, Y Pan, J Chen, M White
International Conference on Machine Learning, 3037-3046, 2017
62017
Reinforcement learning with function-valued action spaces for partial differential equation control
Y Pan, A Farahmand, M White, S Nabi, P Grover, D Nikovski
International Conference on Machine Learning, 3986-3995, 2018
52018
An implicit function learning approach for parametric modal regression
Y Pan, E Imani, A Farahmand, M White
Advances in Neural Information Processing Systems 33, 2020
42020
Understanding and Mitigating the Limitations of Prioritized Experience Replay
Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo
Uncertainty in Artificial Intelligence (UAI), 2022., 2022
1*2022
An Alternate Policy Gradient Estimator for Softmax Policies
S Garg, S Tosatto, Y Pan, M White, AR Mahmood
arXiv preprint arXiv:2112.11622, 2021
2021
Improving Sample Efficiency of Online Temporal Difference Learning
Y Pan
2021
Beyond Prioritized Replay: Sampling States in Model-Based Reinforcement Learning via Simulated Priorities
J Mei, Y Pan, A Farahmand, H Yao, M White
arXiv preprint arXiv:2007.09569, 2020
2020
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online Download PDF
Y Pan, K Banman, M White
Making Policy Gradient Estimators for Softmax Policies More Robust to Non-stationarities
S Garg, S Tosatto, Y Pan, M White, AR Mahmood
Actor-Expert: A Framework for using Q-learning in Continuous Action Spaces Download PDF
S Lim, A Joseph, L Le, Y Pan, M White
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–19