Maxmin Q-learning: Controlling the Estimation Bias of Q-learning Q Lan, Y Pan, A Fyshe, M White International Conference on Learning Representations (ICLR), 2020., 2020 | 67 | 2020 |
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online Y Pan, K Banman, W Martha International Conference on Learning Representations (ICLR), 2021., 2019 | 52* | 2019 |
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains Y Pan, M Zaheer, A White, A Patterson, M White International Joint Conference on Artificial Intelligence (IJCAI), 2019., 2019 | 43 | 2019 |
Accelerated gradient temporal difference learning Y Pan, A White, M White Thirty-First AAAI Conference on Artificial Intelligence, 2017 | 24 | 2017 |
Incremental truncated LSTD C Gehring, Y Pan, M White Proceedings of the Twenty-Fifth International Joint Conference on Artificial …, 2015 | 14 | 2015 |
Hill climbing on value estimates for search-control in dyna Y Pan, H Yao, A Farahmand, M White International Joint Conference on Artificial Intelligence (IJCAI), 2019., 2019 | 12 | 2019 |
Frequency-based Search-control in Dyna Y Pan, J Mei, A Farahmand International Conference on Learning Representations (ICLR), 2020., 2020 | 9 | 2020 |
Effective sketching methods for value function approximation Y Pan, ES Azer, M White Uncertainty in Artificial Intelligence (UAI), 2017., 2017 | 9 | 2017 |
Actor-expert: A framework for using action-value methods in continuous action spaces S Lim, A Joseph, L Le, Y Pan, M White arXiv preprint arXiv:1810.09103 22, 2018 | 8 | 2018 |
Adapting kernel representations online using submodular maximization M Schlegel, Y Pan, J Chen, M White International Conference on Machine Learning, 3037-3046, 2017 | 6 | 2017 |
Reinforcement learning with function-valued action spaces for partial differential equation control Y Pan, A Farahmand, M White, S Nabi, P Grover, D Nikovski International Conference on Machine Learning, 3986-3995, 2018 | 5 | 2018 |
An implicit function learning approach for parametric modal regression Y Pan, E Imani, A Farahmand, M White Advances in Neural Information Processing Systems 33, 2020 | 4 | 2020 |
Understanding and Mitigating the Limitations of Prioritized Experience Replay Y Pan, J Mei, A Farahmand, M White, H Yao, M Rohani, J Luo Uncertainty in Artificial Intelligence (UAI), 2022., 2022 | 1* | 2022 |
An Alternate Policy Gradient Estimator for Softmax Policies S Garg, S Tosatto, Y Pan, M White, AR Mahmood arXiv preprint arXiv:2112.11622, 2021 | | 2021 |
Improving Sample Efficiency of Online Temporal Difference Learning Y Pan | | 2021 |
Beyond Prioritized Replay: Sampling States in Model-Based Reinforcement Learning via Simulated Priorities J Mei, Y Pan, A Farahmand, H Yao, M White arXiv preprint arXiv:2007.09569, 2020 | | 2020 |
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online Download PDF Y Pan, K Banman, M White | | |
Making Policy Gradient Estimators for Softmax Policies More Robust to Non-stationarities S Garg, S Tosatto, Y Pan, M White, AR Mahmood | | |
Actor-Expert: A Framework for using Q-learning in Continuous Action Spaces Download PDF S Lim, A Joseph, L Le, Y Pan, M White | | |