Seuraa
Martha White
Martha White
Vahvistettu sähköpostiosoite verkkotunnuksessa ualberta.ca - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Off-Policy Actor-Critic
T Degris, M White, RS Sutton
Twenty-Ninth International Conference on Machine Learning, 2012
5192012
Meta-learning representations for continual learning
K Javed, M White
Advances in Neural Information Processing Systems 32, 2019
2582019
An emphatic approach to the problem of off-policy temporal-difference learning
RS Sutton, AR Mahmood, M White
The Journal of Machine Learning Research 17 (1), 2603-2631, 2016
2422016
Supervised autoencoders: Improving generalization performance with unsupervised regularizers
L Le, A Patterson, M White
Advances in neural information processing systems 31, 2018
2052018
Convex multi-view subspace learning
M White, X Zhang, D Schuurmans, Y Yu
Advances in neural information processing systems 25, 2012
1812012
Estimating the class prior and posterior from noisy positives and unlabeled data
S Jain, M White, P Radivojac
Advances in neural information processing systems 29, 2016
1142016
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning
Q Lan, Y Pan, A Fyshe, M White
International Conference on Learning Representations, 2020
1012020
Unifying task specification in reinforcement learning
M White
International Conference on Machine Learning, 2016
922016
An off-policy policy gradient theorem using emphatic weightings
E Imani, E Graves, M White
Advances in Neural Information Processing Systems 31, 2018
692018
Recovering true classifier performance in positive-unlabeled learning
S Jain, M White, P Radivojac
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
642017
Nonparametric semi-supervised learning of class proportions
S Jain, M White, MW Trosset, P Radivojac
arXiv preprint arXiv:1601.01944, 2016
542016
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online
Y Pan, K Banman, M White
International Conference on Learning Representations, 2021
53*2021
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
International Joint Conference on Artificial Intelligence, 2018
522018
Relaxed clipping: A global training method for robust regression and classification
Y Yu, M Yang, L Xu, M White, D Schuurmans
Advances in Neural Information Processing Systems 23, 2011
512011
Sim2Real in Robotics and Automation: Applications and Challenges
S Höfer, K Bekris, A Handa, JC Gamboa, M Mozifian, F Golemo, ...
IEEE Transactions on Automation Science and Engineering 18 (2), 398-400, 2021
452021
Optimizing for the Future in Non-Stationary MDPs
Y Chandak, G Theocharous, S Shankar, S Mahadevan, M White, ...
International Conference on Machine Learning, 2020
452020
The utility of sparse representations for control in reinforcement learning
V Liu, R Kumaraswamy, L Le, M White
AAAI Conference on Artificial Intelligence, 2019
442019
Investigating practical, linear temporal difference learning
A White, M White
Autonomous Agents and Multiagent Sytems, 2016
412016
A greedy approach to adapting the trace parameter for temporal difference learning
M White, A White
International Conference on Autonomous Agents & Multiagent Systems, 557-565, 2016
402016
Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions.
X Zhang, Y Yu, M White, R Huang, D Schuurmans
Proceedings of the AAAI Conference on Artificial Intelligence, 2011
392011
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20