Pipeline PSRO: A scalable approach for finding approximate nash equilibria in large games S McAleer, JB Lanier, R Fox, P Baldi
Advances in Neural Information Processing Systems 33, 2020
83 2020 XDO: A double oracle algorithm for extensive-form games S McAleer, JB Lanier, K Wang, P Baldi, R Fox
Advances in Neural Information Processing Systems 34, 2021
55 2021 Curiosity-Driven Multi-Criteria Hindsight Experience Replay JB Lanier, S McAleer, P Baldi
arXiv preprint arXiv:1906.03710, 2019
23 2019 OffWorld gym: Open-access physical robotics environment for real-world reinforcement learning benchmark and research A Kumar, T Buckley, JB Lanier, Q Wang, A Kavelaars, I Kuzovkin
arXiv preprint arXiv:1910.08639, 2019
13 2019 Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games S McAleer, JB Lanier, K Wang, P Baldi, R Fox, T Sandholm
arXiv preprint arXiv:2207.06541, 2022
12 2022 Anytime Optimal PSRO for Two-Player Zero-Sum Games S McAleer, K Wang, M Lanctot, J Lanier, P Baldi, R Fox
arXiv preprint arXiv:2201.07700, 2022
11 2022 Anytime PSRO for Two-Player Zero-Sum Games S McAleer, K Wang, JB Lanier, M Lanctot, P Baldi, T Sandholm, R Fox
11 2022 Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors K Nottingham, Y Razeghi, K Kim, JB Lanier, P Baldi, R Fox, S Singh
arXiv preprint arXiv:2307.11922, 2023
7 2023 Improving Social Welfare While Preserving Autonomy via a Pareto Mediator S McAleer, J Lanier, M Dennis, P Baldi, R Fox
arXiv preprint arXiv:2106.03927, 2021
6 2021 Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments JB Lanier, S McAleer, P Baldi, R Fox
arXiv preprint arXiv:2207.09597, 2022
5 2022 ColosseumRL: A Framework for Multiagent Reinforcement Learning in -Player Games A Shmakov, J Lanier, S McAleer, R Achar, C Lopes, P Baldi
arXiv preprint arXiv:1912.04451, 2019
2 2019 CFR-DO: A Double Oracle Algorithm for Extensive-Form Games S McAleer, J Lanier, P Baldi, R Fox
AAAI-21 Workshop on Reinforcement Learning in Games, 2021
1 2021 Anytime Optimal PSRO for Two-Player Zero-Sum Games S McAleer123, K Wang, J Lanier, M Lanctot, P Baldi, T Sandholm, R Fox
2021 OffWorld Gym: open-access physical lunar analog environment for reinforcement learning and robotics research I Kuzovkin, J Lanier, A Kumar, Q Wang
43rd COSPAR Scientific Assembly. Held 28 January-4 February 43, 164, 2021
2021