Yannick Schroecker
DeepMind
Verified email at google.com
Title
Cited by
Year
Imitating Latent Policies from Observation
AD Edwards, H Sahni, Y Schroecker, CL Isbell
International Conference on Machine Learning, 2018
Cited by: 138; Year: 2018
Bootstrapped meta-learning
S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh
arXiv preprint arXiv:2109.04504, 2021
Cited by: 66; Year: 2021
Human-timescale adaptation in an open-ended task space
AA Team, J Bauer, K Baumli, S Baveja, F Behbahani, A Bhoopchand, ...
arXiv preprint arXiv:2301.07608, 2023
Cited by: 63*; Year: 2023
Generative predecessor models for sample-efficient imitation learning
Y Schroecker, M Vecerik, J Scholz
International Conference on Learning Representations, 2019
Cited by: 38; Year: 2019
State aware imitation learning
Y Schroecker, CL Isbell
Advances in Neural Information Processing Systems 30, 2017
Cited by: 30; Year: 2017
Structured state space models for in-context reinforcement learning
C Lu, Y Schroecker, A Gu, E Parisotto, J Foerster, S Singh, F Behbahani
Advances in Neural Information Processing Systems 36, 2024
Cited by: 28; Year: 2024
Discovering policies with domino: Diversity optimization maintaining near optimality
T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ...
arXiv preprint arXiv:2205.13521, 2022
Cited by: 27; Year: 2022
Active learning within constrained environments through imitation of an expert questioner
K Bullard, Y Schroecker, S Chernova
International Joint Conference on Artificial Intelligence, 2019
Cited by: 18; Year: 2019
Universal value density estimation for imitation learning and goal-conditioned reinforcement learning
Y Schroecker, C Isbell
arXiv preprint arXiv:2002.06473, 2020
Cited by: 13; Year: 2020
Directing policy search with interactively taught via-points
Y Schroecker, H Ben Amor, A Thomaz
International Conference on Autonomous Agents & Multiagent Systems, 1052-1059, 2016
Cited by: 11; Year: 2016
Meta-gradients in non-stationary environments
J Luketina, S Flennerhag, Y Schroecker, D Abel, T Zahavy, S Singh
Conference on Lifelong Learning Agents, 886-901, 2022
Cited by: 7; Year: 2022
Imitation learning using a generative predecessor neural network
M Vecerik, Y Schroecker, JK Scholz
US Patent 10,872,294, 2020
Cited by: 7; Year: 2020
Vision-language models as a source of rewards
K Baumli, S Baveja, F Behbahani, H Chan, G Comanici, S Flennerhag, ...
arXiv preprint arXiv:2312.09187, 2023
Cited by: 1; Year: 2023
Manipulating State Space Distributions for Sample-Efficient Imitation-Learning
Y Schroecker
Georgia Institute of Technology, Atlanta, GA, USA, 2020
Cited by: 1; Year: 2020
Articles 1–14