Seuraa
Ziyu Wang
Ziyu Wang
Deepmind
Vahvistettu sähköpostiosoite verkkotunnuksessa google.com - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Taking the human out of the loop: A review of Bayesian optimization
B Shahriari, K Swersky, Z Wang, RP Adams, N De Freitas
Proceedings of the IEEE 104 (1), 148-175, 2015
28492015
Dueling network architectures for deep reinforcement learning
Z Wang, T Schaul, M Hessel, H Hasselt, M Lanctot, N Freitas
International conference on machine learning, 1995-2003, 2016
27052016
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
18402019
Emergence of locomotion behaviours in rich environments
N Heess, D TB, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ...
arXiv preprint arXiv:1707.02286, 2017
7292017
Sample efficient actor-critic with experience replay
Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ...
arXiv preprint arXiv:1611.01224, 2016
6772016
Alphastar: Mastering the real-time strategy game starcraft ii
O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, ...
DeepMind blog 2, 2019
4092019
Bayesian optimization in a billion dimensions via random embeddings
Z Wang, F Hutter, M Zoghi, D Matheson, N de Feitas
Journal of Artificial Intelligence Research 55, 361-387, 2016
3122016
Deep fried convnets
Z Yang, M Moczulski, M Denil, N De Freitas, A Smola, L Song, Z Wang
Proceedings of the IEEE international conference on computer vision, 1476-1483, 2015
3062015
Bayesian Optimization in High Dimensions via Random Embeddings.
Z Wang, M Zoghi, F Hutter, D Matheson, N De Freitas
IJCAI, 1778-1784, 2013
2932013
Reinforcement and imitation learning for diverse visuomotor skills
Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ...
arXiv preprint arXiv:1802.09564, 2018
2372018
Learning an embedding space for transferable robot skills
K Hausman, JT Springenberg, Z Wang, N Heess, M Riedmiller
International Conference on Learning Representations, 2018
2142018
Playing hard exploration games by watching youtube
Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N De Freitas
Advances in neural information processing systems 31, 2018
2082018
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...
International Conference on Machine Learning, 2912-2921, 2017
1752017
Robust imitation of diverse behaviors
Z Wang, JS Merel, SE Reed, N de Freitas, G Wayne, N Heess
Advances in Neural Information Processing Systems 30, 2017
1672017
Learning human behaviors from motion capture by adversarial imitation
J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ...
arXiv preprint arXiv:1707.02201, 2017
1652017
Adaptive hamiltonian and riemann manifold monte carlo
Z Wang, S Mohamed, N Freitas
International conference on machine learning, 1462-1470, 2013
1312013
Acme: A research framework for distributed reinforcement learning
M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ...
arXiv preprint arXiv:2006.00979, 2020
952020
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
942020
Bright room temperature single photon source at telecom range in cubic silicon carbide
J Wang, Y Zhou, Z Wang, A Rasmita, J Yang, X Li, HJ von Bardeleben, ...
Nature communications 9 (1), 1-6, 2018
882018
Bayesian optimization in alphago
Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ...
arXiv preprint arXiv:1812.06855, 2018
872018
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–20