Gregory Farquhar

Viittaukset

	Kaikki	2019 lähtien
Sitaatit	7073	6830
h-indeksi	14	14
i10-indeksi	18	18

2100

1050

525

1575

2017201820192020202120222023202448 155 436 768 1186 1639 2052 743

Yleisessä käytössä

Näytä kaikki

13 artikkelia

0 artikkelia

käytettävissä

ei käytettävissä

Perustuu rahoitusehtoihin

Muut kirjoittajat

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVahvistettu sähköpostiosoite verkkotunnuksessa cs.ox.ac.uk
Jakob FoersterAssociate Professor, University of OxfordVahvistettu sähköpostiosoite verkkotunnuksessa eng.ox.ac.uk
Nantas NardelliStealthVahvistettu sähköpostiosoite verkkotunnuksessa arbitrarygravitas.com
Philip TorrProfessor, University of OxfordVahvistettu sähköpostiosoite verkkotunnuksessa eng.ox.ac.uk
Triantafyllos AfourasFAIR, Meta, University of OxfordVahvistettu sähköpostiosoite verkkotunnuksessa fb.com
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindVahvistettu sähköpostiosoite verkkotunnuksessa cs.ucl.ac.uk
Pushmeet KohliDeepMindVahvistettu sähköpostiosoite verkkotunnuksessa google.com

Seuraa

Gregory Farquhar

DeepMind

Vahvistettu sähköpostiosoite verkkotunnuksessa google.com

Reinforcement Learning Artificial Intelligence


Nimike Lajittele sitaattien mukaan Lajittele vuoden mukaan Lajittele otsikon mukaan	Viittaukset Viittaukset	Vuosi
Monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson Journal of Machine Learning Research 21 (178), 1-51, 2020	2170	2020
Counterfactual multi-agent policy gradients J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2068	2018
The starcraft multi-agent challenge M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ... arXiv preprint arXiv:1902.04043, 2019	924	2019
Stabilising experience replay for deep multi-agent reinforcement learning J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ... International conference on machine learning, 1146-1155, 2017	712	2017
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, G Farquhar, B Peng, S Whiteson Advances in neural information processing systems 33, 10199-10210, 2020	321	2020
A survey of reinforcement learning informed by natural language J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ... arXiv preprint arXiv:1906.03926, 2019	284	2019
Treeqn and atreec: Differentiable tree-structured models for deep reinforcement learning G Farquhar, T Rocktäschel, M Igl, S Whiteson arXiv preprint arXiv:1710.11417, 2017	141	2017
Multi-agent common knowledge reinforcement learning C Schroeder de Witt, J Foerster, G Farquhar, P Torr, W Boehmer, ... Advances in neural information processing systems 32, 2019	112*	2019
Dice: The infinitely differentiable monte carlo estimator J Foerster, G Farquhar, M Al-Shedivat, T Rocktäschel, E Xing, S Whiteson International Conference on Machine Learning, 1529-1538, 2018	93	2018
Transient non-stationarity and generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826, 2020	65	2020
Growing action spaces G Farquhar, L Gustafson, Z Lin, S Whiteson, N Usunier, G Synnaeve International Conference on Machine Learning, 3040-3051, 2020	33	2020
Proper value equivalence C Grimm, A Barreto, G Farquhar, D Silver, S Singh Advances in Neural Information Processing Systems 34, 7773-7786, 2021	32	2021
The impact of non-stationarity on generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826 8, 2020	30	2020
Psiphi-learning: Reinforcement learning with demonstrations using successor features and inverse temporal difference learning A Filos, C Lyle, Y Gal, S Levine, N Jaques, G Farquhar International Conference on Machine Learning, 3305-3317, 2021	25	2021
A baseline for any order gradient estimation in stochastic computation graphs J Mao, J Foerster, T Rocktäschel, M Al-Shedivat, G Farquhar, S Whiteson International Conference on Machine Learning, 4343-4351, 2019	12	2019
Counterfactual multi-agent policy gradients. CoRR abs/1705.08926 (2017) JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson arXiv preprint arXiv:1705.08926, 2017	11	2017
Self-consistent models and values G Farquhar, K Baumli, Z Marinho, A Filos, M Hessel, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 1111-1125, 2021	10	2021
Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning G Farquhar, S Whiteson, J Foerster Advances in Neural Information Processing Systems 32, 2019	10	2019
Model-value inconsistency as a signal for epistemic uncertainty A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ... arXiv preprint arXiv:2112.04153, 2021	9	2021
No DICE: An investigation of the bias-variance tradeoff in meta-gradients R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson Deep RL Workshop NeurIPS 2021, 2021	5	2021

Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.

Artikkelit 1–20

Sitaatteja vuodessa

Päällekkäiset lähteet

Yhdistetyt sitaatit

Lisää muut kirjoittajatMuut kirjoittajat

Seuraa

Viittaukset

Muut kirjoittajat