Adam Gleave
CEO at FAR AI
Verified email at far.ai · Homepage
Title · Cited by · Year
Stable-baselines3: Reliable reinforcement learning implementations
A Raffin, A Hill, A Gleave, A Kanervisto, M Ernestus, N Dormann
Journal of Machine Learning Research 22 (268), 1-8, 2021
Cited by 1794 · 2021
Stable baselines
A Hill, A Raffin, M Ernestus, A Gleave, A Kanervisto, R Traore, P Dhariwal, ...
Cited by 891 · 2018
Adversarial policies: Attacking deep reinforcement learning
A Gleave, M Dennis, C Wild, N Kant, S Levine, S Russell
International Conference on Learning Representations, 2020
Cited by 396 · 2020
Firmament: Fast, centralized cluster scheduling at scale
I Gog, M Schwarzkopf, A Gleave, RNM Watson, S Hand
12th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2016
Cited by 279 · 2016
Inverse reinforcement learning for video games
A Tucker, A Gleave, S Russell
Deep Reinforcement Learning Workshop at NeurIPS, 2018
Cited by 55 · 2018
imitation: Clean imitation learning implementations
A Gleave, M Taufeeque, J Rocamonde, E Jenner, SH Wang, S Toyer, ...
arXiv preprint arXiv:2211.11972, 2022
Cited by 53* · 2022
Quantifying differences in reward functions
A Gleave, M Dennis, S Legg, S Russell, J Leike
International Conference on Learning Representations, 2021
Cited by 53 · 2021
Multi-task maximum entropy inverse reinforcement learning
A Gleave, O Habryka
GoalsRL Workshop at ICML, 2018
Cited by 44 · 2018
Adversarial Policies Beat Superhuman Go AIs
TT Wang, A Gleave, T Tseng, N Belrose, J Miller, MD Dennis, Y Duan, ...
arXiv preprint arXiv:2211.00241, 2022
Cited by 39* · 2022
Active inverse reward design
S Mindermann, R Shah, A Gleave, D Hadfield-Menell
GoalsRL Workshop at ICML, 2018
Cited by 29 · 2018
Understanding learned reward functions
EJ Michaud, A Gleave, S Russell
Deep Reinforcement Learning Workshop at NeurIPS, 2020
Cited by 26 · 2020
Invariance in policy optimisation and partial identifiability in reward learning
JMV Skalse, M Farrugia-Roberts, S Russell, A Abate, A Gleave
International Conference on Machine Learning, 32033-32058, 2023
Cited by 25 · 2023
Uncertainty estimation for language reward models
A Gleave, G Irving
arXiv preprint arXiv:2203.07472, 2022
Cited by 23 · 2022
A primer on maximum causal entropy inverse reinforcement learning
A Gleave, S Toyer
arXiv preprint arXiv:2203.11409, 2022
Cited by 18 · 2022
Making compression algorithms for Unicode text
A Gleave, C Steinruecken
Data Compression Conference, 2017
Cited by 16 · 2017
On the fragility of learned reward functions
L McKinney, Y Duan, D Krueger, A Gleave
arXiv preprint arXiv:2301.03652, 2023
Cited by 10 · 2023
Exploiting novel GPT-4 APIs
K Pelrine, M Taufeeque, M Zając, E McLean, A Gleave
arXiv preprint arXiv:2312.14302, 2023
Cited by 8 · 2023
Preprocessing reward functions for interpretability
E Jenner, A Gleave
arXiv preprint arXiv:2203.13553, 2022
Cited by 8 · 2022
DERAIL: Diagnostic Environments for Reward And Imitation Learning
P Freire, A Gleave, S Toyer, S Russell
Deep Reinforcement Learning Workshop at NeurIPS, 2020
Cited by 8 · 2020
Reducing exploitability with population based training
P Czempin, A Gleave
arXiv preprint arXiv:2208.05083, 2022
Cited by 5 · 2022
Articles 1–20