Tor Lattimore

Cited by

	All	Since 2019
Citations	6522	6055
h-index	37	35
i10-index	67	62

1600

800

400

1200

20132014201520162017201820192020202120222023202424 26 53 56 95 179 344 766 1219 1445 1577 701

Public access

View all

21 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Marcus HutterResearcher@DeepMind & Professor at ANUVerified email at anu.edu.au
Botao HaoDeepmindVerified email at google.com
Andras GyorgyDeepMindVerified email at google.com
Laurent OrseauResearch Scientist at Google DeepMindVerified email at google.com
Branislav KvetonAmazonVerified email at amazon.com
Eren SezenerDeepMindVerified email at google.com
Ian OsbandOpenAIVerified email at openai.com
Christoph DannResearch Scientist, GoogleVerified email at google.com
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Julian ZimmertGoogle ResearchVerified email at google.com
Mengdi WangCenter for Statistics & Machine Learning, ECE, Princeton UniversityVerified email at princeton.edu
Joel VenessGoogle DeepMindVerified email at google.com
Benjamin Van RoyStanford UniversityVerified email at stanford.edu
Satinder SinghGoogle DeepMind / U. of MichiganVerified email at umich.edu
Johannes KirschnerSwiss Data Science Center, ETH ZurichVerified email at sdsc.ethz.ch
Dale SchuurmansUniversity of Alberta, Google DeepMindVerified email at cs.ualberta.ca
Emilie KaufmannCNRS & Univ. Lille (CRIStAL)Verified email at inria.fr
Avishkar BhoopchandResearch Engineer, DeepMindVerified email at google.com
Agnieszka Grabska BarwińskaDeepMindVerified email at google.com

Tor Lattimore

DeepMind

Verified email at google.com - Homepage

machine learning learning theory reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bandit algorithms T Lattimore, C Szepesvári Cambridge University Press, 2020	2630	2020
Unifying PAC and regret: Uniform PAC bounds for episodic reinforcement learning C Dann, T Lattimore, E Brunskill Advances in Neural Information Processing Systems 30, 2017	301	2017
Causal bandits: Learning good interventions via causal inference F Lattimore, T Lattimore, MD Reid Advances in neural information processing systems 29, 2016	258*	2016
Degenerate feedback loops in recommender systems R Jiang, S Chiappa, T Lattimore, A György, P Kohli Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 383-390, 2019	215	2019
Learning with good feature representations in bandits and in rl with a generative model T Lattimore, C Szepesvari, G Weisz International conference on machine learning, 5662-5670, 2020	178	2020
Behaviour suite for reinforcement learning I Osband, Y Doron, M Hessel, J Aslanides, E Sezener, A Saraiva, ... arXiv preprint arXiv:1908.03568, 2019	176	2019
PAC bounds for discounted MDPs T Lattimore, M Hutter Algorithmic Learning Theory: 23rd International Conference, ALT 2012, Lyon …, 2012	139	2012
The end of optimism? an asymptotic analysis of finite-armed linear bandits T Lattimore, C Szepesvari Artificial Intelligence and Statistics, 728-737, 2017	131	2017
Conservative bandits Y Wu, R Shariff, T Lattimore, C Szepesvári International Conference on Machine Learning, 1254-1262, 2016	120	2016
On explore-then-commit strategies A Garivier, T Lattimore, E Kaufmann Advances in Neural Information Processing Systems 29, 2016	114	2016
A geometric perspective on optimal representations for reinforcement learning M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ... Advances in neural information processing systems 32, 2019	95	2019
Model selection in contextual stochastic bandit problems A Pacchiano, M Phan, Y Abbasi Yadkori, A Rao, J Zimmert, T Lattimore, ... Advances in Neural Information Processing Systems 33, 10328-10337, 2020	91	2020
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh International Conference on Machine Learning, 3601-3610, 2019	72	2019
Toprank: A practical algorithm for online stochastic ranking T Lattimore, B Kveton, S Li, C Szepesvari Advances in Neural Information Processing Systems 31, 2018	71	2018
The sample-complexity of general reinforcement learning T Lattimore, M Hutter, P Sunehag International Conference on Machine Learning, 28-36, 2013	69	2013
Near-optimal PAC bounds for discounted MDPs T Lattimore, M Hutter Theoretical Computer Science 558, 125-143, 2014	68	2014
Linear bandits with stochastic delayed feedback C Vernade, A Carpentier, T Lattimore, G Zappella, B Ermis, M Brueckner International Conference on Machine Learning, 9712-9721, 2020	67	2020
Bounded Regret for Finite-Armed Structured Bandits T Lattimore, R Munos	67	2014
Adaptive exploration in linear contextual bandit B Hao, T Lattimore, C Szepesvari International Conference on Artificial Intelligence and Statistics, 3536-3545, 2020	60	2020
An information-theoretic approach to minimax regret in partial monitoring T Lattimore, C Szepesvári Conference on Learning Theory, 2111-2139, 2019	59	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors