Yinlam Chow

Cited by

	All	Since 2019
Citations	4133	3722
h-index	25	24
i10-index	42	40

Citations per year

Year	Citations
2015	35
2016	47
2017	81
2018	110
2019	271
2020	501
2021	714
2022	890
2023	999
2024	346

Public access

7 articles available, 0 articles not available

Based on funding mandates

Co-authors

Ofir Nachum	OpenAI	Verified email at openai.com
Marco Pavone	Stanford University and NVIDIA	Verified email at stanford.edu
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research	Verified email at technion.ac.il
Aviv Tamar	Technion	Verified email at technion.ac.il
Jiyan Yang	Stanford University	Verified email at stanford.edu
Junjie Qin	Assistant Professor, Purdue University	Verified email at purdue.edu
Ram Rajagopal	Associate Professor, Stanford University	Verified email at stanford.edu
Lucas Janson	Assistant Professor, Harvard University Department of Statistics	Verified email at fas.harvard.edu
Marek Petrik	University of New Hampshire	Verified email at cs.unh.edu
Mehrdad Farajtabar	Research Scientist at Apple	Verified email at apple.com
Stefano Carpin	Professor, University of California, Merced	Verified email at ucmerced.edu
Sumeet Katariya	Amazon	Verified email at wisc.edu
Alan Malek	MIT	Verified email at mit.edu
Sumeet Singh	Research Scientist, Google Brain Robotics	Verified email at google.com
Anirudha Majumdar	Princeton University	Verified email at princeton.edu
Christopher Ré	Computer Science, Stanford University	Verified email at cs.stanford.edu
Bo Liu	AAAI SM, IEEE SM	Verified email at cs.umass.edu
Brian M Sadler	The University of Texas at Austin	Verified email at ieee.org
Martin Corless	Aeronautics & Astronautics, Purdue University	Verified email at purdue.edu

Yinlam Chow

Research Scientist, Google Research

Verified email at google.com

Reinforcement learning, Optimal Control, Sequential Decision Making, Robust Control, Nonlinear Systems


Title	Cited by	Year
A Lyapunov-based approach to safe reinforcement learning. Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh. Advances in Neural Information Processing Systems 31, 2018	527	2018
Risk-constrained reinforcement learning with percentile risk criteria. Y Chow, M Ghavamzadeh, L Janson, M Pavone. Journal of Machine Learning Research 18 (167), 1-51, 2018	497	2018
Algorithms for CVaR optimization in MDPs. Y Chow, M Ghavamzadeh. Advances in Neural Information Processing Systems 27, 2014	397	2014
Risk-sensitive and robust decision-making: a CVaR optimization approach. Y Chow, A Tamar, S Mannor, M Pavone. Advances in Neural Information Processing Systems 28, 2015	348	2015
DualDICE: Behavior-agnostic estimation of discounted stationary distribution corrections. O Nachum, Y Chow, B Dai, L Li. Advances in Neural Information Processing Systems 32, 2019	316	2019
More robust doubly robust off-policy evaluation. M Farajtabar, Y Chow, M Ghavamzadeh. International Conference on Machine Learning, 1447-1456, 2018	252	2018
Lyapunov-based safe policy optimization for continuous control. Y Chow, O Nachum, A Faust, E Duenez-Guzman, M Ghavamzadeh. arXiv preprint arXiv:1901.10031, 2019	239	2019
AlgaeDICE: Policy gradient from arbitrary experience. O Nachum, B Dai, I Kostrikov, Y Chow, L Li, D Schuurmans. arXiv preprint arXiv:1912.02074, 2019	225	2019
Safe policy improvement by minimizing robust baseline regret. M Ghavamzadeh, M Petrik, Y Chow. Advances in Neural Information Processing Systems 29, 2016	146	2016
Policy gradient for coherent risk measures. A Tamar, Y Chow, M Ghavamzadeh, S Mannor. Advances in Neural Information Processing Systems 28, 2015	131	2015
CoinDICE: Off-policy confidence interval estimation. B Dai, O Nachum, Y Chow, L Li, C Szepesvári, D Schuurmans. Advances in Neural Information Processing Systems 33, 9398-9411, 2020	79	2020
Sequential decision making with coherent risk. A Tamar, Y Chow, M Ghavamzadeh, S Mannor. IEEE Transactions on Automatic Control 62 (7), 3323-3338, 2016	78	2016
A framework for time-consistent, risk-sensitive model predictive control: Theory and algorithms. S Singh, Y Chow, A Majumdar, M Pavone. IEEE Transactions on Automatic Control 64 (7), 2905-2912, 2018	65	2018
Online modified greedy algorithm for storage control under uncertainty. J Qin, Y Chow, J Yang, R Rajagopal. IEEE Transactions on Power Systems 31 (3), 1729-1743, 2015	63	2015
CAQL: Continuous action Q-learning. M Ryu, Y Chow, R Anderson, C Tjandraatmadja, C Boutilier. arXiv preprint arXiv:1909.12397, 2019	50	2019
Weighted SGD for regression with randomized preconditioning. J Yang, YL Chow, C Ré, MW Mahoney. Journal of Machine Learning Research 18 (211), 1-43, 2018	50	2018
Latent bandits revisited. J Hong, B Kveton, M Zaheer, Y Chow, A Ahmed, C Boutilier. Advances in Neural Information Processing Systems 33, 13423-13433, 2020	46	2020
A framework for time-consistent, risk-averse model predictive control: Theory and algorithms. YL Chow, M Pavone. 2014 American Control Conference, 4204-4211, 2014	44	2014
Distributed online modified greedy algorithm for networked storage operation under uncertainty. J Qin, Y Chow, J Yang, R Rajagopal. IEEE Transactions on Smart Grid 7 (2), 1106-1118, 2015	42	2015
Path consistency learning in Tsallis entropy regularized MDPs. Y Chow, O Nachum, M Ghavamzadeh. International Conference on Machine Learning, 979-988, 2018	35	2018
