Zeyuan Allen-Zhu (朱澤園)
Microsoft Research Redmond
Verified email at csail.mit.edu
Title
Cited by
Year
A convergence theory for deep learning via over-parameterization
Z Allen-Zhu, Y Li, Z Song
ICML 2019: International Conference on Machine Learning, 2019
Cited by: 647, Year: 2019
Katyusha: the first direct acceleration of stochastic gradient methods
Z Allen-Zhu
STOC 2017: Symposium on Theory of Computing, 19-23, 2017
Cited by: 488, Year: 2017
Learning and generalization in overparameterized neural networks, going beyond two layers
Z Allen-Zhu, Y Li, Y Liang
NeurIPS 2019: Neural Information Processing Systems, 2019
Cited by: 414, Year: 2019
Is Q-learning Provably Efficient?
C Jin, Z Allen-Zhu, S Bubeck, MI Jordan
NIPS 2018: Neural Information Processing Systems, 2018
Cited by: 359, Year: 2018
Variance reduction for faster non-convex optimization
Z Allen-Zhu, E Hazan
ICML 2016: International Conference on Machine Learning, 699-707, 2016
Cited by: 350, Year: 2016
Linear coupling: An ultimate unification of gradient and mirror descent
Z Allen-Zhu, L Orecchia
ITCS 2017: Innovations in Theoretical Computer Science, 2017
Cited by: 290*, Year: 2017
Finding approximate local minima faster than gradient descent
N Agarwal, Z Allen-Zhu, B Bullins, E Hazan, T Ma
STOC 2017: Symposium on Theory of Computing, 1195-1199, 2017
Cited by: 266*, Year: 2017
A simple, combinatorial algorithm for solving SDD systems in nearly-linear time
JA Kelner, L Orecchia, A Sidford, ZA Zhu
STOC 2013: Symposium on Theory of Computing, 911-920, 2013
Cited by: 238, Year: 2013
Natasha 2: Faster Non-Convex Optimization Than SGD
Z Allen-Zhu
NIPS 2018: Neural Information Processing Systems, 2018
Cited by: 204, Year: 2018
Improved SVRG for non-strongly-convex or sum-of-non-convex objectives
Z Allen-Zhu, Y Yuan
ICML 2016: International Conference on Machine Learning, 1080-1089, 2016
Cited by: 176, Year: 2016
Even faster accelerated coordinate descent using non-uniform sampling
Z Allen-Zhu, Z Qu, P Richtárik, Y Yuan
ICML 2016: International Conference on Machine Learning, 1110-1119, 2016
Cited by: 157, Year: 2016
Byzantine Stochastic Gradient Descent
D Alistarh, Z Allen-Zhu, J Li
NIPS 2018: Neural Information Processing Systems, 2018
Cited by: 153, Year: 2018
Asymptotically optimal strategy-proof mechanisms for two-facility games
P Lu, X Sun, Y Wang, ZA Zhu
ACM-EC 2010: Conference on Economics and Computation, 315-324, 2010
Cited by: 145, Year: 2010
Randomized accuracy-aware program transformations for efficient approximate computations
ZA Zhu, S Misailovic, JA Kelner, M Rinard
POPL 2012: Symposium on Principles of Programming Languages, 441-454, 2012
Cited by: 112, Year: 2012
A novel click model and its applications to online advertising
ZA Zhu, W Chen, T Minka, C Zhu, Z Chen
WSDM 2010: International Conference on Web Search and Data Mining, 321-330, 2010
Cited by: 110, Year: 2010
LazySVD: Even faster SVD decomposition yet without agonizing pain
Z Allen-Zhu, Y Li
NIPS 2016: Neural Information Processing Systems, 974-982, 2016
Cited by: 101, Year: 2016
Neon2: Finding Local Minima via First-Order Oracles
Z Allen-Zhu, Y Li
NIPS 2018: Neural Information Processing Systems, 2018
Cited by: 97, Year: 2018
Natasha: Faster Non-Convex Stochastic Optimization Via Strongly Non-Convex Parameter
Z Allen-Zhu
ICML 2017: International Conference on Machine Learning, 2017
Cited by: 94*, Year: 2017
P-packSVM: Parallel primal gradient descent kernel SVM
ZA Zhu, W Chen, G Wang, C Zhu, Z Chen
ICDM 2009: International Conference on Data Mining, 677-686, 2009
Cited by: 93*, Year: 2009
On the convergence rate of training recurrent neural networks
Z Allen-Zhu, Y Li, Z Song
NeurIPS 2019: Neural Information Processing Systems, 2019
Cited by: 92, Year: 2019