Implicit regularization in deep matrix factorization | S Arora, N Cohen, W Hu, Y Luo | Advances in Neural Information Processing Systems 32, 2019 | 557 | 2019 |
Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees | Y Luo, H Xu, Y Li, Y Tian, T Darrell, T Ma | arXiv preprint arXiv:1807.03858, 2018 | 259 | 2018 |
Towards resolving the implicit bias of gradient descent for matrix factorization: Greedy low-rank learning | Z Li, Y Luo, K Lyu | arXiv preprint arXiv:2012.09839, 2020 | 136 | 2020 |
Provably efficient Q-learning with function approximation via distribution shift error checking oracle | SS Du, Y Luo, R Wang, H Zhang | Advances in Neural Information Processing Systems 32, 2019 | 105 | 2019 |
Safe reinforcement learning by imagining the near future | G Thomas, Y Luo, T Ma | Advances in Neural Information Processing Systems 34, 13859-13869, 2021 | 73 | 2021 |
Provable representation learning for imitation learning via bi-level optimization | S Arora, S Du, S Kakade, Y Luo, N Saunshi | International Conference on Machine Learning, 367-376, 2020 | 66 | 2020 |
Learning online alignments with continuous rewards policy gradient | Y Luo, CC Chiu, N Jaitly, I Sutskever | 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 52 | 2017 |
Learning barrier certificates: Towards safe reinforcement learning with zero training-time violations | Y Luo, T Ma | Advances in Neural Information Processing Systems 34, 25621-25632, 2021 | 46 | 2021 |
On the expressivity of neural networks for deep reinforcement learning | K Dong, Y Luo, T Yu, C Finn, T Ma | International Conference on Machine Learning, 2627-2637, 2020 | 32 | 2020 |
Towards learning to play piano with dexterous hands and touch | H Xu, Y Luo, S Wang, T Darrell, R Calandra | 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022 | 24 | 2022 |
Learning self-correctable policies and value functions from demonstrations with negative sampling | Y Luo, H Xu, T Ma | arXiv preprint arXiv:1907.05634, 2019 | 19 | 2019 |
Recurrent neural networks for online sequence generation | CC Chiu, N Jaitly, I Sutskever, Y Luo | US Patent 10,281,885, 2019 | 8 | 2019 |
An online sequence-to-sequence model for noisy speech recognition | CC Chiu, D Lawson, Y Luo, G Tucker, K Swersky, I Sutskever, N Jaitly | arXiv preprint arXiv:1706.06428, 2017 | 8 | 2017 |
Bootstrapping the expressivity with model-based planning | K Dong, Y Luo, T Ma | | 2 | 2019 |
Towards Efficient and Effective Deep Model-Based Reinforcement Learning | Y Luo | Princeton University, 2022 | | 2022 |
Recurrent neural networks for online sequence generation | CC Chiu, N Jaitly, I Sutskever, Y Luo | US Patent 10,656,605, 2020 | | 2020 |