Title | Authors | Venue | Cited by | Year |
RL for Latent MDPs: Regret Guarantees and a Lower Bound | J Kwon, Y Efroni, C Caramanis, S Mannor | Advances in Neural Information Processing Systems 34, 2021 | 67 | 2021 |
Global Convergence of the EM Algorithm for Mixtures of Two Component Linear Regression | J Kwon, W Qian, C Caramanis, Y Chen, D Davis | Conference on Learning Theory, 2055-2110, 2019 | 66 | 2019 |
EM Converges for a Mixture of Many Linear Regressions | J Kwon, C Caramanis | International Conference on Artificial Intelligence and Statistics, 1727-1736, 2020 | 40 | 2020 |
On the Minimax Optimality of the EM Algorithm for Learning Two-Component Mixed Linear Regression | J Kwon, N Ho, C Caramanis | International Conference on Artificial Intelligence and Statistics, 1405-1413, 2021 | 39 | 2021 |
The EM Algorithm gives Sample-Optimality for Learning Mixtures of Well-Separated Gaussians | J Kwon, C Caramanis | Conference on Learning Theory, 2425-2487, 2020 | 31* | 2020 |
On the Computational and Statistical Complexity of Over-Parameterized Matrix Sensing | J Zhuo, J Kwon, N Ho, C Caramanis | arXiv preprint arXiv:2102.02756, 2021 | 28 | 2021 |
A Fully First-Order Method for Stochastic Bilevel Optimization | J Kwon, D Kwon, S Wright, RD Nowak | International Conference on Machine Learning, 18083-18113, 2023 | 26 | 2023 |
Reinforcement Learning in Reward-Mixing MDPs | J Kwon, Y Efroni, C Caramanis, S Mannor | Advances in Neural Information Processing Systems 34, 2253-2264, 2021 | 16 | 2021 |
Feed two birds with one scone: Exploiting wild data for both out-of-distribution generalization and detection | H Bai, G Canal, X Du, J Kwon, RD Nowak, Y Li | International Conference on Machine Learning, 1454-1471, 2023 | 12 | 2023 |
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms | J Kwon, Y Efroni, C Caramanis, S Mannor | International Conference on Machine Learning, 11772-11789, 2022 | 7 | 2022 |
On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation | J Kwon, D Kwon, S Wright, R Nowak | arXiv preprint arXiv:2309.01753, 2023 | 6 | 2023 |
Reward-Mixing MDPs with Few Latent Contexts are Learnable | J Kwon, Y Efroni, C Caramanis, S Mannor | International Conference on Machine Learning, 18057-18082, 2023 | 2 | 2023 |
Tractable Optimality in Episodic Latent MABs | J Kwon, Y Efroni, C Caramanis, S Mannor | Advances in Neural Information Processing Systems 35, 23634-23645, 2022 | 1 | 2022 |
Modeling and simulation of nonlinear transient responses of high-voltage wordline generators in NAND flash memories | J Lee, JY Kwon, J Kim | 2015 International SoC Design Conference (ISOCC), 323-324, 2015 | 1 | 2015 |
Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments | J Kwon, L Yang, R Nowak, J Hanna | arXiv preprint arXiv:2402.07102, 2024 | | 2024 |
On the Complexity of First-Order Methods in Stochastic Bilevel Optimization | J Kwon, D Kwon, H Lyu | arXiv preprint arXiv:2402.07101, 2024 | | 2024 |
Prospective Side Information for Latent MDPs | J Kwon, Y Efroni, S Mannor, C Caramanis | arXiv preprint arXiv:2310.07596, 2023 | | 2023 |
Statistical learning with latent variables: mixture models and reinforcement learning | J Kwon | | | 2022 |