Unified algorithms for rl with decision-estimation coefficients: No-regret, pac, and reward-free learning F Chen, S Mei, Y Bai arXiv preprint arXiv:2209.11745, 2022 | 23 | 2022 |
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection Y Bai, F Chen, H Wang, C Xiong, S Mei arXiv preprint arXiv:2306.04637, 2023 | 14 | 2023 |
Partially observable rl with b-stability: Unified structural condition and sharp sample-efficient algorithms F Chen, Y Bai, S Mei arXiv preprint arXiv:2209.14990, 2022 | 12 | 2022 |
Independent natural policy gradient methods for potential games: Finite-time global convergence with entropy regularization S Cen, F Chen, Y Chi 2022 IEEE 61st Conference on Decision and Control (CDC), 2833-2838, 2022 | 9 | 2022 |
A unified primal-dual algorithm framework for inequality constrained problems Z Zhu, F Chen, J Zhang, Z Wen Journal of Scientific Computing 97 (2), 1-39, 2023 | 3 | 2023 |
Lower Bounds for Learning in Revealing POMDPs F Chen, H Wang, C Xiong, S Mei, Y Bai arXiv preprint arXiv:2302.01333, 2023 | 3 | 2023 |
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP F Chen, J Zhang, Z Wen Advances in Neural Information Processing Systems 35, 10521-10532, 2022 | 1 | 2022 |
On the Optimal Lower and Upper Complexity Bounds for a Class of Composite Optimization Problems Z Zhu, F Chen, J Zhang, Z Wen arXiv preprint arXiv:2308.06470, 2023 | | 2023 |
Provable Convergence of Variational Monte Carlo Methods T Li, F Chen, H Chen, Z Wen arXiv preprint arXiv:2303.10599, 2023 | | 2023 |