关注
Zaiwei Chen
Zaiwei Chen
CMI Postdoctoral Fellow, California Institute of Technology
在 caltech.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning
Z Chen, S Zhang, TT Doan, ST Maguluri, JP Clarke
Automatica, 2022
126*2022
Finite-Sample Analysis of Contractive Stochastic Approximation Using Smooth Convex Envelopes
Z Chen, ST Maguluri, S Shakkottai, K Shanmugam
The 34th Conference on Neural Information Processing Systems, 2020
78*2020
A Lyapunov theory for finite-sample guarantees of Markovian stochastic approximation
Z Chen, ST Maguluri, S Shakkottai, K Shanmugam
Operations Research, 2023
62*2023
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
Z Chen, S Khodadadian, ST Maguluri
The 38th International Conference on Machine Learning, 2021
362021
Finite-sample analysis of off-policy natural actor–critic with linear function approximation
Z Chen, S Khodadadian, ST Maguluri
IEEE Control Systems Letters 6, 2611-2616, 2022
352022
Global convergence of localized policy iteration in networked multi-agent reinforcement learning
Y Zhang, G Qu, P Xu, Y Lin, Z Chen, A Wierman
Proceedings of the ACM on Measurement and Analysis of Computing Systems 7 (1 …, 2023
202023
Nested vehicle routing problem: Optimizing drone-truck surveillance operations
F Zeng, Z Chen, JP Clarke, D Goldsman
Transportation Research Part C: Emerging Technologies 139, 103645, 2022
192022
Sample complexity of policy-based methods under off-policy sampling and linear function approximation
Z Chen, ST Maguluri
International Conference on Artificial Intelligence and Statistics, 11195-11214, 2022
182022
Target Network and Truncation Overcome the Deadly Triad in -Learning
Z Chen, JP Clarke, ST Maguluri
SIAM Journal on Mathematics of Data Science 5 (4), 1078-1101, 2023
162023
Stationary Behavior of Constant Stepsize SGD Type Algorithms: An Asymptotic Characterization
Z Chen, S Mou, ST Maguluri
Proceedings of the ACM on Measurement and Analysis of Computing Systems 6 (1 …, 2022
142022
Finite-sample analysis of off-policy TD-learning via generalized Bellman operators
Z Chen, ST Maguluri, S Shakkottai, K Shanmugam
Advances in Neural Information Processing Systems 34, 21440-21452, 2021
122021
A finite-sample analysis of payoff-based independent learning in zero-sum stochastic games
Z Chen, K Zhang, E Mazumdar, A Ozdaglar, A Wierman
Advances in Neural Information Processing Systems 36, 2024
92024
Convergence rates for localized actor-critic in networked markov potential games
Z Zhou, Z Chen, Y Lin, A Wierman
Uncertainty in Artificial Intelligence, 2563-2573, 2023
72023
Concentration of contractive stochastic approximation: Additive and multiplicative noise
Z Chen, ST Maguluri, M Zubeldia
arXiv preprint arXiv:2303.15740, 2023
72023
Approximate Global Convergence of Independent Learning in Multi-Agent Systems
R Jin, Z Chen, Y Lin, J Song, A Wierman
arXiv preprint arXiv:2405.19811, 2024
2024
Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games
Z Chen, K Zhang, E Mazumdar, A Ozdaglar, A Wierman
arXiv preprint arXiv:2312.04905, 2023
2023
A Unified Lyapunov Framework for Finite-Sample Analysis of Reinforcement Learning Algorithms
Z Chen
ACM SIGMETRICS Performance Evaluation Review 50 (3), 12-15, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–17