Title | Authors | Venue | Cited by | Year |
Online Robust Reinforcement Learning with Model Uncertainty | Y Wang, S Zou | Advances in Neural Information Processing Systems 34, 2021 | 92 | 2021 |
Policy gradient method for robust reinforcement learning | Y Wang, S Zou | International Conference on Machine Learning, 23484-23526, 2022 | 62 | 2022 |
A Robust and Constrained Multi-Agent Reinforcement Learning Electric Vehicle Rebalancing Method in AMoD Systems | S He, Y Wang, S Han, S Zou, F Miao | 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023 | 33* | 2023 |
Finite-sample analysis of Greedy-GQ with linear function approximation under Markovian noise | Y Wang, S Zou | Conference on Uncertainty in Artificial Intelligence, 11-20, 2020 | 26 | 2020 |
Non-asymptotic analysis for two time-scale TDC with general smooth function approximation | Y Wang, S Zou, Y Zhou | Advances in Neural Information Processing Systems 34, 9747-9758, 2021 | 18* | 2021 |
Provably efficient offline reinforcement learning with trajectory-wise reward | T Xu, Y Wang, S Zou, Y Liang | IEEE Transactions on Information Theory, 2024 | 14 | 2024 |
Robust constrained reinforcement learning | Y Wang, F Miao, S Zou | arXiv preprint arXiv:2209.06866, 2022 | 11 | 2022 |
Robust average-reward Markov decision processes | Y Wang, A Velasquez, G Atia, A Prater-Bennette, S Zou | AAAI 2023, 2023 | 8 | 2023 |
Model-free robust average-reward reinforcement learning | Y Wang, A Velasquez, GK Atia, A Prater-Bennette, S Zou | International Conference on Machine Learning, 36431-36469, 2023 | 5 | 2023 |
Achieving the Asymptotically Optimal Sample Complexity of Offline Reinforcement Learning: A DRO-Based Approach | Y Wang, J Xiong, S Zou | Transactions on Machine Learning Research, 2024 | 2* | 2024 |
Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation | Y Wang, Y Wang, Y Zhou, S Zou | arXiv preprint arXiv:2406.01762, 2024 | 1 | 2024 |
Finite-time error bounds for Greedy-GQ | Y Wang, Y Zhou, S Zou | Machine Learning, 1-38, 2024 | 1 | 2024 |
Data-driven robust multi-agent reinforcement learning | Y Wang, Y Wang, Y Zhou, A Velasquez, S Zou | 2022 IEEE 32nd International Workshop on Machine Learning for Signal …, 2022 | 1 | 2022 |
Robust Average-Reward Reinforcement Learning | Y Wang, A Velasquez, G Atia, A Prater-Bennette, S Zou | Journal of Artificial Intelligence Research 80, 719-803, 2024 | | 2024 |
Model-Free Robust Reinforcement Learning with Sample Complexity Analysis | Y Wang, S Zou, Y Wang | The 40th Conference on Uncertainty in Artificial Intelligence, 2024 | | 2024 |