Near-Optimal Randomized Exploration for Tabular Markov Decision Processes Z Xiong*, R Shen*, Q Cui*, M Fazel, SS Du Advances in Neural Information Processing Systems 35, 6358-6371, 2022 | 24* | 2022 |
Learning in congestion games with bandit feedback Q Cui*, Z Xiong*, M Fazel, SS Du Advances in Neural Information Processing Systems 35, 11009-11022, 2022 | 14 | 2022 |
Parameterized indexed value function for efficient exploration in reinforcement learning T Tan*, Z Xiong*, VR Dwaracherla Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5948-5955, 2020 | 7 | 2020 |
Selective sampling for online best-arm identification R Camilleri*, Z Xiong*, M Fazel, L Jain, KG Jamieson Advances in Neural Information Processing Systems 34, 11071-11082, 2021 | 6 | 2021 |
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity Z Xiong*, R Camilleri*, M Fazel, L Jain, K Jamieson arXiv preprint arXiv:2307.15154, 2023 | 1 | 2023 |
Offline congestion games: How feedback type affects data coverage requirement H Jiang*, Q Cui*, Z Xiong, M Fazel, SS Du International Conference on Learning Representations, 2022 | 1 | 2022 |
Fourier Learning with Cyclical Data Y Yang*, Z Xiong*, T Liu*, T Wang, C Wang International Conference on Machine Learning, 25280-25301, 2022 | 1 | 2022 |
Machine learning with periodic data Y Yang, T Liu, T Wang, C Wang, Z Xiong US Patent US20230267363A1, 2023 | | 2023 |
A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning H Jiang, Q Cui, Z Xiong, M Fazel, SS Du International Conference on Learning Representations, 2023 | | 2023 |