Learning policies with zero or bounded constraint violation for constrained mdps T Liu, R Zhou, D Kalathil, P Kumar, C Tian Advances in Neural Information Processing Systems 34, 17183-17193, 2021 | 82 | 2021 |
Capacity-achieving private information retrieval codes from MDS-coded databases with minimum message size R Zhou, C Tian, H Sun, T Liu IEEE Transactions on Information Theory 66 (8), 4904-4916, 2020 | 69 | 2020 |
On the information leakage in private information retrieval systems T Guo, R Zhou, C Tian IEEE Transactions on Information Forensics and Security 15, 2999-3012, 2020 | 43 | 2020 |
Individually conditional individual mutual information bound on generalization error R Zhou, C Tian, T Liu IEEE Transactions on Information Theory 68 (5), 3304-3316, 2022 | 39 | 2022 |
Generalized global bandit and its application in cellular coverage optimization C Shen, R Zhou, C Tekin, M van der Schaar IEEE Journal of Selected Topics in Signal Processing 12 (1), 218-232, 2018 | 33 | 2018 |
Regional multi-armed bandits Z Wang, R Zhou, C Shen International Conference on Artificial Intelligence and Statistics, 510-518, 2018 | 24 | 2018 |
Regional multi-armed bandits with partial informativeness Z Wang, R Zhou, C Shen IEEE Transactions on Signal Processing 66 (21), 5705-5717, 2018 | 20 | 2018 |
Weakly private information retrieval under the maximal leakage metric R Zhou, T Guo, C Tian 2020 IEEE International Symposium on Information Theory (ISIT), 1089-1094, 2020 | 19 | 2020 |
Fast global convergence of policy optimization for constrained MDPs T Liu, R Zhou, D Kalathil, PR Kumar, C Tian arXiv preprint arXiv:2111.00552, 2021 | 15 | 2021 |
Cost-aware cascading bandits C Gan, R Zhou, J Yang, C Shen IEEE Transactions on Signal Processing 68, 3692-3706, 2020 | 15 | 2020 |
Anchor-changing regularized natural policy gradient for multi-objective reinforcement learning R Zhou, T Liu, D Kalathil, PR Kumar, C Tian Advances in Neural Information Processing Systems 35, 13584-13596, 2022 | 14 | 2022 |
Policy optimization for constrained mdps with provable fast global convergence T Liu, R Zhou, D Kalathil, PR Kumar, C Tian arXiv preprint arXiv:2111.00552, 2021 | 14 | 2021 |
New results on the storage-retrieval tradeoff in private information retrieval systems T Guo, R Zhou, C Tian IEEE Journal on Selected Areas in Information Theory 2 (1), 403-414, 2021 | 13 | 2021 |
Natural actor-critic for robust reinforcement learning with function approximation R Zhou, T Liu, M Cheng, D Kalathil, PR Kumar, C Tian Advances in neural information processing systems 36, 2024 | 12 | 2024 |
Stochastic chaining and strengthened information-theoretic generalization bounds R Zhou, C Tian, T Liu Journal of the Franklin Institute 360 (6), 4114-4134, 2023 | 12 | 2023 |
Cost-aware cascading bandits R Zhou, C Gan, J Yan, C Shen arXiv preprint arXiv:1805.08638, 2018 | 12 | 2018 |
Neighbor cell list optimization in handover management using cascading bandits algorithm C Wang, J Yang, H He, R Zhou, S Chen, X Jiang IEEE Access 8, 134137-134150, 2020 | 11 | 2020 |
Cost-aware learning and optimization for opportunistic spectrum access C Gan, R Zhou, J Yang, C Shen IEEE Transactions on Cognitive Communications and Networking 5 (1), 15-27, 2018 | 11 | 2018 |
Improved weakly private information retrieval codes C Qian, R Zhou, C Tian, T Liu 2022 IEEE International Symposium on Information Theory (ISIT), 2827-2832, 2022 | 10 | 2022 |
Two-level private information retrieval R Zhou, C Tian, H Sun, JS Plank IEEE Journal on Selected Areas in Information Theory 3 (2), 337-349, 2022 | 8 | 2022 |