Modeling opponent learning in multiagent repeated games. Y Hu, C Han, H Li, T Guo. Applied Intelligence 53 (13), 17194-17210, 2023 | 5 | 2023 |
Rethinking optimal pivoting paths of simplex method. A Li, B Li, C Han, T Guo. arXiv preprint arXiv:2210.02945, 2022 | 5 | 2022 |
General Method for Solving Four Types of SAT Problems. A Li, C Han, T Guo, H Li, B Li. arXiv preprint arXiv:2312.16423, 2023 | 3 | 2023 |
SC-PSRO: A Unified Strategy Learning Method for Normal-form Games. Y Hu, H Li, C Han, T Guo, M Li, B Li. arXiv preprint arXiv:2308.12520, 2023 | 1 | 2023 |
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning. W Luo, H Li, Z Zhang, C Han, J Lv, T Guo. arXiv preprint arXiv:2408.12830, 2024 | | 2024 |
Optimal pivot path of the simplex method for linear programming based on reinforcement learning. A Li, T Guo, C Han, B Li, H Li. Science China Mathematics 67 (6), 1263-1286, 2024 | | 2024 |
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error. H Li, Z Zhang, W Luo, C Han, Y Hu, T Guo, S Liao. The Forty-first International Conference on Machine Learning (ICML 2024 Oral), 2024 | | 2024 |