Optimal transport for treatment effect estimation H Wang, J Fan, Z Chen, H Li, W Liu, T Liu, Q Dai, Y Wang, Z Dong, ... Advances in Neural Information Processing Systems 36, 2024 | 24 | 2024 |
A review for deep reinforcement learning in atari: Benchmarks, challenges, and solutions J Fan arXiv preprint arXiv:2112.04145, 2021 | 17 | 2021 |
Learnable behavior control: Breaking atari human world records via sample-efficient behavior selection J Fan, Y Zhuang, Y Liu, J Hao, B Wang, J Zhu, H Wang, ST Xia The Eleventh International Conference on Learning Representations, 2023 | 16 | 2023 |
Generalized data distribution iteration J Fan, C Xiao The Thirty-ninth International Conference on Machine Learning, 2022 | 13 | 2022 |
Gdi: Rethinking what makes reinforcement learning different from supervised learning J Fan, C Xiao, Y Huang arXiv preprint arXiv:2106.06232, 2021 | 10 | 2021 |
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning C Xiao, H Shi, J Fan, S Deng arXiv preprint arXiv:2106.00707, 2021 | 5 | 2021 |
CASA: A bridge between gradient of policy improvement and policy evaluation C Xiao, H Shi, J Fan, S Deng CoRR, abs/2105.03923, 2021a. URL https://arxiv. org/abs/2105.03923, 2021 | 4 | 2021 |
Critic PI2: Master continuous planning via policy improvement with path integrals and deep actor-critic reinforcement learning J Fan, H Ba, X Guo, J Hao arXiv preprint arXiv:2011.06752, 2020 | 4 | 2020 |
Entire space counterfactual learning: Tuning, analytical properties and industrial applications H Wang, Z Chen, J Fan, Y Huang, W Liu, X Liu arXiv preprint arXiv:2210.11039, 2022 | 3 | 2022 |
Convformer: Revisiting transformer for sequential user modeling H Wang, J Lian, M Wu, H Li, J Fan, W Xu, C Li, X Xie arXiv preprint arXiv:2308.02925, 2023 | 2 | 2023 |
PRANCE: Joint Token-Optimization and Structural Channel-Pruning for Adaptive ViT Inference Y Li, C Tang, Y Meng, J Fan, Z Chai, X Ma, Z Wang, W Zhu arXiv preprint arXiv:2407.05010, 2024 | | 2024 |
Proximity Matters: Local Proximity Preserved Balancing for Treatment Effect Estimation H Wang, Z Chen, Y Shen, J Fan, Z Liu, D Yang, X Liu, H Li arXiv preprint arXiv:2407.01111, 2024 | | 2024 |
Sinkhorn Discrepancy for Counterfactual Generalization H Wang, Q Dai, J Fan, W Liu, Z Chen, T Liu, Y Wang, Z Dong, R Tang | | |