Causality-driven hierarchical structure discovery for reinforcement learning X Hu, R Zhang, K Tang, J Guo, Q Yi, R Chen, Z Du, L Li, Q Guo, Y Chen Advances in Neural Information Processing Systems 35, 20064-20076, 2022 | 12 | 2022 |
Hindsight value function for variance reduction in stochastic dynamic environment J Guo, R Zhang, X Zhang, S Peng, Q Yi, Z Du, X Hu, Q Guo, Y Chen arXiv preprint arXiv:2107.12216, 2021 | 9 | 2021 |
Object-category aware reinforcement learning Q Yi, R Zhang, J Guo, X Hu, Z Du, Q Guo, Y Chen Advances in Neural Information Processing Systems 35, 36453-36465, 2022 | 8 | 2022 |
Conceptual reinforcement learning for language-conditioned tasks S Peng, X Hu, R Zhang, J Guo, Q Yi, R Chen, Z Du, L Li, Q Guo, Y Chen Proceedings of the AAAI Conference on Artificial Intelligence 37 (8), 9426-9434, 2023 | 6 | 2023 |
Efficient symbolic policy learning with differentiable symbolic expression J Guo, R Zhang, S Peng, Q Yi, X Hu, R Chen, Z Du, L Li, Q Guo, Y Chen Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
Context shift reduction for offline meta-reinforcement learning Y Gao, R Zhang, J Guo, F Wu, Q Yi, S Peng, S Lan, R Chen, Z Du, X Hu, ... Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
Learning controllable elements oriented representations for reinforcement learning Q Yi, R Zhang, S Peng, J Guo, X Hu, Z Du, Q Guo, R Chen, L Li, Y Chen Neurocomputing 549, 126455, 2023 | 4 | 2023 |
Online prototype alignment for few-shot policy transfer Q Yi, R Zhang, S Peng, J Guo, Y Gao, K Yuan, R Chen, S Lan, X Hu, Z Du, ... International Conference on Machine Learning, 39968-39983, 2023 | 3 | 2023 |
Contrastive modules with temporal attention for multi-task reinforcement learning S Lan, R Zhang, Q Yi, J Guo, S Peng, Y Gao, F Wu, R Chen, Z Du, X Hu, ... Advances in Neural Information Processing Systems 36, 2024 | 2 | 2024 |
Prompt-based Visual Alignment for Zero-shot Policy Transfer H Gao, R Zhang, Q Yi, H Yao, H Li, J Guo, S Peng, Y Gao, QC Wang, ... arXiv preprint arXiv:2406.03250, 2024 | | 2024 |
OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning F Wu, R Zhang, Q Yi, Y Gao, J Guo, S Peng, S Lan, H Han, Y Pan, K Yuan, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (14), 15897 …, 2024 | | 2024 |
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning S Peng, X Hu, Q Yi, R Zhang, J Guo, D Huang, Z Tian, R Chen, Z Du, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (13), 14599 …, 2024 | | 2024 |
Contextual Symbolic Policy For Meta-Reinforcement Learning J Guo, R Zhang, S Peng, Q Yi, X Hu, R Chen, K Long, Z Du, X Zhang, L Li, ... | | |
Causality-driven Hierarchical Structure Discovery for Reinforcement Learning–Appendix S Peng, X Hu, R Zhang, K Tang, J Guo, Q Yi, R Chen, X Zhang, Z Du, L Li, ... survival 200, 400, 0 | | |