Evolutionary action selection for gradient-based policy learning Y Ma, T Liu, B Wei, Y Liu, K Xu, W Li ICONIP 2022, 579-590, 2022 | 9 | 2022 |
A self-learning Monte Carlo tree search algorithm for robot path planning W Li, Y Liu, Y Ma, K Xu, J Qiu, Z Gan Frontiers in Neurorobotics 17, 1039644, 2023 | 3 | 2023 |
Open-ended diverse solution discovery with regulated behavior patterns for cross-domain adaptation K Xu, Y Ma, B Wei, W Li AAAI 2023 37 (9), 10585-10593, 2023 | 3 | 2023 |
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI Z Huang, Z Wang, S Xia, X Li, H Zou, R Xu, RZ Fan, L Ye, E Chern, Y Ye, ... arXiv preprint arXiv:2406.12753, 2024 | 1 | 2024 |
Dynamics-aware novelty search with behavior repulsion K Xu, Y Ma, W Li GECCO 2022, 1112-1120, 2022 | 1 | 2022 |
Weak-to-Strong Reasoning Y Yang, Y Ma, P Liu arXiv preprint arXiv:2407.13647, 2024 | | 2024 |
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation E Chern, J Su, Y Ma, P Liu arXiv preprint arXiv:2407.06135, 2024 | | 2024 |
MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation Y Ma, Y Qiao, P Liu ACL 2024, 2024 | | 2024 |