RL4CO: an extensive reinforcement learning for combinatorial optimization benchmark F Berto, C Hua, J Park, L Luttmann, Y Ma, F Bu, J Wang, H Ye, M Kim, ... arXiv preprint arXiv:2306.17100, 2023 | 10 | 2023 |
PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization F Berto, C Hua, L Luttmann, J Son, J Park, K Ahn, C Kwon, L Xie, J Park arXiv preprint arXiv:2409.03811, 2024 | | 2024 |