Y Yuan, Z Tian, C Wang, F Zheng, Y Lv - Neural Computing and …, 2020 - Springer
… solution can be presented as a sequence: {(0, 1), (1, 0), (2, 6), (3, 2)}, which represents a
probe of an embedding scheme of the agent. The state sets, actions sets and the sequence of …