H Li, Z Zhao, G Lei, L Guo, Z Bi… - Journal of …, 2019 - dc-china-simulation …
Deep reinforcement learning continues to explore in the environment and adjusts the neural
network parameters by the reward function. The actual production line can not be used as …