Y Wu,
W Song,
Z Cao, J Zhang… - … networks and
learning …, 2021 - ieeexplore.ieee.org
… automatically learn high-quality solution picking policies that … , ie, the solution picked by the
policy will always be accepted, to … , our method can learn high-quality policies that outperform …