J Zhang,
C Wang, D Zang… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
A learning automaton (LA) is a powerful tool for reinforcement learning. Its action probability
vector plays two roles: 1) deciding when it converges, ie, total computing budget it has used …