Y Zhang, S Wang,
Z Fang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
In this paper, we consider the matching of multi-agent multi-armed bandit problem, ie, while
agents prefer arms with higher expected reward, arms also have preferences on agents. In …