H Chen, S Li, C Zhang - Advances in Neural Information …, 2021 - proceedings.neurips.cc
The bandit problem with graph feedback, proposed in [Mannor and Shamir, NeurIPS 2011],
is modeled by a directed graph $G=(V,E)$, where $V$ is the collection of bandit arms, and …