uncoordinated, which often leads to inefficient use of resources and poor performance. To
solve this, we envision the utilization of completely decentralized mechanisms to enable
Spatial Reuse (SR). In particular, we focus on dynamic channel selection and Transmission
Power Control (TPC). We rely on Reinforcement Learning (RL), and more specifically on
Multi-Armed Bandits (MABs), to allow networks to learn their best configuration. In this work …