method for learning optimal reserve prices in second-price auctions. We study a
limited-information setting in which bid values are not revealed and no historical
information about them is available. Our proposed method is based on the
principle of Thompson sampling combined with a particle filter to approximate and sample
from the posterior distribution. Our method is suitable for non-stationary environments, and …