the model learned on labeled data to generate pseudo-labels for unlabeled samples, and in
turn teach itself. To obtain valid supervision, active attempts typically employ a momentum
teacher for pseudo-label prediction yet observe the confirmation bias issue, where the
incorrect predictions may provide wrong supervision signals and get accumulated in the
training process. The primary cause of such a drawback is that the prevailing self-training …
The supplementary material is organized as follows. Sec. B shows more dataset and
implementation details. Sec. C provide more ablation studies of our FST, including the
ablation on SYNTHIA→ Cityscapes and evaluation of various segmentation decoders. Sec.
D and Sec. E present more comparisons of our FST with state-of-the-art methods on both
UDA and SSL benchmarks. Sec. F analyzes the training process of our method and shows
more visualization comparisons with classical self-training. Sec. G discusses the social …