contrastive learning. However, due to the intra-class variance and inter-class similarity, the
cluster uni-proxy is prone to be biased and confused with similar classes, resulting in the
learned features lacking intra-class compactness and inter-class separation in the
embedding space. To completely and accurately represent the information contained in a
cluster and learn discriminative features, we propose to maintain discrepant cluster proxies …