binary anomaly label is only given on the video level, but the output requires snippet-level
predictions. So, Multiple Instance Learning (MIL) is prevailing in WSVAD. However, MIL is
notoriously known to suffer from many false alarms because the snippet-level detector is
easily biased towards the abnormal snippets with simple context, confused by the normality
with the same bias, and missing the anomaly with a different pattern. To this end, we …