classes compared to normal traffic. Many datasets are collected in simulated environments
rather than real-world networks. These challenges undermine the performance of intrusion
detection machine learning models by fitting machine learning models to unrepresentative
“sandbox” datasets. This survey presents a taxonomy with eight main challenges and
explores common datasets from 1999 to 2020. Trends are analyzed on the challenges in the …