classes such as microRNAs or snoRNAs. However, ML methods have not been successfully
used for homology search tasks. A systematic evaluation of ML in homology search requires
large, controlled, and known ground truth test sets, and thus, methods to construct large
realistic artificial data sets. Here we describe a method for producing sets of arbitrarily large
and diverse snoRNA sequences based on artificial evolution. These are then used to …