Fig. 2From: VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA virusesBoxplot of different features across non-viral and viral groups. âNonviralâ includes bacteria and archaea, fungi and protozoa, and plasmids. A subset of 100 random genome fragments were used for each group. â% of viral geneâ is calculated as the percent of genes annotated as viral (best hit to viral HMMs) of all genes; â% of bacterial geneâ is calculated as the percent of genes annotated as bacterial (best hit to bacterial HMMs) of all genes; âStrand switch frequencyâ is the percent of genes located on a different strand from the upstream gene (scanning from 5âČ to 3âČ in the + strand); âGene densityâ is the average number of genes in every 1000 bp sequence (total number of genes divided by contig length and then multiplied by 1000); âAverage GC content of genesâ is the mean of GC content of all genes in a contig; âTATATA_3-6 motif frequencyâ is the percent of ribosomal binding sites (RBS) with âTATATA_3-6â motifBack to article page