Fig. 3
From: Pangenome mining of the Streptomyces genus redefines species’ biosynthetic potential

Advanced clustering of BGCs redefines known GCFs with reduced diversity in specific types of BGCs. A Workflow used to detect BGCs, define GCFs with BiG-SLICE, and regroup GCFs based on knownclusterblast similarity (> 80% of genes). Several examples of known GCFs are shown in the bottom boxes, classified as common, accessory, or unique to Mash-clusters. B Percentage abundance of the top twenty known GCFs across the different primary Mash-clusters. Each row corresponds to a known compound (GCF). The number in parentheses denotes the number of BiG-SLICE-detected GCFs that were regrouped into one GCF. C Overview of the number of GCFs that were regrouped across the twenty most abundant BGC types. Gray bars represent the number of GCFs detected using only BiG-SLICE, whereas blue bars represent the reduced number of GCFs after regrouping based on knownclusterblast.
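
The regrouping step in panel A can be illustrated with a minimal sketch. It is not the authors' implementation: the `BGC` record, the `regroup_gcfs` function, and the toy data are all hypothetical, and the rule used here (a BGC joins a known GCF when its best knownclusterblast hit exceeds 80% similarity, otherwise it keeps its BiG-SLICE GCF) is an assumption about how the threshold in the caption might be applied.

```python
from collections import defaultdict
from typing import NamedTuple, Optional


class BGC(NamedTuple):
    """Hypothetical record for one detected BGC (not a real tool's schema)."""
    bgc_id: str
    bigslice_gcf: str          # GCF assigned by BiG-SLICE
    known_hit: Optional[str]   # best knownclusterblast hit, if any
    similarity: float          # fraction of genes matching the known cluster


SIMILARITY_THRESHOLD = 0.80  # "> 80% of genes", as stated in the caption


def regroup_gcfs(bgcs, threshold=SIMILARITY_THRESHOLD):
    """Map each final GCF label to the set of BiG-SLICE GCFs merged into it.

    Assumed rule: a BGC whose best known hit exceeds the threshold is
    labelled with that known compound; otherwise it keeps its BiG-SLICE GCF.
    """
    regrouped = defaultdict(set)
    for bgc in bgcs:
        if bgc.known_hit is not None and bgc.similarity > threshold:
            label = bgc.known_hit
        else:
            label = bgc.bigslice_gcf
        regrouped[label].add(bgc.bigslice_gcf)
    return dict(regrouped)


# Invented toy data, for illustration only.
example = [
    BGC("bgc1", "GCF_001", "known_A", 1.00),
    BGC("bgc2", "GCF_002", "known_A", 0.90),
    BGC("bgc3", "GCF_003", "known_B", 0.85),
    BGC("bgc4", "GCF_004", "known_B", 0.50),  # below threshold, not regrouped
    BGC("bgc5", "GCF_005", None, 0.0),        # no known hit
]

groups = regroup_gcfs(example)
n_before = len({b.bigslice_gcf for b in example})  # BiG-SLICE only (gray bars)
n_after = len(groups)                              # after regrouping (blue bars)
```

In this toy case, `len(groups["known_A"])` plays the role of the number in parentheses in panel B, and `n_before` versus `n_after` mirrors the gray and blue bars in panel C.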