Identification and characterization of type-I PKS genes
AntiSMASH analysis of the Elsinoë genomes and those of the two related fungi included for comparative purposes revealed seven type-I PKS clusters in E. necatrix, five in E. arachidis, E. batatas and A. pullulans, four in E. fawcettii and M. duriaei and three in E. ampelina, E. murrayae and E. australis (Table 2). Each type-I PKS cluster had a single main type-I PKS gene which was mapped back to the respective genomes and for which gene identity was confirmed by BLASTp analysis. A phylogenetic comparison of the highly conserved ketosynthase (KS) and acyltransferase (AT) protein domains from the predicted type-I PKS genes made it possible to assign a putative function to each type-I PKS gene based on a set of reference proteins [18, 22, 37] (Table 3). Eight distinct classes of PKSs were identified, with all members within a class noted to synthesize structurally similar secondary metabolites (Fig. 1). Based on the clustering of the identified PKS protein sequences and the reference protein sequences within the tree, the classes were categorized as perylenequinone, aflatoxin, naphthoquinones, melanin, anthraquinone, macrolide lactones, t-toxin and azaphilones [18, 22, 37].
Only the melanin and perylenequinone gene clusters were represented in all the Elsinoë species considered in this study. The Elsinoë PKS protein domains present in the melanin class were representative of the melanin biosynthesis pathway, while the proteins clustering in the perylenequinone class represented the elsinochrome producing PKS gene cluster previously identified by Ebert et al. [18]. Putative amino acid sequences from specific Elsinoë species were found to represent the classes of macrolide lactones, naphthoquinones, t-toxin, and anthraquinones. All Elsinoë species other than E. ampelina and E. australis had one or more putative PKS proteins present in the t-toxin class, with E. necatrix having three representatives. E. australis and E. fawcettii had one putative PKS protein each in the anthraquinone class, while E. ampelina, E. arachidis and E. batatas each had a single putative PKS protein in the naphthoquinone class. Elsinoë necatrix was the only species that had any representation (two PKS proteins) in the macrolide lactone class. In addition, none of the Elsinoë species had any representative putative protein sequence in the aflatoxin class. M. duriaei and A. pullulans included for comparative purposes had representative putative protein sequences only in three classes. Both A. pullulans and M. duriaei had a single PKS gene in the melanin class, while four A. pullulans and one M. duriaei PKS genes were present in the t-toxin group. Two PKS genes from the genome of M. duriaei were present in the azaphilones outgroup, with A. pullulans not having a representative in this group.
InterProScan analysis of the different Elsinoë putative PKS proteins identified eleven different domains (Fig. 2). All of the identified PKSs contained the three main domains i.e., β-ketosynthase (KS; Pfam accession number PF00109), acyl-transferase (AT; PF00698) and ACP (PP-binding; PF00550). The melanin, elsinochrome and macrolide lactones PKSs were very similar with all three containing the additional starter unit (SAT; PF16073), dehydratase (DH; PF14765) and thioesterase (TE; PF00975) domains. Similarly, the naphthoquinone PKSs also contained an additional SAT and TE domain but lacked a DH domain. The t-toxin PKSs were divided into five groups depending on the additional domains that were present. t-toxin 1 PKS was defined by the DH, methyl-transferase (MT; PF08242), trans-acting enoyl (ER; PF00107) and β-ketoreductase (KR; PF08659) domains. t-toxin 2 PKSs contained the DH, MT, and ER domains, while t-toxin 3 contained DH, ER and KR domains. t-toxin 4 PKS contained a DH, MT, KR and a nonribosomal peptide synthetase (NRPS; PF00668) domain, while t-toxin 5 PKS contained an acetyl-CoA synthetase (AMP; PF00501) domain in addition to the DH, MT, and KR domains.
Annotation and synteny analysis of the elsinochrome and melanin biosynthetic gene cluster.
The melanin and perylenequinone type-I PKS genes were used to annotate the complete melanin and elsinochrome gene clusters in all seven Elsinoë species. After adjusting the cluster boundaries to match those previously described [18, 20], the melanin and elsinochrome clusters contained six and ten genes, respectively. BLASTp using the putative cluster proteins as queries confirmed that each gene matched to a homolog previously characterised in E. fawcettii (Table S1) [18, 20]. For the melanin biosynthetic cluster, this included the core biosynthetic PKS gene encoding a polyketide synthase (EfPKS1), as well as genes for transcription factor (TSF1), ESC reductase (RDT1), ESC prefoldin protein subunit 3 (PRF1), and ECT1 transporter (ECT1). For the elsinochrome biosynthetic cluster, the predicted genes included the core biosynthetic PKS gene encoding the protein polyketide synthase CTB1-like protein (CTB1) as well as homologs to the genes for O-methyltransferase (OMT1), FAD-binding domain-containing protein (FAD1), major facilitator superfamily domain-containing protein (MFS1), O-methyltransferase 2 (OMT2), fungal Zn(2)-Cys(6) binuclear cluster domain-containing protein (ZNC1), protein STB3 (STB3), FAD binding/ oxidoreductase (OXR1), and a fasciclin domain-containing protein (FAS1).
Only a melanin biosynthesis cluster was detected in the genomes of the closely related Dothideomycetes taxa, M. duriaei and A. pullulans. In M. duriaei, all five genes in the melanin cluster could be identified, while only four of these genes (no BLAST identity to the ECT1 protein sequence) were present in A. pullulans. No homologs to any of the genes from the elsinochrome biosynthetic cluster were detected in the genome sequences of either of the two fungi.
High levels of synteny were apparent within both the melanin and elsinochrome biosynthetic clusters across the different Elsinoë species (Fig. 3). For the melanin cluster, the five main genes were present in the order RDT1 - TSF1 - PKS1 - PRF1 - ECT1. The only exception was the ECT1 gene that was inverted and positioned upstream of RDT1 in E. australis and E. murrayae, resulting in a gene order of ECT1 - RDT1 - TSF1 - PKS1 - PRF1. Additional genes were also identified between the inverted ECT1 gene and the RDT1 gene in these two species. The core melanin cluster genes in M. duriaei and A. pullulans had the same order and orientation as the consensus for the Elsinoë species (RDT1 - TSF1 - PKS1 - PRF1 - ECT1), apart from the ECT1 gene that was absent in A. pullulans. Similarly, the structure of the elsinochrome cluster was highly conserved between the different Elsinoë species. In all the species, the genes were present in the order CTB1 – OMT1 – FAD1 – MFS1 – OMT2 – ZNC1 – STB3 – FAD2 – OXR1 – FAS1. A gene encoding for a hypothetical protein, lacking a conserved domain, was found between MFS1 and OMT2 in the genomes of E. batatas and E. murrayae.