General features of the F. filiformis genome
Prior to our study, three “genomes” assigned toF.velutipes” were publically available including the relative complete genome of the KACC42780 from Korea, L11 from China and a draft genome of the TR 19 from Japan. In this study, we sequenced the genome of wild strain of F. filiformis by small fragments library construction and did a comparative genome analysis on secondary metabolism gene clusters. The assembled genomes of the wild F. filiformis were 35.01Mbp with approximately 118-fold genome coverage. Total 10396 gene models were predicted, with an average sequence length of 1445 bp. The genome size and the number of predicted protein encoding genes are very similar to the public reference genome of F. filiformis ( = Asian F. velutipes) (Table 1). Functional annotation of the predicted genes showed that more than half of predicted genes were annotated in NR database (6383 genes) and 1972, 2582, 837, 5794 genes were annotated in database SwissProt, KEGG, COG and GO, respectively. We identified 107 cytochrome P450 family genes, 674 genes encoding secretory proteins and 287 genes in CAZy database.
There have 17293 pan-genes among four strains of F. filiformis and pan-genome core comprised 4074 genes (on average 23.5% of each genome) (Fig. 1A). The proportion (23.5%) of core genes in pan-genome analysis is similar to pan-genome analysis of 23 Corallococcus spp. [43]. But possibly, the number was lower than the actual number because that these genomes were not sequenced to completion. 3104 out of orthologous genes was annotated in KEGG database, 2722 genes have annotations in the GO database and 1055 genes are specific to the wild strain Liu355.
Functional characteristics of F. filiformis predicted genes
A KEGG enrichment analysis was performed to determine the functions of predicted genes of F. filiformis. The result showed that the highest number of genes of F. filiformis is involved in genetic information processing and translation (253 genes), followed by metabolism (carbohydrate metabolism with 243 genes). 21 genes were found participate into terpenoid and polyketides biosynthesis (Additional file 1: Figure S1).
Transcriptomic analysis and gene expression
We studied the gene expression differences across different development stages of F. filiformis including the monokaryotic (MK), dikaryotic mycelium (DK), primordium (PD) and fruiting body (FB) in transcriptomic level. Meanwhile, the DK of the cultivar strain of F. filiformis (CGMCC 5.642) also was transcriptomic sequenced. Three biological replicates were designed for each sample. An average clean data for each sample is 8.07–9.32 G. We mapped the clean reads to genomeof F. filiformis Liu 355 using the HISAT software and get relative high total mapping rate (92.63%). In addition, the expression variation between samples was the smallest between MK and DK and was the greatest between the MK and FB of F. filiformis (Additional file 1: Figure S2).
Among 10396 gene models of F. filiformis, 9931 genes models were expressed (FPKM>5) across the four different tissues (MK, DK, PD and FB) of the wild strain and the mycelium of a cultivar strain of F. filiformis. 6577 genes were commonly expressed in all tissues and 151 genes were specific expressed in the cultivar strain, and 152,116,46, 199 genes were specific expressed in MK, DK, PD and FB of the wild strain of F. filiformis, respectively (Fig. 1B). Tissue-specific and high expression transcripts in F. filiformis Liu 355 were listed in Additional file 2: Table S1. Two new genes encoding ornithine decarboxylase (involved in ployamine synthesis) are highly expressed in mycelium of cultivar strains (Novel01369, Novel01744) and genes encoding oxidoreductase also has the highest expressed level (gene 830, FPKM>1000). Genes encoding agroclavine dehydrogenase, acetylxylan exterase, beta-glucan synthesis-associated protein and arabinogalactan endo–1,4-beta-galactosidase protein are significantly high expressed in fruiting bodies (FB) of the wild strain F. filiformis with more than 20–100 fold change compared to mycelium. It was known that agroclavine dehydrogenase was involved in the biosynthesis of fungal ergot alkaloid ergovaline [44]; and beta-glucan synthesis-associated protein is probably linked the polysaccharide biosynthesis of fungal cell wall. Highly expression of these genes indicated that it probably play an important roles in fruiting bodies development and compounds enrichment.
Total 5131 genes (51.67%) were up- or down regulated at least in one stage transitions such as from mycelium to primordium (PD vs DK, 3889 genes) and from primordium to fruiting body (FB vs PD, 3308 genes) (Fig. 1C). During primordial formation stage, 1780 genes are up-regulated and most of the genes belong to oxidoreductase activity (Go:0016491), hydrolase activity (Go: 0004553) and carbohydrate metabolism (Go: 0005975), down-regulated genes mainly was enriched with transmebrane transport (Go:0055085). During fruiting body development stage, genes related to fungal-type cell wall (Go:0009277) and structural constituent of cell wall (Go:0005199) are up regulated, reflecting the dramatic changes of cell wall structure during the development process. In addition, Go term enrichment of differentially expressed genes (DEGs) between wild strain Liu355 and cultivar strain CGMCC 5.642 showed that most genes displayed a similar expression profile, but peptide biosynthetic and metabolic process (Go: 0006518; Go:0043043), amide biosynthetic process (Go: 0043604) and ribonucleoprotein complex (Go: 1901566) are up-regulated in cultivar strain of CGMCC 5.642.
KEGG enrichment analysis showed that DEGs involved in glutathione metabolism is significantly enriched in dikayontic mycelium of wild strain Liu 355 compared to cultivar strain (Fig. 2). Thirty-three DEGs, including genes encoding glutathione S-transferase, ribonucleoside-diphosphate reductase, 6-phosphogluconate dehydrogenase, cytosolic non-specific dipeptidase, gamma-glutamyltranspeptidase, glutathione peroxidase etc. participated in this pathway. Study on the glutathione metabolism of the filamentous fungus Aspergillus nidulans indicated thatglutathione itself and glutathione metabolic enzymes play a crucially important role in the germination of conidiospores, as well as remarkable contribute to the general stress tolerance of fungi [45]. High expression of genes related to glutathione metabolism in wild strain of F. filiformis implied the strain probably has strong environment adaptation and is potential better breeding resource. In addition, during the primordial and fruiting body development stage, MAPK signaling pathway (45 DEGs) and starch and sucrose metabolism (26 DEGs) is significantly enriched. Tyrosine metabolism, biosynthesis of secondary metabolites and glycosphingolipid biosynthesis are also significantly enriched in fruiting bodies formation stage.
Genes involved polysaccharide biosynthesis of F. filiformis
Polysaccharides (PSs) are an important and bioactive component of F. filiformis and other edible and medicinal mushrooms [46]. Glucan is well studied PSs and can be produced by different fungi, such as lentinan, ganoderan, grifolan, and schizophyllan [47]. Previous studies showed that F. velutipes polysaccharide (FVP) have good immunomodulation activities, anti-proliferation and improved the learning and memory impairment in rats [4, 5]. Recently, an interesting result showed that FVP even could be used as a material of edible film for packaging of food and vegetables because of their excellent features of film-forming and edibility safety [48].
The biosynthetic pathways of polysaccharide (PSs) involved synthesis of nucleotide sugar precursors, assembly of repeating monosaccharide units, and the polymerization process [42]. Although the exactly biosynthetic pathways of mushroom PSs remain unclear, polysaccharide synthesis is not a template-driven process, but instead the structures are determined primarily by the complement of polysaccharide-modifying enzymes present in any organism [49]. Phosphoglucomutase (PGM) and UDP-glucose pyrophosphorylase (UGP) are known as important enzymes in the biosynthetic pathway of nucleotide sugar precursors [46]. In addition, based on identification of intermediate compounds and activities of synthesis-related enzymes, researchers constructed a simplified biosynthetic pathway of mushroom PSs and inferred several enzymes such as glucokinase (GK), phosphoglucose isomerase (PGI), fructose–1,6-biphosphatase (FBPase) and UDP-Glc pyrophosphorylase (UGP) probably involved in the biosynthetic pathways of PSs of edible mushroom [42, 46]. These enzymes play key roles in glycolysis and gluconeogenesis pathway and starch and carbohydrate metabolism.
We identified total 80 genes related to PSs biosynthesis involved glycolysis and gluconeogenesis in KEGG pathway (KEGG map 00010) [50] at genome level, including Glucose–6-phosphate isomerase (GPI), Fructose–1,6-bisphosphatase (FBP), Mannose–6-phosphate isomerase (PMI). The expression profile of these genes showed that genes encoding Zinc-type alcohol dehydrogenase are both up-regulated in mycelium of wild strains compared to cultivar strain and in fruiting body stage compared to mycelium of wild strain of F. filiformis (Additional file 1: Figure S3 and Additional file 2: Table S2). Genes encoding glycerol 2-dehydrogenase (gene9557, gene2028), 7-bisphosphatase (gene 2929), alcohol dehydrogenase (gene7891-D2, gene 9773-D2) and aryl-alcohol dehydrogenase (gene 4871, gene 612) are up-regulated in mycelium of the wild strain. Gene encoding mannose–1-phosphate guanylyltransferase (gene 11132-D3) have highest expression level in mycelium of wild strain with more than 200-fold change compared to cultivar strain. Genes encoding glycerol 2-dehydrogenase (gene 894) and sugar phosphatase are up-regulated expression in fruiting bodies stage of wild strain.
Moreover, based on predicted metabolic enzymes related toPSs biosynthesis of Ganoderma lucidum [46], we identified 21 putative essential enzyme involved in PSs biosynthesis in F. filiformis using protein sequencing homology search method (Table 2), including glucose–6-phosphate isomerase, hexokinase, mannose–6-phosphate isomerase, UDP-glucose dehydrogenase, galactokinase and transketolase etc. Among them, genes encoding UDP-glucose pyrophosphorylase, UDP-glucose dehydrogenase and fructose-bisphosphate aldolase have relative high transcripts in all samples analyzed (FPKM>100). These candidate genes will be functional verification by experiment in future, combined the quantification of PSs in different tissue.
Carbohydrate active enzymes (CAZymes)
Secreted carbohydrate-degrading enzymes are crucial enzymes for fungal biology, both for fungal cell wall turnover and for degradation of external carbohydrate sources. They are also responsible for the biosynthesis, degradation and modification of oligo- and polysaccharides as well as of glycoconjugates [51]. We performed sequencing similarity search against the CAZy database using Blastx (E<10–5) and totally identified 287 carbohydrate-active related genes in F. filiformis genome, including 149 Glycoside-Hydrolases (GH), 48 Auxiliary Activities (AA) containing redox enzymes, 36 glycosyltransferases (GT), 29 Carbohydrate-binding module (CBM), 13 Carbohydrate Esterases (CE) and 12 Polysaccharide Lyases (PL). The number of secreted carbohydrate-degrading enzymes (mostly GHs) is nearly four times than polysaccharide biosynthetic enzyme (GT), indicating their dependence on external carbohydrate sources for growth and development as wood-rot fungi. Genes encoding beta–1,3-glucan-binding protein (GH16) are most abundant with 23 members in F. filiformis genome. Previous studies showed that GH16 enzymes are involved in the formation of the chitin-b–1,3-glucan complex, suggesting a role in the processing of fungal cell-wall polysaccharides [51]. The expression profile of these genes related to CAZys is diverse (Fig. 3). Seven genes of 23 GH16 family are up-regulated expression in DK mycelium stage compared to MK (three genes annotated as candidate glycosidase and four genes as beta-glucan synthesis-associated protein), four genes of GH16 family are up-regulated in fruiting bodies compared to primordium.
Besides GH16, seven out of the 14 GH5 members (four genes annotated as Glucan 1,3-beta-glucosidase, two genes annotated as endoglucanase and another annotated as mannan endo–1,4-beta manosidase) are differentially up-regulated in DK mycelium of the wild strain compared to MK mycelium and five genes belong to GH5 family are up-regulated in primordium stage compared to mycelium. Eight members out of 11 GH43 are up-regulated in primordium compared to mycelium.
Predicted bioactive secondary metabolism genes clusters of F. filiformis
Besides the macronutrients and micronutrients present in the F. filiformis, a large number of structurally diverse bioactive secondary metabolites, especially various novel sesquiterpenes and norsequiterpenes were identified from the mycelium and fruiting bodies of F. filiformis [9]. The genes involved in sesquiterpenes biosynthesis were described previously [9]. In this study, the gene cluster encoding terpenoid and PKS were re-examined using the genome and transcriptome resource. In total, 12 gene clusters related to terpenoid biosynthesis and two gene clusters for polyketide biosynthesis were predicted (Fig. 4, Additional file 2: Table S3). Compared with other three cultivar strains (KACC42780, TR 19 and L11 with genome sequencing), the numbers of gene clusters involved in terpene, PKS, NRPS and Siderophore biosynthesis are different and the gene number related to terpene synthesis is higher in wild strain Liu355 (119 genes) than cultivar strain L11 (81 genes) in our study (Table 3).
119 genes of 12 terpenoid clusters divided into 10 clades according to their expression level (Additional file 1: Figure S4). Most genes in clade II are up-regulated in primodium stage of wild strain Liu 355 compared to mycelium stage, including encoding 4,5-DOPA dioxygenase extradiol-like protein (gene 3103) and squalene synthase (gene 3428) involved in biosynthesis of squalene, a precursor of terpenoid compounds. Genes in VIII-X clades are significantly differently expressed in mycelium of wild strain Liu 355 compared to cultivar strain and in the fruiting body stage compared with mycelium of wild strain Liu 355, including key enzymes involved in terpenoid biosynthesis such as protoilludene synthase, candidate peroxisomal acyl-coenzyme A oxidase, L-amino acid amidase and cytochorme P450. It probably explained more divers bioactive compound in wild stains than cultivar strain in previous study.
Putative genes for sesquiterpenoid biosynthesis in F. filiformis
Sesquiterpene compounds are mainly bioactive secondary metabolites in Flammulina. Chemistry investigation of six strains of F. filiformis in previous reports revealed that the wild strain Liu355 contained many new sesquiterpens with various skeletons, including cuparene-type and sterpurane type sesquiterpenes [9]. However, little is known about sesquiterpene synthases (STSs) in Basidiomycota until the mostly recent years, although these fungi can produce divers bioactive sesquiterpenes. Thanks to the identification of sesquiterpene synthases from Coprinus cinereus (Cop1–6) and Omphalotus olearius, it provided a useful guidance for subsequent development of in silico approaches for the directed discovery of new sesquiterpene synthases and their associated biosynthetic genes [52]. In our study, we performed a genome-scale homologous search with sesquiterpene synthases of O. olearius, C. cinereus and Hericium erinaceus, 12 homologue sequences with considerable similarity (e-value < 10–5) to the known biochemically characterized sesquiterpene synthases were identified in the genome of F. filiformis. The phylogenetic analysis showed that these genes consist of four clades (Additional file 1: Figure S5, Additional file 2: Table S4). Eight genes out of 12 genes related to sesquiterpene biosynthesis have closely phylogenetic relationship with genes identified from cultivar Flammulina in the previous study [9]. 12 STSs genes of F. filiformis include five genes encoding delta (6)-protoilludene synthase (gene1663-D2, gene9115, gene2784 and gene9115-D2, gene 6325-D2), two genes encoding trichodiene synthase (gene1140, gene 2254), two genes encoding alpha-muurolene synthase (gene1358-D2, gene1358), and one gene encoding glucose–6-phosphate isomerase. Among them, genes encoding delta (6)-protoilludene synthase are up-regulated in fruiting body stage compared to mycelium and also up regulated in DK mycelium compared to MK of the wild strain Liu355 (Table 4).
Putative genes for polyketides biosynthesis in F. filiformis
Polyketides are major group of secondary metabolites isolated from bacteria and fungi and fungi in particular have been important sources of natural product polyketide pharmaceuticals. Structural complexity of these polyketides makes them interesting and useful bioactive compounds and these same features also make them difficult and expensive to prepare and scale-up using synthetic methods [53]. In past, understanding of the biosynthesis and regulation of polyketide in fungi are extremely difficult because of lacking of established genetic tools. Recent researches illustrated that Saccharomyces cerevisiae have potential as a tool for mining, studying and engineering fungal polyketide synthases during to their many advantages benefits such as unicellular organism well suited for large scale fermentation, limited native secondary metabolism and a number of developed genetic tools for protein expression and pathway constriction [53].
The diverse structures of polyketides are biosynthesized from short-chain carboxylic acid units by polyketide synthases (PKSs), PKSs have been classified into type I, type II and type III based on their product profiles and catalytic domain architecture [54]. By gene clusters prediction using the antiSMASH, we found 30 genes of two gene clusters responsible for Type I PKS. They mainly located in a single scaffold 24 and 78 in F. filiformis genome, respectively (Fig. 4, Additional file 2: Table S3). The two gene clusters both included core genes encoding polyketide synthase (gene 8217 and gene 1373). Genes located scafflold 78 are most up-regulated in wild strain mycelium compared with cultivar strain, including erythronolide synthase (gene 1374), polyketide synthase (gene1373), benzoate 4-monooxygenase (gene 1372), indicating that polypeptide compounds are probably abundant in mycelium of this mushroom and especially in wild strain.
Cytochrome P450s in F. filiformis genome
It is reported that a more divergent cytochrome P450 oxidase could be involved in secondary biosynthesis [55]. So, we searched the genome of the F. filiformis for proteins with a P450 conserved domain using NCBI’s CDD tool and BLASTp [39]. We yielded 107 genes in cytochrome P450 family, including nine putative trichodiene oxygenase, 31 O-methylsterigmatocystin oxidoreductase, five benzoate 4-monooxygenase, two linoleate 10R-lipoxygenase, two ent-kaurene oxidase, lanosterol 14-alpha and flavonoid hydroxylase and other candidate cytochrome P450. 102 genes out of them have diverse expression profile across different tissue of F. filiformis (Additional file 1: Figure S4). 26 CYP450 genes are up-regulated in mycelium of wild strain compared to cultivar strain and the gene 54280-D3 have most high expression level with more than 500 fold change. 21 CYP450 genes are up-regulated in fruiting bodies stage compared to mycelium and gene encoding benzoate 4-monooxygenase have highest transcript level with 15 fold change. In primodim formation stage, gene encoding docosahexaenoic acid omega-hydroxylase is highest differential expressed.
Heat shock protein correspond to temperature change in F. filiformis
Besides with unique compounds, the wild strain of F. filiformis Liu 355 can produce the fruiting body at relative high temperature than current commercial strains, implying it is a potential excellent breeding resource. A temperature downshift (cold stimulation) is considered to be one of the most important and essential environmental factors for the fruiting initiation and fruiting body formation of F. velutipes [34]. In generally, mycelia of F. velutipes can grow vegetative at 20–24 °C and fruiting at an optimum temperature for 12–15 °C [32]. In our study, the wild strain of Liu 355 can grow fruiting bodies at 18 °C at the laboratory. Therefore, it is potential excellent genetic material for F. velutipes breeding. Proteomic sequencing revealed that the expression of proteins related to energy metabolism (e. g. catalase, glucose–6-phosphate isomerase, trehalase and beta-glucosidase), amino acids biosynthesis (e. g. argininosuccinate synthase) and signal pathway (e.g. BAR adaptor protein) are dramatically increased after long-term cold stress [32, 34]. In addition, histidine kinases, response regulators, and sometimes histidine-containing phosphotransfer proteins were also reported play crucial roles in response to cold stress in cyanobacterium Synechocystis sp. and Arabidopsis [56, 57]. Transcriptomic sequencing revealed that histidine kinase and proteins involved in MAPK pathway (e.g. STE protein kinase, MAPK kinase kinase) and Ca2+ signal transduction pathway (calcium-dependent protein kinase) were differentially expressed in F. velutipes [31].
The heat shock protein (HSP) family was known that it positively correlated with the organism thermotolerance [58]. In this study, 28 genes annotated as HSP were identified in F. filiformis genome (Fig. 5). Among them, six genes are significantly up-regulated in wild strain Liu 355 compared to cultivar strain and encoding HSP12, HSPC4, HSP104, LHS1 and GRP78, respectively. HSP12 is part of a group of small heat shock proteins (HSP) that function as chaperone proteins and are ubiquitously involved in nascent protein folding by protecting proteins from misfolding and partially characterized as a stress response and expression of HSP12 protein was observed in response to cold stress [59]. In S. cerevisiae and C. albicans, HSP104 in association with HSP40 and HSP70 helps in reactivation and aggregation of denatured protein, by providing disaggregated protein to HSP40 and HSP70 as a substrate [60]. Expression of HSP104 and HSP70 is regulated by Hsf (heat-shock factor) interaction, which can be stimulated by heat stress in yeast [58]. However, the exactly molecular function of HSP in the high-temperature-tolerance of wild F. filiformis and adaptive mechanisms for relative high temperature need further study.