Biomass heterosis exist in contrasting hybrids at the seedling stage
Three contrasting hybrids having high (H), medium (M), and low (L) parent heterosis in yield traits together with their four inbred parents (denoted as A, B, C, and D) were used to investigate the early biomass vigor in the field. We observed that high and medium hybrids did not show statistically significant difference in fresh biomass (g/plant) and dry biomass (g/plant) relative to their mid-parent values (MPV) at 10 days after emergence of seedling (DAE) and 20 DAE (Fig. S1 and Fig. S2). However, high hybrid showed highly significant difference in fresh biomass compared with its MPV at 30 DAE (Fig. 1). According to the results, low hybrid showed significant difference in fresh biomass compared with its MPV at 10 DAE (Fig. S1). Similar results were obtained at 20 DAE (Fig. S2) and 30 DAE (Fig. 1). We observed that high hybrid showed more biomass accumulation and low hybrid showed decreased biomass accumulation compared to their parents. However, medium hybrid did not show any difference in biomass accumulation compared with both parents. These results predict that biomass heterosis is established in hybrids compared with their parents during early seedling growth.
Transcriptome sequencing, mapping and global expressions in root and leaf
Three contrasting F1 hybrids and their four inbred parents were used to perform RNA sequencing in this study. Root and leaf tissues of the same plant with three biological replicates were used to build 42 cDNA libraries for RNA Illumina sequencing. The brief detail of sequencing, mapping and gene expressions of the individual library is presented in additional file 1. The overall sequencing means of valid reads was 98.8 % with a value of 90% exon region mapping. The value of Q20% was 99.6% in our sequencing results. In the root of A, B, C, D, H, M, and L genotypes, mean of valid reads was 51128076, 54107099, 50767484, 52834574, 43141279, 46272200, and 45335083 respectively (Table 1). The mean of valid reads in the leaf of A, B, C, D, H, M, and L genotypes was 50223934, 49707365, 42235064, 44050304, 45690950, 43028428, and 45513687 respectively. On an average, more than 89% valid reads were mapped to G. hirsutum TM-1 reference genome in this sequencing. The mean of multi-mapped and splice reads was more than 28% and 31% respectively. In this study, the root tissue of each genotype showed higher number of total expressed genes compared to leaf. We also found that the total numbers of expressed genes in both tissues of F1 hybrids were higher than the parents (Additional file 1).
Differentially expressed genes in root and leaf transcriptome
Here, we analyzed the dynamic changes of root and leaf transcriptome in all F1 hybrids and their inbred parents. The expression levels significantly different at P ≤ 0.05 and with log2 (fold change) >1 or log2 (fold change) < -1 designated as differentially expressed genes in hybrids parents triads. The comparative analysis of DEGs in the root of high hybrid (H) compared with both parents revealed that the comparison of H with female parent (A) showed 2828 total number of DEGs (Fig. 2a: Additional file 2). This hybrid compared with male parent (B) showed 2917 total number of DEGs. According to results, the comparison of both parents ARvsBR showed 1154 total number of DEGs. In the leaf transcriptome of H, the comparison of ALvsHL showed 3807 total number of DEGs (Fig. 2b: Additional file 3). The comparison of BLvsHL showed 2797 DEGs. Furthermore, the comparison of both parents ALvsBL showed 1013 total number of DEGs.
The result of differentially expressed genes in root transcriptome of medium hybrid (M) compared with both parents revealed that the comparison of M with female parent (A) had 2577 total number of DEGs (Fig. 2c: Additional file 4). Total number of DEGs for the comparison of M and its male parent (C) was 2144. The comparison of ARvsCR showed 1067 total number of DEGs. The comparative analysis of DEGs in leaf transcriptome of medium hybrid as compared with both parents found that the comparison of ALvsML had 3910 total number of DEGs (Fig. 2d: Additional file 5). The comparison of CLvsML showed 2170 total number of DEGs. Furthermore, the combination of both parents ALvsCL had a total of 1586 DEGs. The comparative analysis of DEGs in root transcriptome of low hybrid (L) compared with parents determined that total number of differentially expressed genes between L and its female parent (A) was 3580 (Fig. 2e: Additional file 6). The combination of L and its male parent (D) showed 3215 total number of DEGs. Further, the comparison of both parents ARvsDR showed 2696 total number of DEGs. The comparative analysis results of DEGs in the leaf transcriptome of low hybrid showed that the comparison of ALvsLL had 5827 total number of DEGs (Fig. 2f: Additional file 7). While the comparison of DLvsLL showed 2623 total number of DEGs. Furthermore, the comparison of both parents had 2143 total number of DEG. The result of comparative analysis of DEGs between hybrids and parents revealed that hybrids had different genomic constituent relative to their parents. However, comparison of both parents indicated that they have few genetic differences.
F1 hybrids exhibited overdominant gene expressions
Allopolyploids have been found to exhibit expression level dominance [35, 40]. So, to address the magnitude and directionality of expressions in interspecific F1 upland cotton hybrids, DEGs of root and leaf transcriptome were divided into 12 possible groups as described by Rapp et al., [41]. Genes in groups 1-2 showed additive expression. The expression of genes in groups 3-4 was termed as male expression level dominance (M-ELD), wherein the expression of genes in groups 5-6 was named as female expression level dominance (F-ELD). The expression of genes in groups 7-9 called downregulated overdominance, wherein the expression of genes in groups 10-12 was named as upregulated overdominance (Fig. 3a). The result of expression patterns analysis in high hybrid (H) in both tissues detected that limited number of genes was additive expressions. Male and female parent like expression groups also had few genes. However, it was found that the overdominant upregulated (group12) and downregulated (group 8) groups had the highest number of genes in both tissues (Fig. 3b: Additional file 8). The analysis results of medium hybrid (M) also dissected that the overdominant up and downregulated groups had the highest number of genes in both tissues (Fig 3c: Additional file 9). The analysis result of low hybrid (L) was also similar to H and M hybrid (Fig. 3d: additional file 10). However, groups of parent-like expression also had many genes in L hybrid as compared to H and M hybrids. The result of expression patterns analysis directed that overdominance at the gene expression level contributes to early biomass vigor in hybrid cotton.
Functional annotations of overdominant genes
To understand the functions of genes with overdominant expressions in biomass heterosis, GO and KEGG enrichment analysis was implemented in root and leaf of hybrids. GO enrichment analysis (p-value < 0.01) of overdominant genes in root of high hybrid (H) relative to its parents revealed that most of the upregulated genes were involved in functions related to ‘plasma membrane, ‘regulation of transcription/DNA-template, and ‘extracellular region. Conversely, downregulated genes were enriched in plasmodesma, ‘integral component of plasma membrane, and ‘vacuole (Additional file 11). KEGG pathway enrichment analysis (p-value < 0.05) of overdominant genes showed that most of the upregulated genes were enriched in ‘porphyrin and chlorophyll metabolism, ‘phenylpropanoid biosynthesis, and ‘oxidative phosphorylation pathways (Fig. 4a: Additional file 12). But, the majority of downregulated genes were enriched in ‘starch and sucrose metabolism, ‘phenylpropanoid biosynthesis, and ‘circadian rhythm plant (Fig. 4b: Additional file 12). In the leaf of H hybrid, most of the overdominant upregulated genes were involved in functions of ‘plasma membrane, ‘protein serine/threonine kinase activity, and ‘plasmodesma, while downregulated genes were involved in cellular functions of ‘chloroplast, ‘membrane, and ‘extracellular region (Additional file 13). Enriched pathways of overdominant genes found that upregulated genes were enriched in ‘plant-pathogen interaction, ‘plant hormone signal transduction, and ‘circadian rhythm plant (Fig. 4a: Additional file 14). In contrast, significant pathways for downregulated genes were ‘plant hormone signal transduction, ‘carbon metabolism, and ‘circadian rhythm plant (Fig 4b: Additional file 14).
The results of GO enrichment analysis of overdominant genes in the root of medium hybrid (M) compared with its parents revealed that upregulated genes were enriched in ‘plasma membrane, ‘extracellular region and ‘oxidation-reduction process. In contrast, downregulated genes were performed following functions e.g. ‘integral component of the plasma membrane, and ‘response to salt stress (Additional file 15). Pathway enrichment analysis of overdominant genes found that upregulated genes were enriched in ‘phenylpropanoid biosynthesis, amino sugar, and nucleotide sugar metabolism, and ‘porphyrin and chlorophyll metabolism (Fig 4a: Additional file 16). For downregulated genes, enriched pathways were ‘circadian rhythm plant, ‘glycosaminoglycan degradation, and ‘ascorbate and aldarate metabolism (Fig. 4b: Additional file 16). GO enrichment analysis of overdominant genes in leaf of M hybrid revealed that most of upregulated genes performed functions related to ‘chloroplast, ‘chloroplast stroma, and ‘plastid. However, downregulated genes involved in the following functions such as ‘chloroplast, ‘protein binding, and ‘ATP binding (Additional file 17). Pathway enrichment analysis of overdominant genes revealed that most of upregulated genes were found in ‘carbon metabolism, ‘phagosome, and ‘circadian rhythm plant (Fig. 4a: Additional file 18). On the other hand, downregulated genes were enriched in ‘planted hormone signal transduction, ‘endocytosis, and ‘carbon metabolism (Fig. 4b: Additional file 18).
GO enrichment analysis of overdominant genes in the root of low hybrid (L) compared with its parents found following enriched terms e.g. ‘plasma membrane, ‘extracellular region, and ‘oxidation-reduction process. The functions of downregulated genes were enriched for ‘chloroplast, ‘regulation of transcription/ DNA-template, and ‘transcription factor activity/sequence-specific DNA binding (Additional file 19). Enriched pathways of overdominant upregulated genes were ‘phenylpropanoid biosynthesis, ‘porphyrin and chlorophyll metabolism, and ‘pentose and glucuronate interconversions (Fig. 4a: Additional file 20). On the other hand, most of downregulated genes were enriched in ‘circadian rhythm plant, ‘carbon metabolism, and ‘starch and sucrose metabolism (Fig. 4b). In the leaf of L hybrid, GO enrichment analysis of overdominant genes showed that the functions of upregulated genes were enriched in ‘chloroplast, ‘protein folding, and ‘embryo development ending in seed. In contrast, majority of the downregulated genes were involved in cellular function of chloroplast, ‘membrane, and ‘chloroplast stroma (Additional file 21). The pathway enrichment analysis of overdominant genes uncovered that majority of upregulated genes were found in ‘protein processing in the endoplasmic reticulum, ‘spliceosome, and ‘circadian rhythm plant (Fig. 4a: Additional file 22). For downregulated genes, enriched pathways were ‘carbon metabolism, ‘plant hormone signal transduction, and ‘circadian rhythm plant (Fig. 4b: Additional file 22).
The comparison of overdominance genes between contrasting hybrids
The results of KEGG revealed that most of the overdominant genes of high, medium and low hybrids relative to their parents were enriched in similar pathways. We performed comparison analysis between high, medium and low hybrids to see unique and common overdominant expressed genes in the above pathways. The results showed that 44 genes in leaf and 29 genes in root were common in comparsion of all hybrids (Fig. S3: Additional file 23). On the other hand, many genes were also overlapped between comparisons of two hybrids. The reason behind these results might be the common female parent in hybrids. The expression level of genes that showed overdominant expression in all hybrids relative to their inbred parents is presented in Fig. 5. We found that root genes were belonging to circadian rhythm, phenylpropanoid biosynthesis, porphyrin, and chlorophyll metabolism, while leaf genes were enriched in circadian rhythm, plant hormone signal transduction, and carbon metabolism. Majority of these genes were involved in functions related to DNA binding, oxidation-reduction process, heme binding, and response to oxidative stress (Fig. S4). We presumed that phenomenal changes in these pathways genes may be altered biomass vigor in hybrids of cotton.
The circadian rhythm, metabolic processes, and biomass vigor
Through compression analysis between hybrids, we found that circadian rhythm plant contained many overdominant genes in all hybrids in root and leaf. Genes in this pathway were not same but tissue-specific except for three (Gh_A12G1061, Gh_A12G1062, and Gh_D12G1185). In this pathway, we found that six genes (Gh_A11G0926, Gh_A12G1061, Gh_A12G1062, Gh_D12G1185, Gh_D12G1184, and Gh_D11G1068) related to MYB domain transcription factor, encoding LHY protein showed downregulation in root of hybrids. In contrast, five genes (Gh_A12G1061, Gh_A12G1062, Gh_D12G1185, Gh_A09G1504, and Gh_D09G1515) encoding LHY protein showed downregulated expression in leaf of hybrids. the results indicated that four genes (Gh_A08G0451, Gh_D01G0200, Gh_D07G0867, and Gh_D11G1518) associated with transcript factor CO-like showed upregulation in leaf of hybrids. Genes (Gh_A05G0944 and Gh_D05G1029) named as CIA2 (CHLOROPLAST IMPORT APPARATUS 2) associated with PSEUDO-RESPONSE REGULATOR9 (PRR9) also showed similar results in leaf of hybrids. In addition to circadian rhythm, several genes (Gh_A03G0944, Gh_A09G1415, Gh_A11G1859, Gh_A12G1915, and Gh_D02G1327) in phenylpropanoid biosynthesis related to peroxidase superfamily protein and many genes from porphyrin and chlorophyll metabolism showed upregulated expressions in root of hybrids. Furthermore, many genes of plant hormone signal transduction pathways linked to AUX/IAA transcriptional regulator family protein and genes from carbon metabolism pathway also showed downregulated expressions in leaf of hybrids. To be concise, all these results suggest that genes in circadian rhythm together with other metabolic process performed overdominance that might change root and leaf growth that ultimately lead to altered biomass vigor in hybrid cotton.
qRT-PCR analysis
Twelve overdominant genes from circadian rhythm plant were selected to validate RNA-seq data by real-time qRT- PCR analysis. All selected genes showed overdominance expression in high, medium, and low hybrids relative to their parents in RNA-seq results. We selected five genes (Gh_A12G1061, Gh_A12G1062, Gh_D12G1184, Gh_D11G1068, and Gh_D13G1939) from root and one gene (Gh_A09G1504) from leaf with downregulated expressions in hybrids, While six genes (Gh_A08G0451, Gh_D01G0200, Gh_D11G1518, Gh_D05G1029 Gh_D07G0867, Gh_A05G0944, and Gh_D12G1525) were selected from leaf with upregulated expressions in hybrids. It was found that the expression trends as calculated by qRT-PCR were consistent with RNA-seq data, thus confirming the accuracy of the RNA-seq results in this study (Fig. 6).