2.1. Gene expression profile in light treatments
The number of gene expression and the distribution of gene expression value in samples have certain differences; thus, sample expression value (FPKM) was divided into different intervals, the number of genes expressed in the samples of different expression intervals was calculated, and the stacking histogram was drawn for display. The gene expression distribution map of the samples is shown in Fig. 1A. The correlation analysis of all transcriptome samples showed a remarkable correlation between biological replicates (Fig. 1B). The principal component analysis (PCA) of total unique genes showed that genes in each treatment were homogenously expressed at PC 1 with 59.1% variance (Fig. 1C). The clustering method was used to calculate the distance between samples to investigate the similarity between samples. Samples that belong to the same treatment are close in distance and preferentially clustered together. The cluster diagram of sequencing samples obtained according to gene expression is shown in Fig. 1D. The genes expression profiles indicate that blue light brings a large number of differentially expressed genes (DEGs).
2.2. DEGs in maize that respond to blue light
The summary of transcriptome analysis on maize leaf samples is listed in Table S2. In the RNA-seq libraries, more than 8 G data were obtained with GC > 56.23 % and Q30 > 93.92 %. approximately 92.85 % (blue) and 92.97% (red) clean reads were mapped to the reference genome of maize. In total, about 89.25 % (blue) and 88.97 % (red) of the unique genes were uniquely mapped to the reference genome. A total 1296 genes were differentially expressed, in which 890 were up regulated and 406 were down regulated when the light was changed from red to blue (Fig. 2B and C). The DEGs were subjected to unsupervised hierarchical clustering. Genes in the same cluster may have similar biological functions for further analysis (Fig. 2A).
2.3. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses of the DEGs
GO analysis was carried out to analyze the functions of the DEGs that respond to the change in light condition. The summary of GO categories is depicted in Fig. 3A. According to the GO classification, DEGs involved in biological process occurred after light condition change. A total 308 genes were down regulated in the level 2 GO analysis. 198, 193, and 135 genes were involved in the top three GO terms under biological process, namely, “cellular process,” “metabolic process,” and “single-organism process,” respectively; 224, 223, and 154 genes were involved in the top three GO terms under cellular component, namely, “cell,” “cell part,” and “organelle,” respectively; and 173, 151 and 37 genes were involved in the top three GO terms under molecular function, namely, “binding,” “catalytic activity,” and “nucleic acid binding transcription factor activity,” respectively. In total, 667 genes were up regulated in GO analysis. For biological process, 375, 367, and 328 genes involved in “cellular process,” “metabolic process,” and “single-organism process” were up regulated, respectively. For cellular component, 500, 500, and 375 genes related to “cell,” “cell part,” and “organelle” were up regulated, respectively. For molecular function, 339, 381, and 57 genes related to “binding,” “catalytic activity,” and “transporter activity” were enriched, respectively. The results of GO analysis were used for the functional analysis of the screened genes (Fig. 3A).
KEGG analysis showed that DEGs were significantly enriched in “plant hormone signal transduction,” “MAPK signaling pathway,” “glycerophospholipid metabolism,” “starch and sucrose metabolism,” and “circadian rhythm” (Fig. 3B). The processes of “signal transduction” and “environment adaptation” of maize leaf stomata to blue light were collected. According to KEGG pathway identification, 53 genes were classified into four major enrichment classes, namely, “plant hormone signal transduction,” “MAPK signaling pathway,” “plant pathogen – interaction” and “circadian rhythm – plant,” and six less enriched classes, namely “inositol phosphate metabolism,” “phosphatidylinositol signaling system,” “endocytosis,” “ubiquitin mediated proteolysis,” “amino sugar and nucleotide sugar metabolism,” and “protein processing in endoplasmic reticulum.” The enrichment term and annotation of these 53 DEGs are listed in Table 1. Gene annotations were retrieved from National Center for Biotechnology Information (NCBI) and InterPro. Four DEGs (LOC100281912, LOC107326007, LOC107403167, and LOC100192073) were not fully annotated in the KEGG analysis.
Table 1 The list of 53 genes classified by "signal transduction" and "environmental adaptation" in KEGG.
(LOCATION: at the end of “2.3 GO and KEGG analysis of the DEGs” page 5 line 12)
NO.
|
Genes ID
|
type
|
KEGG classification
|
Gene description or functional annotation
|
1
|
bZIP107
|
up
|
Signal transduction
|
Putative bZIP transcription factor superfamily protein; Transcription factor TGA4
|
2
|
gpm254
|
up
|
Environmental adaptation
|
Transcription factor HY5; Uncharacterized
|
3
|
gpm827
|
up
|
Signal transduction
|
Putative phosphatidylinositol-4-phosphate 5-kinase family protein
|
4
|
gpm849
|
down
|
Environmental adaptation
|
Phosphatidylethanolamine-binding protein26; ZCN26
|
5
|
IDP491
|
down
|
Signal transduction
Environmental adaptation
|
Respiratory burst oxidase homolog protein B
|
6
|
LOC100192868
|
up
|
Environmental adaptation
|
LHY protein
|
7
|
LOC100217045
|
down
|
Signal transduction
|
Histidine-containing phosphotransfer protein 4
|
8
|
LOC100273422
|
up
|
Environmental adaptation
|
putative protein kinase superfamily protein
|
9
|
LOC100273598
|
up
|
Environmental adaptation
|
Two-component response regulator-like APRR9
|
10
|
LOC100274454
|
down
|
Signal transduction
|
ETHYLENE INSENSITIVE 3-like 5 protein
|
11
|
LOC100279342
|
up
|
Environmental adaptation
|
E3 ubiquitin-protein ligase COP1
|
12
|
LOC100279578
|
up
|
Signal transduction
|
PP2C11; 2C-type protein phosphatase protein
|
13
|
LOC100280135
|
up
|
Signal transduction
|
Basic endochitinase B
|
14
|
LOC100280474
|
down
|
Signal transduction
|
SAUR25 - auxin-responsive SAUR family member
|
15
|
LOC100281007
|
up
|
Signal transduction
|
SAUR14 - auxin-responsive SAUR family member
|
16
|
LOC100281015
|
up
|
Signal transduction
Environmental adaptation
|
caltractin
|
17
|
LOC100281091
|
up
|
Environmental adaptation
|
Circadian clock associated1; Putative MYB DNA-binding domain superfamily protein
|
18
|
LOC100281845
|
up
|
Signal transduction
|
SAUR25 - auxin-responsive SAUR family member
|
19
|
LOC100281912
|
up
|
Signal transduction
|
Uncharacterized
|
20
|
LOC100283359
|
down
|
Signal transduction
|
two-component response regulator ARR3
|
21
|
LOC100283515
|
down
|
Environmental adaptation
|
caltractin
|
22
|
LOC100286122
|
up
|
Environmental adaptation
|
ubiquitin ligase protein COP1
|
23
|
LOC100381449
|
up
|
Signal transduction
|
SAUR-like auxin-responsive protein family
|
24
|
LOC103627330
|
up
|
Signal transduction
|
ETHYLENE INSENSITIVE 3-like 5 protein
|
25
|
LOC103627479
|
up
|
Signal transduction
|
SAUR11-auxin-responsive SAUR family member
|
26
|
LOC103630348
|
up
|
Signal transduction
|
gibberellin receptor GID1-like
|
27
|
LOC103631918
|
down
|
Signal transduction
Environmental adaptation
|
auxin-responsive protein SAUR41
|
28
|
LOC103632809
|
down
|
Signal transduction
|
auxin-responsive protein SAUR41
|
29
|
LOC103633012
|
up
|
Environmental adaptation
|
calcium-dependent protein kinase 22
|
30
|
LOC103633058
|
down
|
Signal transduction
|
transcription factor PHYTOCHROME INTERACTING FACTOR-LIKE 13
|
31
|
LOC103634871
|
up
|
Environmental adaptation
|
transcription factor HY5
|
32
|
LOC103636459
|
down
|
Signal transduction
|
transcription factor LG2-like
|
33
|
LOC103638645
|
up
|
Environmental adaptation
|
transcription factor HY5
|
34
|
LOC103641361
|
up
|
Signal transduction
Environmental adaptation
|
transcription factor APG
|
35
|
LOC103642058
|
down
|
Signal transduction
|
Auxin-responsive protein SAUR71
|
36
|
LOC103646211
|
up
|
Signal transduction
|
auxin-induced protein X10A
|
37
|
LOC103646787
|
up
|
Signal transduction
|
probable protein phosphatase 2C 37
|
38
|
LOC103648032
|
up
|
Signal transduction
|
probable indole-3-acetic acid-amido synthetase GH3.13
|
39
|
LOC103652527
|
up
|
Signal transduction
|
mitogen-activated protein kinase kinase kinase 18
|
40
|
LOC103654889
|
down
|
Signal transduction
|
Auxin-responsive protein SAUR71
|
41
|
LOC107326007
|
up
|
Signal transduction
|
Uncharacterized
|
42
|
LOC107403167
|
up
|
Signal transduction
|
Uncharacterized
|
43
|
LOC542189
|
up
|
Environmental adaptation
|
salt-inducible putative protein serine/threonine/tyrosine kinase
|
44
|
pco069505(751)
|
up
|
Signal transduction
|
snrkll1; Serine/threonine-protein kinase SRK2C
|
45
|
pco123854
|
up
|
Signal transduction
|
Auxin response factor
|
46
|
pco153543(105)
|
down
|
Environmental adaptation
|
HSP90-2; HSP protein
|
47
|
PP2C14
|
up
|
Signal transduction
|
Uncharacterized
|
48
|
TIDP2950
|
up
|
Environmental adaptation
|
Putative calcium-binding protein CML25
|
49
|
umc1534
|
up
|
Signal transduction
|
Auxin-responsive protein
|
50
|
ZCN16
|
up
|
Environmental adaptation
|
ZCN16 protein
|
51
|
ZCN18
|
up
|
Environmental adaptation
|
ZCN18 protein
|
52
|
ZCN19
|
up
|
Environmental adaptation
|
ZCN19 protein
|
53
|
ZCN25
|
up
|
Environmental adaptation
|
ZCN25 protein
|
2.4. Protein-protein interaction (PPI) network analysis of correlated DEGs and identification of potential functional DEGs
The correlation network of 53 DEGs in “signal transduction” and “environment adaptation” (according to KEGG classification level 2) were conducted in all transcriptome samples to fully understand the expression of the genes responding to light change in maize seedling stage (table 1). The PPI network of DEGs was constructed and classified by MCODE according to the correlation of DEGs (Fig. 4A-F). Three nodes and 25 genes were obtained under the KEGG pathway, “environment adaptation” (Fig. 4A-C). 27 genes within three nodes were obtained in the “signal transduction” KEGG pathway (Fig. 4D-F). The differences in the transcription expression of these 55 genes are shown in the heat map (Fig. 5). Three repetitive genes were deleted, and 24 genes unrelated to KEGG the pathway but interacted with the 28 annotated genes in Table 1 were finally obtained. 22 of the 24 genes can be annotated from NCBI and GO (Table 2). The other two genes, namely, LOC100280217 and LOC100285849, were unidentified.
Table 2
Classification list of KEGG level 2, enrichment term and annotation of 28 genes that interacting with two classification “Signal transduction” and “Environment adaptation”.
Gene ID
|
group
|
state
|
Gene description strived from NCBI
|
dhs2
|
B
|
up
|
deoxyhypusine synthase 2
|
IDP1436
|
B
|
up
|
4-coumarate–CoA ligase-like 9
|
LOC100281746
|
B
|
up
|
glycerol 3-phosphate permease
|
LOC100285616
|
B
|
down
|
calcineurin B-like protein 4
|
LOC103626304
|
B
|
up
|
protein ACTIVITY OF BC1 COMPLEX KINASE 7
|
LOC103646055
|
B
|
down
|
cytochrome P450 87A3-like
|
LOC103648196
|
B
|
down
|
putative glutaredoxin-C14
|
pco105718
|
B
|
up
|
Serine/threonine-protein phosphatase
|
PPCK1
|
B
|
up
|
phosphoenolpyruvate carboxylase kinase 1
|
umc1774
|
B
|
down
|
starch phosphorylase1
|
LOC100280217
|
C
|
up
|
uncharacterized
|
LOC103630810
|
C
|
up
|
RNA polymerase sigma factor sigE
|
LOC103636244
|
C
|
up
|
protein SPA1-RELATED 4
|
LOC100191429
|
D
|
up
|
putative MAP kinase family protein
|
LOC100279221
|
D
|
up
|
Protein phosphatase 2C and cyclic nucleotide-binding/kinase domain-containing protein
|
LOC100285849
|
D
|
up
|
uncharacterized
|
LOC103635843
|
D
|
up
|
CBL-interacting protein kinase 19
|
pco153236
|
D
|
up
|
Glycogen synthase kinase-3 MsK-3
|
TIDP9234
|
D
|
down
|
CBL-interacting kinase
|
LOC100286321
|
E
|
up
|
copper transporter 1
|
LOC100304064
|
E
|
down
|
SAUR33
|
LOC107548106
|
E
|
down
|
SAUR33
|
dhn1
|
F
|
up
|
dehydrin DHN1
|
LOC103641704
|
F
|
up
|
zeaxanthin epoxidase, chloroplastic
|
LOC103645618
|
F
|
up
|
carotenoid 9,10(9',10')-cleavage dioxygenase
|
LOC103649550
|
F
|
up
|
S-type anion channel SLAH3
|
LOC103651474
|
F
|
down
|
S-type anion channel SLAH2
|
wc1
|
F
|
up
|
white cap1
|
Six unidentified genes, obtained from KEGG and PPI, were analyzed using the Protein Basic Local Alignment Search Tool (BLASTP) in NCBI protein database with maize and three Gramineae plants, namely, Oryza sativa japonica, Panicum hallii, and Sorghum bicolor. The BLASTP results show that the six genes belong to different gene families. LOC107326007 was similar to MKK7 (Per. Ident = 99.39%) and MKK9 (Per. Ident = 91.44%) of the MAPK family. MKK7/9 are key members of the MAPK cascade that control the connection pathway between ligands to receptors in the procession of stomatal development signal transduction. LOC107326007 was best matched with MKK7 than MKK9 of maize. Therefore, homologous BLASTP to other Gramineae plants suggested that LOC107326007 may be the MKK7 of maize (Fig. 4G). LOC100285849 was identified as the CIPK16 (98.7 with CIPK-like 1) of the CBL-interacting protein kinase family. Its signaling pathway involved in stomatal development probably started with the catalysis of gamma-phosphoryl on protein substrates and/or calcium signals triggered by light. LOC107403167 was deduced to be a member of serine/threonine protein kinases. The protein sequence was closed, but the matching degree with the OXI1 of maize is low (query cover = 58%); thus, its gene function cannot be determined from existing results. LOC100192073 was known as the PP2C14 of the protein phosphatase 2C family. BLASTP results revealed that LOC100192073 and PP2C30 were closed, but an additional NADB_Rossmann protein domain was found in putative maize PP2C30 (Fig. 6). Therefore, LOC100192073 is not PP2C30 but a member of the PP2C family. LOC100280217 with PhrB conserved domain interacted with HY5 and was identified as (6 − 4) DNA photolyase (query cover = 92%), similar to UVR3_1, but was not reported in maize. Therefore, the effect of (6 − 4) DNA photolyase on the signal pathway of stomatal development in maize is not clear. LOC100281912 was close to the TIFY 11c (Per. Ident = 98.04%) of maize and other Gramineae plants but has two differences and more than two amino acids in length. Noticeably, the BLASTP results of LOC100281912 was also close to the pnFL-2 (Per. Ident = 98.03%) of maize.
2.5. Metabolite profile and screening of differential metabolites in maize in respond to light change
A total of 6272 peaks were detected by liguid chromatography–tandem mass spectrometry LC-(MS/MS) analysis, and 3204 metabolites were annotated in MS2 database. Among them, 1383 metabolites were obtained by negative ion scanning modes and 1821 metabolites were obtained by positive ion scanning modes (Fig. 7B). All metabolites were analyzed by principal component analysis and hierarchical cluster analysis determine the influence of blue light on the change of metabolites (Figure S1). A clear separation was found between the two sampling responses to light change. The volcanic diagram was used to screen different metabolites by visualizing the P value and fold change value from the 6272 peaks (Fig. 7A). Multi-dimensional analysis and one-dimensional analysis were used to screen different metabolites between groups after the Student’s test and fold change analysis of the 3204 detected metabolites. As a result, 419 different metabolites were obtained by orthogonal partial least squares discrimination analysis (OPLS-DA) (Fig. 7B).
Hierarchical clustering and visual analysis were conducted on the expression levels of the top 50 differential metabolites according to the variable importance of projection (VIP) values to show the relationship and the expression differences of metabolites between samples more intuitively (Fig. 8A). Correlation analysis was helpful to measure the degree of correlation and the relationship between significantly different metabolites. Pearson correlation coefficient was used for correlation analysis, and Pearson product difference correlation coefficient was used to measure the linear correlation between two quantitative variables (Fig. 8B). The differential metabolite expression amount was visualized according to VIP value. Figure 8B shows the correlation of the top 50 differential metabolites.
2.6. Effect of light change on metabolic pathways and gene screening in the joint analysis of transcriptome and metabolome
The function and interaction of metabolites were constructed by using the KEGG database. The enrichment analysis of differential metabolites by analyzing metabolite-related metabolic pathways is helpful to understand the mechanism of metabolic pathway changes in different samples. The analysis of the differential accumulation of metabolites in stomatal development showed that 65metabolites accumulated differentially after 12 h of blue light change. Among them, 31 were up regulated and 34 were down regulated. During the light change from red to blue, the key metabolites affecting stomatal development, such as those under the “carotenoid biosynthesis” and “starch and sucrose metabolism” pathways, were analyzed (Fig. 9A).
The KEGG Markup Language (KGML) file in the KEGG database was used to map the network of genes and metabolites. This map will contribute to a more systematic study of the interactions between transcriptome and metabolomics. The results of KGML showed that “carotenoid biosynthesis” and “starch and sucrose metabolism” are the KEGG pathways of the transcriptome that affected the genes related to environmental adaptation and signal transduction (Fig. 9B). Two genes, LOC103641704 and umc1774, are involved in each of the two processes. LOC103641704 is zeaxanthin epoxidase of chloroplastic. Umc1774 is alpha-1,4-glucan phosphorylase called starch phosphorylase1.
2.7. Location of key genes and their expression in the network of stomatal development
The basic network signal map was drawn according to the reported stomatal signaling pathway of monocotyledons (Fig. 11). The results of KEGG pathway analysis, PPI, and KGML showed that four (LOC107326007, LOC107403167, LOC100192073, LOC100281912), two (LOC100285849, LOC100280217) and two (LOC103641704, umc1774) genes were obtained from the three methods (Fig. 10). The LOC107326007 of putative MKK7 is assumed to be located in the MAPK cascade between YDA and MPK3. LOC100285849 is close to LOC107326007 because of the interactions. LOC100280217 is located next to HY5 with the existing PPI. LOC107403167 was identified as a member of the OXI1 family from the MAPK pathway, and the interaction among LOC107403167, MKK4, and MPK3 were retrieved from the STRING database by PPI through the connection with PTI1. Considering the exclusion of additional domains, the protein PP2C30 of Oryza sativa was used for protein interaction selection. At last, a protein signaling pathway from PP2C30 to MPK3 was established. Therefore, we speculated that LOC100192073 is also located in the signal network near MPK3. TIFY 11c is involved in jasmonate signaling. Thus, we inferred that LOC100281912 interacts more strongly with MPK3 by COI1[8]. The association between LOC100281912 and MPK3 was retrieved from STRING through the reported protein of Panicum virgatum. LOC103641704 interacts with the PP2C of LOC100192073, and umc1774 connects MKK4 and MPK3 by linking with PHI1. The results showed that the potential genes work on the regulatory nodes of HY5 and MPK3.
The expression of 32 genes, including 26 genes that play important roles in regulating the stomatal development of maize and six potential functional genes obtained in this experiment, were determined by quantitative polymerase chain reaction (qPCR) (Fig. 11). The qPCR results were consistent with the results of transcription expression. The 26 genes were divided into eight modules according to function location. The expression of photoreceptor genes CRY1/2, phytochrome (Phy)A1 and PhyB1/2 did not change significantly. The two genes in the “Ligands” module, namely, STOMAGEN and EPF2, were significantly up regulated. ERL1/2 in the “Receptor” module were significantly down regulated. MPK3 was significantly down regulated in “MAPK cascades”. Only HY5 showed significant up regulation changes in the “COP1–PIF4–HY5” module in the core of the network. Among the three genes on the molecular switch, only FAMA was decreased significantly. The expression of POLAR and PAN1 in the “Polarity and Asymmetry” modules were significantly lower, whereas the expression of PAN2 was remarkably higher. The expression levels of Cyclin-A2 and CDKB1-1 in the “Cell Division” module was significantly up regulated. The expressions of other genes were not remarkably as shown in Fig. 11. The quantitative expression of the eight potential genes and their protein network locations are shown in the Fig. 11. Among them, the expression of six genes increased, that of one gene decreased, and the expression of one gene was too low to be detected.