Distribution and synteny analysis of NPF genes in four Brassica species
Based on BLASTP using 53 Arabidopsis NPF protein and phylogenetic analysis (Additional file 1: Figure S1), a total of 169 NPF genes encoding 186 proteins were identified in B. napus genome. To investigate the evolution of BnaNPF genes, the synteny of NPF gene pairs between B. rapa and Arabidopsis genome, B. oleracea and Arabidopsis genome, B. napus and B. rapa genome, B. napus and B. oleracea was performed to further understand the expansion mechanism of NPF genes in B. napus (Fig. 1). The result showed that most of the BnaNPF genes exhibited evolutionary and syntenic relationships with NPF genes in Arabidopsis, B. rapa, and B. olereaca (Additional file 2: Figure S2), suggesting the contribution to the evolution of BnaNPF gene family. Furthermore, Ka, Ks and Ka/Ks of orthologous pairs on BnaNPF and AtNPF genes were calculated to test the evolutionary selection pressure (Additional file 6: Table S2). The majority of orthologous BnaNPF gene pairs had Ka/Ks < 1, which suggested that most of BnaNPF genes have undergone purifying selection to preserve gene function. The mean value of NPF3 (Ka/Ks = 0.10), NPF6 (Ka/Ks = 0.11) and NPF7 (Ka/Ks = 0.13) gene pairs was lower than other subfamilies, showing that these three subfamilies may have suffered robust purifying selective pressure during evolution. However, some of BnaNPF genes had Ka/Ks > 1, including BnaA01NPF2.8, BnaC01NPF2.9, BnaA06NPF2.10, BnaC03NPF2.12 and BnaC01NPF2.25 in NPF2 subfamily, BnaA09NPF4.15 in NPF4 subfamily, BnaA05NPF5.1, BnaC04NPF5.3, BnaC03NPF5.7, BnaA03NPF5.8, BnaA02NPF5.15, BnaA02NPF5.40, BnaC02NPF5.41 and BnaC06NPF5.42 in NPF5 subfamily, and BnaC09NPF8.19 in NPF8 subfamily, suggesting that these BnaNPF genes are subjected positive selection during the evolution from Arabidopsis to rapeseed.
The distribution and synteny of NPF genes were marked on the chromosomes of B. rapa, B. oleracea and B. napus (Fig. 1b). NPF genes are unevenly distributed on every chromosome, and often organized as clusters in the genome of B. rapa, B. oleracea and B. napus. In B. napus genome, the chromosomes A09 and C06 possess the most BnaNPF genes (15 respectively), and A08 possess only 4 BnaNPF genes which were clustered on the chromosome terminal. NPF genes distributed on B. rapa and B. oleracea genome keep good collinearity with NPF genes on A and C sub-genome of B. napus, respectively. B. rapa genome contains 82 NPF genes, and the corresponding A sub-genome of B. napus contains only 76 NPF genes; B. oleracea genome contains 70 NPF genes, and the corresponding C sub-genome of B. napus contains 93 NPF genes, which indicates parts of NPF genes from B. rapa genome were lost or recombined to the C genome of B. napus in evolution process. For example, BraNPF5.21 on the terminal of the chromosome BraA05 was replicated and recombined to BnaC05 chromosome (BnaC05NPF5.37 and BnaC05NPF5.38). According to the synteny analysis, 97 BnaNPF genes evolved from B. rapa genome, and 72 BnaNPF genes from B. oleracea genome. Furthermore, 73 BraNPF genes retained synteny with NPF genes in B. napus genome, including 55 BraNPF genes with 1:1 synteny relationship, and 16 BraNPF genes with a 1:2 relationship (duplication in B. napus genome) and even two BraNPF genes with more 1:2 relationship (1:3 and 1:5) (Table 1 and Additional file 7: Table S3). Nine BraNPF orthologs were not identified in B. napus genome (1:0 relationship) and two BnaNPF orthologs were not identified in B. rapa genome (0:1 relationship), which suggesting loss of the gene during evolutionary. Sixty-one BolNPF genes retained synteny with NPF genes in B. napus genome, including 54 with a 1:1 relationship and 7 with a 1:2 relationship. Twenty-six and five translocations were identified for NPF genes when comparing B. napus genome with B. rapa and B. oleracea genome, respectively. Besides, because the genomic data of B. napus has not yet been fully mapped to the chromosome, the chromosomal location and evolution of three BnaNPF genes (BnaNPF2.26, BnaNPF2.29 and BnaNPF2.30) is still unclear.
Table 1. The synteny relationship of NPF genes between B. rapa and B. napus, and between B. oleracea and B. napus
Ratioa
|
0:1
|
1:0
|
1:1
|
1:2
|
1:3
|
1:5
|
B. rape
|
2
|
9
|
55
|
16
|
1
|
1
|
B.oleracea
|
4
|
9
|
54
|
7
|
|
|
a Orthologous NPF gene ratio by comparing B. rapa and B. oleracea with B. napus genome. 0:1 represents NPF orthologs lost in B. rapa or B. oleracea genome, 1:0 represent lost in B. napus genome, 1:2, 1:3 and 1:5 represent different replication multiples in B. napus genome.
B. napus genome possessed the most NPF genes
Using the sequences of 53 Arabidopsis NPF family protein as queries to perform BLASTp and the information from the article Leran et al (2014) reported [1], NPF proteins from sequenced 36 species were retrieved including B. rapa, B. oleracea and B. napus. The information of genome size and number of NPF genes were shown in Table 2. The genome sizes of these 34 plant species ranged from 127.42 Mb (Arabidopsis) to 2,271.03 Mb (Zea mays), and the NPF gene number was varied from 23 (Physcomitrella patens) to 169 (B. napus). The B. napus genome possessed the most NPF genes (167) through its genome size smaller than that of Malus domestica and Zea mays, which indicated the copy number variations of B. napusNPF genes might be attributed to their requirement for (un)specific substrates as a result of evolutionary selection, such as some NPF2 members for transporting glucosinolates [8]. All NPF genes were grouped into eight clades with known 53 NPF members from Arabidopsis. Most plants have more NPF2, NPF4 and NPF5 subfamily members. NPF1 and NPF2 subfamilies are absent from the two lower plants Physcomitrella patens and Selaginella moellendorffii. In addition, based on BnPIR database that provides more detailed annotation for B. napus genes, 11 BnaNFP genes were identified to encode two proteins derived from two different transcripts, and three BnaNFP genes encode three proteins translated from three different transcripts. Therefore, a total of 186 BnaNPF proteins were identified in B. napus including 17 proteins from different transcripts. Based on the phylogenetic tree (Additional file 1: Figure S1), the evolutionary relationship of NPF proteins between B. napus and Arabidopsis was easy to compare and provided a good guide for studying the function of NPF genes in B. napus. According to known ArabidopsisNPF proteins subfamily information and phylogenetic tree branch, eight unambiguous clades that represented eight B. napusNPF subfamilies were identified. The BnaNPF5 subfamily was the largest because of a larger number of Arabidopsis NPF5 members and possessed 63 members (more than a third of the total number of BnaNPF genes), followed by NPF2 (30), NPF8 (19), NPF4 (16), NPF6 (15), NPF7 (10), NPF1 (10), and NPF3 (6). Additionally, BnaA05NPF5.1 and BnaC04NPF5.2, located in the same branch with AtNPF5.1, were grouped into NPF2 clade, which suggested that the two NPF genes might be more closely related to B. napus NPF2 in evolution. Similarly, BnaA02NPF6.14 and BnaC02NPF6.15 seemed to be more closely related to NPF7. Most of the phylogenetic branches within the same clade showed high bootstrap value (> 0.80), which reflected the low genetic differentiation of Arabidopsis and B. napus NPF genes within the subfamily.
Table 2. Copy number variations (CNVs) of the NPF genes in 36 plant species
Organism Name
|
NPF1
|
NPF2
|
NPF3
|
NPF4
|
NPF5
|
NPF6
|
NPF7
|
NPF8
|
Total
|
Genome Size (Mb)
|
Arabidopsis lyrata (D)
|
3
|
14
|
1
|
9
|
17
|
4
|
3
|
5
|
56
|
202.97
|
Arabidopsisthaliana (D)
|
3
|
14
|
1
|
7
|
16
|
4
|
3
|
5
|
53
|
127.42
|
Aquilaria agallochum (D)
|
6
|
7
|
3
|
12
|
13
|
5
|
3
|
6
|
55
|
726.71
|
Brachypodium distachyon (M)
|
2
|
6
|
4
|
13
|
21
|
8
|
11
|
17
|
82
|
271.3
|
Brassica rapa (D)
|
4
|
23
|
3
|
9
|
23
|
7
|
5
|
8
|
82
|
401.93
|
Brassica oleracea (D)
|
4
|
15
|
2
|
8
|
26
|
6
|
5
|
4
|
70
|
554.98
|
Brassica napus (D)
|
10
|
30
|
6
|
16
|
63
|
15
|
10
|
19
|
169
|
976.19
|
Carica papaya (D)
|
4
|
14
|
3
|
8
|
12
|
8
|
6
|
4
|
59
|
370.42
|
Capsella rubella (D)
|
3
|
12
|
1
|
6
|
17
|
4
|
3
|
5
|
51
|
133.06
|
Citrus clementina (D)
|
9
|
7
|
3
|
9
|
17
|
6
|
4
|
4
|
59
|
301.37
|
Citrus sinensis (D)
|
8
|
7
|
3
|
10
|
17
|
6
|
4
|
4
|
59
|
319.23
|
Cuscuta campestris (D)
|
4
|
9
|
3
|
8
|
19
|
6
|
5
|
5
|
59
|
476.79
|
Eucalyptus grandis (D)
|
6
|
12
|
4
|
11
|
19
|
6
|
4
|
6
|
68
|
691.43
|
Fragaria vesca (D)
|
0
|
13
|
2
|
8
|
23
|
3
|
5
|
6
|
60
|
214.37
|
Glycine max (D)
|
13
|
14
|
6
|
22
|
41
|
11
|
14
|
13
|
134
|
927.71
|
Gossypium raimondii (D)
|
7
|
10
|
4
|
14
|
14
|
11
|
7
|
8
|
75
|
773.77
|
Linum usitatissimum (D)
|
12
|
7
|
4
|
14
|
25
|
9
|
11
|
10
|
92
|
316.17
|
Malus domestica (D)
|
2
|
34
|
4
|
21
|
44
|
17
|
8
|
9
|
139
|
1,874.77
|
Manihot esculenta (D)
|
7
|
12
|
6
|
10
|
23
|
7
|
5
|
5
|
75
|
292.1
|
Medicago truncatula (D)
|
8
|
12
|
3
|
14
|
25
|
8
|
9
|
1
|
80
|
412.92
|
Oryza sativa (M)
|
3
|
6
|
5
|
12
|
29
|
6
|
11
|
21
|
93
|
389.75
|
Phaseolus vulgaris (D)
|
8
|
11
|
3
|
12
|
22
|
5
|
7
|
6
|
74
|
521.08
|
Populus trichocarpa (D)
|
15
|
9
|
5
|
12
|
26
|
6
|
5
|
7
|
85
|
434.29
|
Prunus persica (D)
|
2
|
15
|
1
|
8
|
16
|
5
|
5
|
5
|
57
|
214.22
|
Ricinus communis (D)
|
5
|
20
|
3
|
7
|
13
|
5
|
4
|
3
|
60
|
350.62
|
Setaria italica (M)
|
4
|
11
|
8
|
16
|
19
|
7
|
12
|
21
|
98
|
405.87
|
Solanum tuberosum (D)
|
17
|
10
|
2
|
15
|
8
|
9
|
4
|
8
|
73
|
772.25
|
Solanum lycopersicum (D)
|
19
|
16
|
2
|
12
|
11
|
12
|
7
|
11
|
90
|
760.07
|
Sorghum bicolor (M)
|
4
|
8
|
7
|
16
|
22
|
6
|
9
|
19
|
91
|
709.35
|
Theobroma cacao (D)
|
4
|
14
|
3
|
10
|
19
|
7
|
4
|
5
|
66
|
345.99
|
Vitis vinifera (D)
|
4
|
7
|
2
|
6
|
21
|
5
|
4
|
3
|
52
|
486.2
|
Zea mays (M)
|
4
|
4
|
6
|
12
|
17
|
8
|
12
|
16
|
79
|
2,271.03
|
Amborella trichopoda (D)
|
1
|
5
|
2
|
7
|
15
|
4
|
3
|
7
|
45
|
706.60
|
Physcomitrella patens (L)
|
0
|
0
|
1
|
1
|
8
|
6
|
3
|
4
|
23
|
472.081
|
Selaginella moellendorffii (L)
|
0
|
0
|
4
|
4
|
11
|
6
|
5
|
16
|
46
|
212.315
|
Selaginella moellendorffii (L)
|
0
|
0
|
4
|
4
|
11
|
6
|
5
|
16
|
46
|
212.315
|
D dicots, M monocots, L lower plants
BnaNPF gene owning PTR2 functional domain and might be regulated by multiple phytohormones
The gene structures (number and organization exon-intron) are typical evolutionary imprints within certain gene families and are closely related to their function. The exon/intron arrangements of 169 BnaNPF genes were analyzed together with 53 AtNPF by comparing CDS and the corresponding genomic DNA sequences within and between subgroups based on the phylogenetic tree (Additional file 3: Figure S3). The BnaNPF genes have a higher degree of divergence among gene structure than NPF genes in Arabidopsis and contained the numbers of exons varying from 2 to 18. BnaC02NPF1.8 and BnaC09NPF1.9 in NPF1 subfamily, BnaC05NPF2.6, BnaA06NPF2.7 and BnaA06NPF6.8 in the BnaNPF2 subfamily were significantly longer than other genes and contained the most exons (16, 16, 18, 18 and 8, respectively), however, most of the BnaNPF genes contained no more than 6 exons. BnaNPF genes in different branches exhibited different gene structural features, while the genes in the same branch generally had similar intron/exon distribution patterns. For instance, BnaA05NPF1.4, BnaC05NPF1.5 and BnaA06NPF1.7 in BnaNPF1 subgroup, BnaC05NPF5.56, BnaA09NPF5.57, BnaA07NPF5.58 and BnaC07NPF5.59 in BnaNPF5 subgroup, and BnaC07NPF7.3, BnaA03NPF7.4, BnaC01NPF7.5 and BnaA01NPF7.6 in BnaNPF7 subgroup had almost the same exon/intron distribution characteristics within subgroup, and different distribution patterns between subgroups. To further explore the specific and conserved regions of 186 BnaNPF proteins, four conserved domains (PTR2, MFS_1, Chorismate_bind and PDDEXK_6) were identified by the HMMER website (Additional file 3: Figure S3). PTR2 domain, responsible for proton-dependent transport, is the signature domain of NPF protein and could be found in each BnaNPF member, which suggesting functional conservation. Major facilitator superfamily MFS_1 domain feature was detected to partially overlaps or within the PTR2 domain in some BnaNPF members (45/186). The chorismite_bind domain involved in chorismate-utilizing was found in BnaC05NPF2.6 and BnaA06NPF2.7, and BnaC03NPF4.4 contained an unknown function PDDEXK_6 domain.
Transcription factors bind to CREs in the promoter and regulate the expression of the target genes (Wittkopp and Kalay 2011). Generally, genes with similar CREs show the same expression patterns. The 2.0-kb upstream regulatory regions of the BnaNPF genes were used to explore the CREs (Fig. 2 and Additional file 8: Table S4). The result showed that 157 BnaNPF genes contained at least one type of CREs in the promoter regions, which indicated that complex transcriptional regulatory might be implicated for BnaNPF genes. Apart from the common CREs, such as the CAAT-box, TATA-box and some light-responsive elements (G-box, Box 4, GT1-motif and TCT-motif), some phytohormone responsive elements, such as the auxin-responsive elements (TGA-element, AuxRR-core, GATA-box, TGA-box and AuxRE), the ABA-responsive element (ABRE) and the JA-responsive elements (CGTCA-motif and TGACG-motif), and some abiotic stress-responsive elements, such as the low-temperature responsive element (LTR), the salicylic acid responsive element (TCA-element) and the anaerobic responsive element (ARE) were identified. Some over-presented CREs, including ARE, ABRE, CGTCA-motif, TGACG-motif, LTR and TC-rich repeats, were involved in the molecular response of plants to phytohormone, defense and stress responsiveness (Fig. 2a). Among these, the MYB recognition site was most enriched, implying that the MYB transcript factors may play crucial roles in the transcriptional regulation of the BnaNPF genes. Besides, RY-element, the CRE involved in seed-specific regulation, was identified in the promoters of the 15 BnaNPF genes, which indicated that these BnaNPF genes might function in the process of seed development and matter storage.
Gene expression pattern analysis of NPF genes in diverse tissues of B. napus
In order to explore the potential tissues in which NPF genes function in B. napus, the expression profiles were characterized in 90 different organs or tissues, including cotyledon, root, vegetative rosette, stem peel (peel of upper, middle and lower stem), leaf (23 parts or periods), sepal, petal, filament, pollen, bud, silique wall (30 development periods) and seed (24 development periods) based on transcriptome information from BnTIR (http://yanglab.hzau.edu.cn/BnTIR/eFP). Except for half of the genes in the BnaNPF2, BnaNPF5 and BnaNPF8 subfamily that has relatively low expression values (FPKM <1) or no expression, most of the BnaNPF genes had preferential expression profiles in the 90 tissues (Fig. 3). For instance, half of BnaNPF1 genes showed high expression levels in the silique wall at the early and middle development stages and in leaves of all parts; One-third of BnaNPF2 genes (10/30) showed specific expression in the seeds at early and middle development stages; Most of BnaNPF7 genes (8/10) were highly expressed in the bud, petal, pollen and seeds. In general, expression patterns were conserved in each clade within a subfamily, but were quite different across different subfamilies, suggesting the expression differentiation trend of this gene family. For instance, expression patterns of BnaNPF2 and BnaNPF4 subfamilies were classified into three conserved patterns that were consistent with the three major clades in these two subfamilies, and while the expression profile of the BnaNPF3 genes was similar in this subfamily.
Based on the expression profiles in seeds, silique wall and leaves from multiple development periods or plant parts, the expression patterns of BnaNPF genes in leaves, silique wall and seeds could be clarified clearly (Fig. 4). Although some members of both BnaNPF1 (4/10) and BnaNPF2 (5/30) were highly expressed in the silique wall of the developing silique, BnaNPF1 genes showed higher expression levels at the middle development stages, and BnaNPF2 genes were higher expressed at the later development stages. The members of BnaNPF3 with high expression levels in the silique wall, BnaA07NPF3.1 and BnaC06NPF3.2, were higher expressed at the early than later development stage of the silique. However, some members of BnaNPF4 and BnaNPF5, such as BnaA06NPF4.10, BnaC03NPF4.11, BnaC04NPF5.3, BnaA04NPF5.4, BnaA08NPF5.54 and BnaC08NPF5.55 were higher expressed in silique wall at the later than early development stages of the silique. BnaC09NPF5.9 and BnaA09NPF5.10 were found preferential high expression in aged leaves and silique walls and nearly mature seeds. BnaC01NPF7.5 and BnaA01NPF7.6 showed higher expression at later development stages of seed, and BnaC07NPF8.7, BnaA06NPF8.8 and BnaC09NPF8.9 were preferential higher expressed in aged leaves and silique wall.
Expression dynamic of NPF genes during the growth of B. napus under vernalization
There are differences in nutrition utilization and phytohormone distribution at different stages of plant growth. In order to explore the function and expression variation of BnaNPF genes, the expression trend of BnaNPF genes in leaves was analyzed during the growth of B. napus based on ZS11 transcription data from online data resources BnPIR (http://cbi.hzau.edu.cn/bnapus/). Although the expression levels were quite different, members of the same subfamily usually have the same expression trend in leaves during the growth of B. napus (Fig. 5). The members in subfamily NPF1, NPF4, NPF5 and NPF7 seem to be the same expression trend, decline at beginning of vernalization or in the early stage of vernalization and rise after vernalization. For example, BnaC06NPF1.6, as an ortholog of AtNPF1.2 that was able to transport GA and JA, has the exactly same expression trend and high expression level with BnaC04NPF5.3 (homologous with AtNPF5.1 that was able to transport GA, JA and ABA), which indicated that they might play important roles in phytohormone transport for a developmental phase transition. Some other members in NPF2, NPF3 and NPF6 shared this similar expression trend, the expression level raise during vernalization and declined after vernalization. In typical cases, the expression level of BnaC02NPF2.6, BnaC06NPF3.2 and BnaC05NPF6.10 are dramatically raised from T1 to T2, and then begin to decline, which indicated that these members played an important role in the development stage during vernalization. Many BnaNPF genes showed diverse expression levels in the leaves of different cultivars at certain development stages (Additional file 4: Figure S4). For example, BnaA05NPF1.4 and BnaC05NPF1.5 have no expression or lower expression levels in Shengli than other cultivars at T3 and T4 stages. At the T2 stage, BnaC02NPF2.16 showed obviously a higher expression level in cultivars Quinta, Shengli and Tapidor than others. BnaA06NPF8.8 has almost no expression during the whole development process in the three cultivars Shengli, Tapidor and Westar in comparison to other cultivars. These expression variations might lead to differences in nitrogen utilization efficiency, peptide transport and polar transport of phytohormone among the cultivars.
Transcriptional analysis of BnaNPF genes under nitrate deficiency
Nitrate is the main substrate that NPF proteins transport and more than one-third NPF members have been reported to have nitrate transport function in Arabidopsis [8]. Here, we analyzed the expression changes of BnaNPF genes under the condition of nitrogen suitability and deficiency. A total of 20 BnaNPF genes were detected to have relatively high expression and showed significant expression changes in shoot and/or root (Fig. 6). Among them, six BnaNPF genes (BnaC06NPF4.16, BnaC04NPF6.1, BnaA07NPF6.2, BnaA08NPF6.6, BnaC05NPF7.7 and BnaA09NPF7.8) were expressed at a high level in both shoot and root, and the expression levels were significantly elevated in both shoot and root after treated with low nitrogen. Ten BnaNPF genes were specifically expressed in root, of which seven (BnaA06NPF2.7, BnaC06NPF2.20, BnaC08NPF6.4, BnaA09NPF6.5, BnaA06NPF6.8, BnaC05NPF6.9 and BnaA05NPF7.10) were induced to highly express after low nitrogen treatment, which suggested they have a positive function for nitrogen absorption by roots. However, the expression of the other three BnaNPF genes (BnaC06NPF4.8, BnaC09NPF4.14 and BnaC07NPF7.3) that specifically expressed in roots were declined under low nitrogen treatment. In addition, four BnaNPF genes that were specifically expressed in shoots also showed different expression changes under low nitrogen treatment: BnaC02NPF2.16 and BnaA06NPF4.10 were upregulated, and the other two (BnaC06NPF3.2 and BnaC07NPF6.3) were declined after treated by low nitrogen.