Genome-wide identification of PIN proteins in L. chinense
To identify L. chinense PIN genes, we used their HMMER and pfam number (PF03547) to search for PIN protein sequences in the L. chinense protein database. A local BLASTP algorithm was used with each of the eight AtPIN genes as queries. Then, the conserved domain of each candidate gene was predicted using the SMART database. We identified 11 LcPIN genes, with two pair of protein sequences (Lchi22082/Lchi33830 and Lchi23130/23125) having a similarity of 100%. Basic gene information, such as gene number, gene location, isoelectric point (pIs) and molecular weight (MW) for the L. chinense PIN proteins is listed in Table 1. The identified LcPIN encoded proteins range from 233 (Lchi23130/23125) to 662 (Lchi17800) amino acids in length, with pIs varying from 6.37 (Lchi15751) to 9.49 (Lchi05137) and MWs varying from 25.04kD (Lchi23130/23125) to 71.7kD (Lchi17800). The 11 LcPIN genes are distributed over 6 chromosomes. Chromosomes 5 and 11 each contain three LcPIN genes, while chromosomes 2, 3, 6, 7 and 17 each contain a single LcPIN gene (Fig 2A).
Table1: Summary of PIN gene family numbers in L. chinense
|
Gene Name
|
Gene ID
|
Locus
|
Num. of Amino Acids
|
MW(kDa)
|
PI
|
LcPIN1a
|
Lchi15751
|
Chr5
|
608
|
66.45
|
6.37
|
LcPIN1b
|
Lchi16330
|
Chr7
|
623
|
67.96
|
9.18
|
LcPIN1c
|
Lchi17800
|
Chr5
|
662
|
71.71
|
8.72
|
LcPIN2
|
Lchi05137
|
Chr17
|
616
|
66.77
|
9.49
|
LcPIN3
|
Lchi05662
|
Chr6
|
638
|
69.41
|
7.69
|
LcPIN5a-1
|
Lchi22082
|
Chr11
|
358
|
38.87
|
7.64
|
LcPIN5a-2
|
Lchi33830
|
Chr3
|
358
|
38.87
|
7.64
|
LcPIN6a
|
Lchi02600
|
Chr5
|
412
|
45.59
|
8.75
|
LcPIN6b
|
Lchi01817
|
Chr2
|
450
|
48.66
|
6.39
|
LcPIN8a-1
|
Lchi23125
|
Chr11
|
233
|
25.04
|
8.55
|
LcPIN8a-2
|
Lchi23130
|
Chr11
|
233
|
25.04
|
8.55
|
Phylogenetic analysis of the LcPIN gene family
To understand the phylogenetic position of the LcPIN gene family with respect to PIN genes from different plant species, we constructed an evolutionary relationship tree using the PIN protein sequences of L. chinense (Lc), A. trichopoda(Atr), A. thaliana (At), Z. mays (Zm), O. sativa (Os). V. vinifera (Vv) and S. bicolor (Sb), using MEGA7.0 with the neighbor-joining algorithm (Fig. 1 & Additional file 4). The L. chinense PIN family genes are divided into long (PIN1, PIN2, PIN3) and short (PIN5, PIN6, PIN8) PINs (Table 2). We found that PIN genes can be divided into 8 subfamilies: PIN1, PIN2, PIN3/PIN4/PIN7, PIN10, PIN6, PIN8, PIN5 and PIN9 (Fig 1). The PIN1 subgroup has extensively expanded in our selected species (including L. chinense, yet excluding A. thaliana), suggesting that it may play an important role in the growth and development of each of these different plant species. In Arabidopsis thaliana, the PIN3/4/7 family underwent an extensive differentiation in comparison to other plant species, indicating that AtPIN3/4/7 may have undergone functional specification in this species. The PIN6 group could not be identified in monocotyledons, consistent with previous studies [25]. The PIN3 and PIN10 subfamilies are exclusive to dicots and monocots respectively and may have evolved independently in these lineages, based on previous studies [10, 25]. Since previous studies found that magnoliaceae emerged before the divergence of monocotyledons and dicotyledons, this suggests that in monocotyledons the PIN3 and PIN6 families were lost, while the PIN10 family evolved independently. The PIN gene subfamilies in L. chinense most closely resemble those of A. trichopoda and A. thaliana.
Table2: Number of PIN genes in seven species
Species
|
Long PIN
|
PIN10
|
PIN9
|
Short PIN
|
Total
|
PIN1
|
PIN2
|
PIN3/4/7
|
PIN5
|
PIN6
|
PIN8
|
A. thaliana
|
1
|
1
|
3
|
0
|
0
|
1
|
1
|
1
|
8
|
A. trichopoda
|
2
|
1
|
1
|
0
|
0
|
1
|
1
|
3
|
9
|
O. sativa
|
3
|
1
|
0
|
2
|
1
|
0
|
4
|
1
|
12
|
S. bicolor
|
3
|
1
|
0
|
2
|
1
|
0
|
2
|
0
|
9
|
V .vinifera
|
2
|
1
|
0
|
0
|
0
|
2
|
2
|
1
|
8
|
Z. mayz
|
4
|
0
|
0
|
2
|
1
|
0
|
4
|
1
|
12
|
L. chinense
|
3
|
1
|
1
|
0
|
0
|
2
|
2
|
2
|
11
|
LcPIN protein gene structure and transmembrane topology
To better understand L. chinense PIN gene structure diversity and transmembrane topology, we constructed a phylogenetic tree using the PIN gene sequences. Gene structure patterns are highly conserved in LcPIN genes, with each gene containing 3-5 introns (Fig 2). The difference in gene size between the largest gene LcPIN1a and the smallest gene LcPIN6a is mainly due to intron length. Therefore, it’s possible that the diversification of exons/introns played an important role in the evolution of this gene family, but the exact mechanism is unclear.
Their predicted transmembrane topology showed that, the number of hydrophobic loops in LcPIN proteins has a high degree of variance. Excluding LcPIN2 and LcPIN6b, LcPIN proteins have a typical conserved structure with two highly conserved hydrophobic loops at the N and C terminus and a central hydrophilic loop within each terminus (Fig S2).
LcPIN geneshave highly conserved motifs and evolutionary relationships within different species
To further explore the evolution of the LcPIN gene family, we obtained 156 PIN amino acid sequences from Phytozome, of 17 different plant species belonging to 14 plant families: Bryophylla (M. polymorpha, P. patens), Selaginella (S. moellendorffii), Amborellaceae (A. trichopoda), Lauraceae (C.kanehirae), Magnoliaceae (L. chinense), Gramineae (Z. mays, S. bicolor, O. sativa), Cruciferae (A. thaliana), Leguminosae (G.max), Malvaceae (G. raimondii), Vitaceae (V. vinifera), Euphorbiaceae (M. esculenta), Rosaceae (C. sinensis), Salicaceae (P. trichocarpa) and Solanaceae (S. lycopersicum), and used these to constructed a neighbor-joining phylogenetic tree (Fig S2). The gene name and number of each PIN gene per species is indicated in Fig S4. Most of the sequences from different species within a single subfamily clustered together to form an independent group.
We then performed motif analysis using MEME/MAST, which showed highly conserved sequences (referred to as “Motifs” numbered from 1 upwards, starting at the N-terminus) present at both the protein N- and C-termini (Fig S2). Motif1-8, 12 and 16 were found or partially found in conserved sequence regions including the two transmembrane regions. Comparisons of motif distributions revealed that the intermediately hydrophilic region of PIN proteins was variable across different subgroups. It could furthermore be derived that two different types of PIN proteins can be distinguished during PIN gene evolution. The first group of sequences contains a short hydrophilic loop in between two conserved transmembrane regions and can be considered a “Short PIN”. This group of PIN proteins is represented in Arabidopsis by PIN5 and PIN8 [30]. Our result shows that PIN6 and PIN9 belong to the “Short PINs” as well. A second group of PIN proteins contains a longer hydrophilic loop between the two transmembrane regions and can be considered a “Long PIN”. This type is represented in Arabidopsis by PIN1-PIN4 and PIN7 [30]. The PIN1, PIN2, PIN3/4/7, PIN10 and some PIN genes from Bryophylla and Gymnosperm belong to the “Long PIN” in our evolutionary tree. This suggests that the “Long PIN” could have independently differentiated in Angiosperms, Gymnosperms, and Bryophytes.
As summarized in LcPIN, In the “Short PIN”, the LcPIN5 only contain Motif1-8,12 and 16 withmainly distributed in the hydrophobic region at both ends of the protein. The LcPIN8 lacked motif 1,3,7,12 in the N-terminus and there is Motif11 in the middle hydrophilic loop. In the “Long PIN”, the Motif numbers of LcPIN range is 17~20 and they have highly conservative in the hydrophilic and hydrophobic. According to the number of motifs in the middle hydrophilic area of PIN6 was divided into two clades, Motif 9,10 of hydrophilic loop specifity appear in LcPIN6b. This branch number of motif was clost to “Long PIN”. LcPIN6a was close to the “Short PIN” for the Motifs were distributed only in hydrophobic regions. In the whole, the increase in the number of PIN6 motifs was more like a transition from “Short PIN” to “Long PIN”. Combing with the PIN6 in the Amborella trichopoda, we guessed the sequences of Motif 9/10 could be believed to the symbol of the appearance of a long PIN. Compared to the “Short PINs”, the“Long PINs” possess a complex hydrophilic loop, including almost all sequence motifs save for those found in the conserved region. However, additions or deletions to individual motifs between and within subgroups could be found in the middle hydrophilic loop.
We constructed an evolutionary tree by multiple sequence alignment (Fig S2) and found a close evolutionary relationship between all LcPINs and PIN proteins from A. trichopoda and C. kanehirae, indicating that it could be a more recent evolutionary relationship to these two subgroups. A separate analysis of each subgroup reveals an interesting phenomenon: the short PINs are divided before long PINs and the gramineae (grasses) PIN1/8 subgroups evolved into a separate branch before LcPIN1/8 emerged (Fig S2). This result suggests that the long PINs could play a specific functional moving in particular direction by the deepening of differentiation and the PIN1/8 subgroups could exist the difference of funcition in monocotyledons and dicotyledons. However, this specific conclusion still needs further analysis.
Functional site analysis within L. chinense PIN conserved motifs
Through previous experimental verification and data analysis, several functional elements and sites that control PIN protein polarity, trafficking and activity have been identified [3, 31, 32]. Our multiple sequence alignments show that these elements reside for a large part within the highly conserved LcPIN sequence motifs (Fig S2). For example, motif1 and motif5 contain two cysteine residues (C39 and C521) and occur in the transmembrane domain of all LcPIN proteins, excluding only LcPIN8 which only contains motif5 (All functional sites are labeled with LcPIN1a as a reference) (Fig 3 and Additional file 3). This motif has been implicated in regulating the endocytosis and distribution on the PM of “Long PINs” [33], while for “Short PINs” the functionality of these sites has not yet been verified.
The hydrophilic loop (HL) domain contains identifiable motifs as well, being motifs 4, 7, 9 and 15 (Additional file 2). Previous studies have shown that these motifs in the PIN protein HL contain sites involved in regulating PIN protein membrane abundance, as well as in maintaining their polar localization in the cell [34–37]. For example, the NPXXY element (Motif 4) near the C-terminus plays an important role in AtPIN1 localization (Fig. 3 & Additional file 3, Mravec et al., 2009).
Motif15 contains a conserved phenylalanine residue (F165) in all “Long PIN” protein sequences of L. chinense (Fig 3). This residue has previously been found to interact with μA- and μD-adaptins in vitro and is possibly involved in PIN1 trafficking and polar localization in A. thaliana [37].
Coordinated PIN-mediated auxin transport requires activation and polarity control via phosphorylation by protein kinases [38]. So far, the phosphorylation of PIN is known to be controlled by the following three protein families: AGC kinases, PROTEIN (MAP) KINASES (MPKs), and Ca2+/calmodulin-dependent protein kinase-related kinases (CRKs) [31]. We found a small number of phosphorylation sites associated with these kinases, such as S1~S3 (motif9) and T1~T3 (motif17, not all PINs have this motif), that are highly conserved (Additional file 3). These sites inside highly conserved TPRXS motifs and are phosphorylated by MPK4/6 kinases [32], suggesting possible coregulation. Part of the sites, such as T1/T2 (motif9) were different in individual PIN proteins of different species. For example, Thr is replaced by Gln in some gramineous plants. The protein kinases PID/WAGs and D6PK directly phosphorylate PINs mainly through three conserved serine sites: S1, S2 and S3, activating PIN-mediated auxin efflux and regulating PIN polarity (Barbosa et al., 2018; Hammouti et al., 2016; Huang et al.,2010).
Our multiple sequence alignment showed that these sites of significance are highly conserved in the motifs found in the “Long PIN” HL domain and all PIN transmembrane domains. We could only detect sequence divergence in individual angiosperm sequences. These results suggest that the conserved sites in multiple motifs may have a fundamental function in different species.
Organ-specific expression profile of the LcPIN family genes
We used fluorescence quantitative PCR to analyze the expression of LcPIN gene family members in different organs (stamen, pistil, petal, bark, bud, root, stem, leaf) of L. chinense. Since the LcPIN5a-1/5a-2 and LcPIN8a-1/8a-2 sequences are identical, it was difficult to design specific expression primers to distinguish them. Therefore, the expression level of these genes is the combined expression of two gene copies. We found that the LcPIN gene family members are expressed in almost all L. chinense tissues (Fig 4). However, the relative expression levels of LcPIN8a-1/8a-2 and LcPIN1b are comparatively low and absent from some tissues. LcPIN3 and LcPIN6a are highly expressed in all tissues and LcPIN1c is expressed in multiple tissues except the petal and leaf. The expression patterns of LcPIN1a and LcPIN5 show high similarity, with higher expression in the bud and leaf than in other tissues. LcPIN2 and LcPIN6b showed more specific expression patterns in a more limited number of tissues, with LcPIN2 mainly expressed in the root and stamen and LcPIN6b mainly in buds. Interestingly, we found that LcPIN3 and LcPIN6a show relatively high expression in stamens and petals. The upper part of the L. chinense petal is covered with green, and the color of middle and lower part becomes weak compared with the L. tulipifera. To further observe the changes of PIN genes in stamens and petals. We established the dynamic expression pattern in different periods of petals and stamens. In the stamens, LcPIN6a, LcPIN3, LcPIN1a and LcPIN1c was expressed (Additional file 4). The LcPIN3 had a higher level of expression than LcPIN1a/1c (Fig 5A). At the same time, its level of expression has been declining from LC-1 to LC-3 and increased in LC-4. Comparing with the LcPIN3, the changed of LcPIN1a and LcPIN1c was more stabilization (Fig 5A &Additional file 5). The LcPIN3 and LcPIN6a displayed a similar expression profiles in the same part of petals and LcPIN1a had a low expression except in the upper part of LC-4 stage of petal development (Fig 5A & Additional file 5). On the lower middle part of the petal in the LC-4 stage, the LcPIN3 and LcPIN6a expression levels rose sharply (Fig 5C &5D). Before this stage, their expression level was relatively stable in the middle of the petal. To summarize the above, the LcPIN3 could display certain influence on the development of stamens and the LcPIN3 and LcPIN6a play a role in the development of petal.
PIN genes dynamic expression patterns during the somatic embryogenesis in Liriodendron × sinoamericanum(154102)
As an auxin efflux protein, PIN plays an important role during the growth and development of many plant species [39]. Though the Liriodendron × sinoamericanum artificial hybrid (obtained from the Chinese and North American Liriodendron subspecies) acquired immature embryos induced embryogenic callus. Based on previously generated RNA-Seq data that was obtained from successive developmental stages during somatic embryogenesis, we constructed a heat-map showing the expression of all 11 LcPINs at each stage (Globular embryo, Heart-shaped embryo, Torpedo embryo, Immature cotyledon embryo, Cotyledon embryo and Plantlet). Barely detectable or no expression was observed for LcPIN5a-1/5a-2, LcPIN8a-1/8a-2 and LcPIN6a/6b (Fig 6), which indicates that they might not express during embryogenesis or are only activated under special conditions. The other LcPIN (Long PIN) gene expression patterns were divided into two groups. LcPIN2 and LcPIN1b expression was barely detectable at the globular embryo stage, after which expression levels rose until the immature cotyledon embryo stage. The LcPIN1a, LcPIN1c and LcPIN3 expression levels remained relatively constant throughout this process except at the plantlet stage. These findings suggested that the long LcPINs genes could play a key role at different stages of somatic embryogenesis, while Short PINs are barely expressed and less likely to be involved.