Identification of PIN genes in peanut
A total of 27 AhPIN genes were identified in the peanut genome, and were designated as AhPIN1 to AhPIN27 according to their chromosomal location (Table 1). As shown in Table 1, the length of AhPIN proteins ranged from 73 (AhPIN10) to 661 (AhPIN5) amino acids (aa) with an average of 377.52 aa, and corresponding MWs from 8.44 to 91.56 KDa. The pI values varied from 6.55 (AhPIN19) to 11.32 (AhPIN11). The GRAVY values of AhPIN proteins ranged from − 0.47 to 1.48, 23 of which were positive GRAVY values, indicating that these proteins shared the characteristic of hydrophobicity. The prediction results of AhPIN subcellar location showed that 81.48% of AhPINs were found in the plasma membrane.
Membrane localization of AhPINs
To detect the subcellular localization of PIN family genes in peanut, we constructed a PIN-GFP expression vector using 1300-GFP (Fig. 1a) as the fundamental skeleton. Through an Agrobacterium-mediated transient expression system, PIN-GFP was injected into tobacco leaves that were 2–3 weeks old. Compared with 1300-GFP, all PIN-GFPs were expressed in the plasma membrane (Fig. 1b), consistent with the results predicted by the software, indicating that most PIN family genes of peanut might have a regulatory role in membrane transport.
Sequence alignment, gene structure, and phylogenetic analysis of AhPINs
To further assess the evolutionary relationships of AhPIN genes, an unrooted phylogenetic tree was constructed based on an alignment of 26 AhPIN, eight AtPIN, and 12 OsPIN full-length protein sequences (Fig. 2, Supplemental Table 1). Given the criteria for wheat PIN classifications [22], the AhPIN members were categorized into four subfamilies: I, II, III, and IV. The sequence of AhPIN10 was too short to use in phylogenetic tree construction, so it was not classified into a subfamily. Subfamily I possessed 11 AhPINs and subfamily IV contained two AhPINs, which were the largest and the smallest group, respectively.
The exon-intron structure of subfamilies exhibited significant diversity (Fig. 3a and b). Exon numbers of these AhPINs ranged from 1 (AhPIN10) to 9 (AhPIN15) and the lengths of exons varied from 2 to 1,662 bp with an average of 281.47 bp. The length of introns varied from 47 to 30,482 bp with an average of 618.07 bp, displaying larger variation than exons. AhPINs clustered in the same group shared similar exon-intron structures, and exon numbers tended to be consistent within a group. Group III had complex gene structures, indicating that they may have versatile functions in growth and development. Domain analysis showed that all AhPINs had the membrane transport domain (PF03547) and three of them (AhPIN14, 23, and 24) had the PHD-zinc-finger like domain (Supplemental Table 2). These PINs belonged to group III and were in the same evolutionary branch, and presumably have similar functions (Fig. 3a).
MEME software was used to explore conserved motifs of AhPIN protein sequences, and ten conserved motifs were predicted and annotated (Fig. 3c and d). The most conserved motifs were 3 and 4, predicted in 21 AhPIN genes (Fig. 3c). Notably, some conserved motifs were identified in specific subgroups. For example, motif 7 was uniquely located in subfamily I and II, motif 1 was located in the N-terminal region, and motif 2 was exclusively distributed in the C-terminal region. AhPIN27, 13, and 5 contain all ten motifs, and AhPIN10 contains only motif 9. Overall, conserved motif arrangements and gene structure differed greatly in subfamilies I and III, and were relatively conserved in subfamilies II and IV.
Cis-regulatory elements analysis
As the region of the transcription factor binding site that initiates transcription, the promoter is essential in controlling the tissue-specific or stress responses of gene expression. To further infer the regulatory mechanisms and potential functions of AhPINs, the upstream 1.5 kb promoter region sequences from the initiation codon were extracted to explore cis-regulatory elements (Fig. 4 and Supplemental Table 3).
Four types of cis-regulatory elements (development/tissue specificity, light responsiveness, stress, and hormone response) were detected (Fig. 4a). The majority of AhPINs possessed hormone- or stress-response related cis-elements, except for AhPIN26. In total, 20 gibberellin (GARE-motif, P-box, TATC-box), 52 MeJA (CGTCA-motif, TGACG-motif), 29 ABA (ABRE), 12 auxin (TGA-element, AuxRR-core), 61 ethylene (ERE), and 15 salicylic acid (TCA-element) cis-elements were detected in all AhPINs (Fig. 4b and Supplemental Fig. 1). There were four abiotic stress cis-elements detected, including Wun-motif (wound-responsive element, 15 genes), ARE (cis-acting regulatory element essential for the anaerobic induction, 31 genes), and LTR (low-temperature responsiveness, ten genes).
Other cis-elements were also found, including 28 development/tissue specificity-related elements and 92 light-responsive elements (Supplemental Table 3). In total, 55.56% of the AhPINs contained development/tissue specificity-related elements, including AC-II, CCGTCC-box, dOCT, GCN4_motif, circadian, and O2-site. There were 14 types of light-responsive elements observed in the promoter regions of AhPIN family genes, with the largest being I-box, having 13 in total (Fig. 4 and Supplemental Table 3).
Chromosomal distribution and gene duplications
The chromosome location information of 27 AhPIN genes in peanut was obtained from PeanutBase (https://www.peanutbase.org/home). AhPIN genes were distributed on all chromosomes except 7, 8, 17, and 19 (Supplemental Fig. 2). Chromosomes 4, 11, 14, and 20 contain three genes, while chromosomes 3, 5, and 10 contain two genes and the others contain only one gene.
In plant genome evolution, gene duplications (tandem and segmental duplication) lead to gene family expansion. All the identified AhPIN gene members were used to ascertain gene duplication in the peanut genome. A total of 17 gene pairs consisting of 24 genes were identified as duplicated, which were associated with segmental duplications (Fig. 5a). Replication events mainly occurred on chromosomes except chromosomes 6, 7, 8, 17, and 19. As no tandem duplication was detected, segmental duplication events played a predominant role in the expansion of the PIN gene family. The Ka/Ks ratios for 17 AhPIN paralogous gene pairs (Supplemental Table 4) varied from 0.0046 to 1.2188 with an average of 0.2088, suggesting that the PIN gene family has mainly undergone strong purifying selection pressure during the gene evolution process with limited functional divergence occurring after segmental duplication. The divergence time of paralogous genes ranged from 7.42 to 4,821.29 million years ago (Mya) with an average of 1,515.12 Mya.
Synteny analysis for PINs in peanut and its relatives
To further examine the evolutionary relationships of AhPIN genes, synteny analysis was performed between peanut and other species, namely A. ipaensis, A. duranensis, and A. thaliana. We found 13, 11, and five orthologous pairs between AhPINs and other PIN genes in A. ipaensis, A. duranensis, and A. thaliana (Fig. 5b, c and d). Moreover, 20 AhPINs were mapped to A. ipaensis and A. duranensis chromosomes with one on Adchr1, three on Adchr3, three on Adchr4, one on Adchr5, one on Adchr9, two on Adchr10, one on Aichr1, one on Aichr2, one on Aichr3, four on Aichr4, one on Aichr5, one on Aichr6, one on Aichr8, and three on Aichr10.
To provide a basic framework of orthologous relationships of PIN genes among peanut and its relatives, the Ka/Ks was also calculated (Supplemental Table 5). A. ipaensis and A. duranensis, the progenitors of cultivated peanuts, had 21 of 24 orthologous pairs with Ka/Ks values less than 1, suggesting that these genes have experienced intense purifying selection during evolution. Interestingly, the Ka/Ks values of A. thaliana and peanut homologous genes were both greater than 1.
Expression profiling of peanut PIN genes in various tissues
Based on the peanut RNA-seq data from publicly available databases, the expression profiles of AhPINs in 22 tissues were investigated (accession numbers and sample information of RNA-seq data are listed in Supplemental Fig. 3). Supplemental Table 6 shows the FPKM values of all expressed AhPIN genes.
The results revealed that most of the 27 AhPINs genes had distinct tissue-specific expression patterns based on the log-2-transformed (FPKM + 1) (Fig. 6a); AhPIN9, AhPIN22, and AhPIN24 were expressed in all 22 tissues. AhPIN5 and AhPIN19 had extremely high levels in leaf, especially AhPIN5, whose FPKM value in the three leaf tissues (seedling leaf ten days post-emergence, main stem leaf, and lateral stem leaf) were all greater than 18. AhPIN5 and AhPIN19 also exhibited significantly high expression levels in flower and pistils. Six AhPIN genes (AhPIN5, 9, 13, 19, 22, and 27) were expressed at a high level with FPKM > 5 in shoot, and AhPIN22 had the highest expression in reproductive shoot. AhPIN5, 9, 13, 19, 22, and 27 showed the highest expression in stamen. AhPIN5, 9, 19, and 22 showed the highest expression in tip.
Furthermore, we used qRT-PCR to detect the expression levels of nine AhPIN genes in roots, hypocotyls, stems, leaves, and pegs at different periods (Fig. 6b and c). Eight AhPIN genes were highly expressed in stems and leaves, except AhPIN22 (Ah72M57I.1), which was only highly expressed in stems. On the second day after penetration into the soil, AhPIN6 (AhK2GLG.1) and AhPIN22 (Ah72M57I.1) were highly expressed, while AhPIN24 (AhI2RKDG.1) and AhPIN13 (Ah66N4V9.1) were highly expressed on the sixth day after penetration. There was no significant difference in the expression of other AhPIN genes. The results indicated that AhPINs may function in the growth of peanut seedlings and the development of peanut gynophores.
Protein interaction analysis of AhPIN genes
To investigate protein-protein interactions between AhPINs and AtPINs, the STRING database was used to construct a protein interaction network (Fig. 7a, Supplemental Table 7). The network was formed by 29 nodes and 129 protein interactions, including nine AhPINs and 20 other genes. Among them, eight genes (AhPIN1, 6, 9, 10, 11, 12, 15, and 27) can interact with ABCB19 and AUX1. AhPIN24 exhibited an interactive relationship with HTA8 (Probable histone H2A variant 2), HTA9 (Probable histone H2A variant 3), PIE1 (SNF2 domain-containing protein/helicase domain-containing protein), TRO (Set1/Ash2 histone methyltransferase complex subunit ASH2), and AT3G21060 (Transducin/WD40 repeat-like superfamily protein).
Gene ontology enrichment
GO enrichment analyses was performed to understand the function of AhPIN genes using AgriGO (Fig. 7b, Supplemental Table 8). The results showed eight GO terms, divided into biological process and cellular component. AhPIN genes were enriched in transmembrane transport (GO:0055085), transport (GO:0006810), localization (GO:0051179), and establishment of localization (GO:0051234). The cellular component category showed similar results with subcellular localization signifying that AhPINs were enriched in the membrane (GO:0005575).