Identification and characterization of SUMOylation genes in Caixin
Whole-genome sequencing of Caixin was completed in 2023 [25]. Using known Arabidopsis SUMOylation component sequences as queries, we conducted a search for their respective orthologues in the Caixin genome via BLAST and domain confirmation methods. A total of 30 genes encoding the core components of SUMOylation were identified in this Brassica species (Table 1). Our list revealed that the Caixin genome contains nine SUMO genes, termed BrSUMO1 to BrSUMO9. To further understand the evolutionary relationships among these SUMO isoforms, we searched a number of other plant genomes for related sequences, including those of Arabidopsis, rice, soybean and maize (Fig. 1A). Phylogenetic analysis revealed that these SUMOs clustered into three groups: the “canonical SUMO”, “noncanonical SUMO” and “SUMO variant” groups. Members belonging to the “canonical SUMO” group were strongly conserved. Five members of the BrSUMO family (BrSUMO1 to BrSUMO5) are divided into this group and are evolutionarily most closely related to AtSUMO1. The “noncanonical SUMO” group, which contains four BrSUMOs (BrSUMO6 to BrSUMO9), shares low amino acid identity. No BrSUMOs belong to the “SUMO variant” group. The C-terminal di-Gly motif of SUMO is necessary for substrate conjugation. Alignment of SUMO sequences revealed that, with the exception of BrSUMO7, both canonical and noncanonical BrSUMOs have a C-terminal di-Gly motif, indicating their potential ability to covalently attach to target proteins. We also noted that the SUMO interaction motif in BrSUMO6 and BrSUMO7 is different from that in other BrSUMOs. Additionally, the other two noncanonical BrSUMOs (BrSUMO8 and BrSUMO9) did not contain the conserved Lys (Lysine) residue, which is required for the formation of SUMO chains (Fig. 1B).
Table 1
Characteristics of SUMOylation system genes in Caixin
Group | Gene | Sequence ID | Chromosome | Protein length (aa) | MW (Da) |
SUMO | BrSUMO1 | Bra_cxA08g015400 | A08 | 101 | 11124.43 |
BrSUMO2 | Bra_cxA03g012940 | A03 | 98 | 10844.17 |
BrSUMO3 | Bra_cxA01g030080 | A01 | 113 | 12733.35 |
BrSUMO4 | Bra_cxA10g021190 | A10 | 140 | 15970.09 |
BrSUMO5 | Bra_cxA02g036080 | A02 | 109 | 12399.09 |
BrSUMO6 | Bra_cxA03g050360 | A03 | 121 | 13471.4 |
BrSUMO7 | Bra_cxA05g032960 | A05 | 152 | 16996.64 |
BrSUMO8 | Bra_cxA09g065660 | A09 | 92 | 10332.47 |
BrSUMO9 | Bra_cxA06g009930 | A06 | 116 | 13374.26 |
E1 | BrSAE1 | Bra_cxA01g031910 | A01 | 322 | 35888.14 |
BrSAE2a | Bra_cxA04g017510 | A04 | 556 | 62585.22 |
BrSAE2b | Bra_cxA09g010380 | A09 | 552 | 62403.83 |
E2 | BrSCE1a | Bra_cxA04g035410 | A04 | 160 | 17851.38 |
BrSCE1b | Bra_cxA07g020470 | A07 | 160 | 17843.4 |
BrSCE1c | Bra_cxA04g037680 | A04 | 265 | 30304.82 |
E3 | BrSIZ1a | Bra_cxA02g039720 | A02 | 862 | 95055.24 |
BrSIZ1b | Bra_cxA10g015860 | A10 | 884 | 97190.92 |
BrSIZ1c | Bra_cxA06g025660 | A06 | 804 | 88269.61 |
BrMMS21 | Bra_cxA05g011250 | A05 | 160 | 17810.38 |
BrPIAL1a | Bra_cxA08g002610 | A08 | 986 | 108099 |
BrPIAL1b | Bra_cxA09g003500 | A09 | 636 | 70816.11 |
BrPIAL2a | Bra_cxA04g019330 | A04 | 723 | 79030.63 |
BrPIAL2b | Bra_cxA07g023060 | A07 | 640 | 69796.76 |
SUMO protease | BrULP1A | Bra_cxA01g003850 | A01 | 439 | 51090.7 |
BrULP1B | Bra_cxA03g019930 | A03 | 617 | 70756.78 |
BrULP1C | Bra_cxA06g038970 | A06 | 569 | 65650.8 |
BrESD4 | Bra_cxA08g024070 | A08 | 404 | 46397.07 |
BrFUG1 | Bra_cxA06g020780 | A06 | 223 | 26836.84 |
BrSPF1 | Bra_cxA08g003020 | A08 | 638 | 71562.39 |
BrSPF2 | Bra_cxA01g042650 | A01 | 792 | 89798.89 |
SUMOylation requires an initial activation step in which the C-terminal di-Gly of mature SUMO is first adenylated by a heterodimeric SUMO-activating enzyme (SAE1/2) [26]. In contrast to Arabidopsis, the Caixin genome encodes one small-subunit SAE1 gene (BrSAE1) and two large-subunit SAE2 genes (BrSAE2a and BrSAE2b). BrSAE1 encodes a protein of 322 amino acids that is conserved with AtSAE1a (90% identity) and AtSAE1b (82% identity). BrSAE2a and BrSAE2b both encode a protein of 625 amino acids that shares 87% and 93% identity with AtSAE2, respectively. Furthermore, functional domains were also identified in BrSAEs (Fig. S1A, B). Phylogenetic analysis of SAEs from various plant species revealed that BrSAE1 and BrSAE2a/b were evolutionarily most closely related to AtSAE1a/b and AtSAE2, respectively (Fig. S2A, B). We also predicted the 3D structures of SAE1/SAE2 heterodimers via SWISS-MODEL. The structures of both BrSAE1/BrSAE2a and BrSAE1/BrSAE2b were similar to those of AtSAE1a/AtSAE2 and AtSAE1b/AtSAE2 (Fig. S2C).
Upon activation by a SUMO-activating enzyme, the bound SUMO moiety is transferred via transesterification to an active site Cys (Cysteine) in the SUMO-conjugate enzyme SCE1 and then forms a SUMO-E2 thioester intermediate [27]. The Arabidopsis genome contains a single SCE1 gene (AtSCE1) compared with three other SCE1 genes (BrSCE1a, BrSCE1b and BrSCE1c) in Caixin. The maize genome encodes a novel class II isotype of SCE1, which is found only in the cereal branch of monocots. To further examine the origins of BrSCE1 isotypes, we phylogenetically analysed a collection of 17 SCE protein sequences from 4 plant genomes. As shown in Fig. 2A, all three BrSCE1 genes were clustered into class I and presented high similarity with homologues in other species, especially Arabidopsis. We then identified conserved motifs in the SCE1 proteins via the MEME program. A total of three conserved motifs were found in most of the SCE1 proteins, and motif 2 was present in all of these SCE1 proteins. BrSCE1a, BrSCE1b and BrSCE1c also share high sequence identity with AtSCE1 and have a highly conserved Ub-conjugating enzyme catalytic (UBC) domain (spanning from 8 to 150 aa residues, Fig. 2B), which is common to the UBC family.
Although SUMOylation can occur without SUMO E3 ligases under certain conditions [28], increasing evidence has revealed that the absence of E3 ligases significantly reduces SUMO conjugate levels in plants. There are three types of SUMO E3 ligases in plants: SIZ/PIAS (SIZ1), methyl methane sulfonate-sensitive protein-21/high ploidy-2 (MMS21/HPY2) and protein inhibitor of activated STAT (PIAS)-like (PIAL). The genome of Caixin encodes three SIZ1 genes (BrSIZ1a, BrSIZ1b and BrSIZ1c), four PIAL genes (BrPIAL1a, BrPIAL1b, BrPIAL2a and BrPIAL2b) and one MMS21/HPY2 gene (BrMMS21). A phylogenetic analysis of SUMO E3 ligases from various plant species revealed that different types of SUMO E3 ligases in Caixin are evolutionarily most closely related to those in Arabidopsis (Fig. 3A). Among the members of the SUMO E3 ligase family, AtSIZ1 is the most extensively studied in plants. Its involvement in various biological processes, including flowering [29–31], plant immunity [32], abiotic stress tolerance [8], thermomorphogenesis [33, 34], and phytohormone signalling [35–38], has been well documented. A common feature of known SIZ1 is the presence of five conserved domains (SAP, PHD, MIZ/SP-RING domain, PINIT and SXS motifs). Both BrSIZ1a and BrSIZ1b have five conserved domains, but a deletion of the SAP domain was observed in BrSIZ1c (Fig. 3B, S3). Like Arabidopsis, Caixin contains only one MMS21/HPY2-type E3 (BrMMS21), which also has a conserved SP-RING domain and shares high sequence identity with AtMMS21 (Fig. S4). Previous studies have shown that AtPIAL1 and AtPIAL2 function as SUMO ligases capable of SUMO chain formation. Sequence alignment revealed that both AtPIALs and BrPIALs have the MIZ/SP-RING domain as well as SIM motifs, which have specific affinities for binding to SUMO chains. However, the absence of the N-terminal SIM motif was observed in BrPIAL1b (Fig. S5).
SUMOylation is a reversible process that requires the specific action of SUMO proteases to process SUMO precursors and release SUMO from substrate conjugates [39]. A search of the protein sequence of Arabidopsis SUMO proteases revealed the presence of seven SUMO proteases in the Caixin genome. We phylogenetically analysed the selected plant species that encode potential orthologues of known Arabidopsis SUMO proteases. The phylogenetic tree revealed that the SUMO proteases in Caixin clustered into four groups, namely, the ESD4-type, OTS-type, FUG-type and SPF-type groups, which was consistent with the classification results in Arabidopsis (Fig. S6).
Cis ‑acting element analysis of SUMOylation gene promoters
To obtain information regarding the stimulus-induced, temporal, and spatial expression patterns of the SUMOylation genes in Caixin, 2.0 kb promoter regions upstream of these 30 genes were extracted and utilized for cis-acting element searches via the PlantCARE website. As shown in Fig. S7, a diverse range of cis-acting elements were present in the promoters of these genes. We classified these cis-acting elements into three groups: “plant growth and development”, “phytohormone responsiveness” and “stress responsiveness” (Fig. 4A). Almost all the promoters of these genes contained many phytohormone responsive elements related to ABA (ABRE), auxin (TGA element and AuxRR core), gibberellin (TATC-box, GARE motif and P-box), salicylic acid (TCA element) and MeJA (TGACG motif and CGTCA motif) (Fig. 4B). ABREs are the most common cis-acting elements involved in the plant hormone response, accounting for 32% of all ABREs. In addition, nine types of cis-acting elements related to the cell cycle (MAS-like motif), circadian control (circadian motif), zein metabolism regulation (O2-site), meristem expression (CAT-box and NON-box), root-specific expression (motif I), endosperm expression (GCN4 motif and AACA motif) and differentiation of palisade mesophyll cells (HD-Zip 1), which cover most plant developmental stages, were involved in plant growth and development (Fig. 4B). Seven types of cis-acting elements were divided into stress-responsive groups, namely, mixed stress response (TC-rich repeats), low-temperature responsiveness (LTR motif), anaerobic induction (ARE), MYB binding site (MBS) and wound-responsive element (WUN motif) elements, and ARE elements appeared in most of the promoters of these SUMOylation system genes (Fig. 4B). These results indicated that the expression of these genes might be associated with diverse signalling pathways during plant growth and stress responses.
Transcriptional profiles of SUMOylation genes under soft rot stress
Soft rot is a common disease of Brassica crops caused primarily by Pcc, a bacterial pathogen that infects plants through wounds at the base of the stem or petiole [40]. As the infection time increased, the infected area expanded, and the area from the water stains decreased (Fig. 5A). To further understand the potential roles and transcriptional regulation of the SUMOylation in Caixin under biotic stress, the expression patterns of the associated genes were analysed via RNA-seq datasets. The FPKM data (Table S1) from the RNA-seq of all the SUMOylation system genes upon Pcc infection are shown in a heatmap. As shown in Fig. 5B, a total of 30 SUMOylation system genes were clustered into four subgroups according to their expression patterns. Among these genes, 23 had a relatively high level of expression under normal conditions and were downregulated after Pcc infection. The expression of four genes, namely, BrSUMO3, BrSUMO8, BrSCE1b and BrSIZ1b, was continuously downregulated by Pcc infection. The expression of four other genes, namely, BrSUMO5, BrSUMO7, BrPIAL2a and BrESD4a, was continuously induced by Pcc infection. These results suggest that the transcription of SUMOylated genes is not only essential for plant development but also regulated by soft rot stress.
Pcc infection regulated SUMO conjugates in Caixin
Abiotic stresses, including heat, drought, and oxidative stress, have been shown to substantially increase the levels of SUMO conjugates [9, 23, 41]. To determine whether SUMOylation in Caixin responds to soft rot stress, the variation in the SUMO complex was investigated. The occurrence of heat-induced SUMO conjugates has been confirmed in several plant species, and high temperature is a key factor in outbreaks of soft rot disease in the field. Therefore, we first examined heat-induced SUMO conjugates in Caixin as a positive control. As shown in Fig. 6A, during 37°C treatment, the amount of SUMO conjugates in leaves increased substantially after 30 min of heat stimulus compared with that in the unstressed conditions. We then detected SUMO conjugates in Caixin leaves upon Pcc infection. After inoculation with the Pcc strain for 12 hpi, the amount of SUMO conjugates remained unchanged. However, the conjugate levels increased substantially from 12 to 24 hpi (Fig. 6B). These results suggest that SUMOylation is involved in the response of Caixin to soft rot stress.