Characteristics of ribosome profiling analyses
To examine the molecular reaction to simulated microgravity, ribosome profiling and RNA-seq analyses were conducted on Arabidopsis seedling roots subjected to normal growth conditions and simulated microgravity. Three biological replicates were used for each data set, yielding a combined total of approximately 482 to 579 million reads. Ribosome profiling reads were acquired for both normal growth and simulated microgravity seedlings, as outlined in Table S1. Furthermore, RNA-seq reads were obtained for normal growth and simulated microgravity seedling roots, as detailed in Table S2.
Transfer RNA (tRNA) recognizes codons on mRNA and transfers the corresponding amino acids during translation. It plays a crucial role in determining protein synthesis and translation speed. The concentration of tRNA have a significant impact on these processes. To avoid the precipitation of tRNA along with the ribosome-nascent peptide chain complex during the precipitation process, we compared the sample data to GenBank and Rfam databases after removing ribosomal RNA using BLAST. The ribosome profiling raw reads underwent initial processing to remove rRNA sequences. Subsequently, samples were aligned to TAIR10 Arabidopsis reference genome, as described in the experimental procedures. Strong correlations for both ribosome profiling and RNA-seq data were identified by principal component analysis (PCA) between three biological replicates, as depicted in Figure S1.
Ribosome profiling in various organisms exhibits common characteristics related to the translation process. These characteristics can be considered indicative of effective ribosome profiling libraries. 12To assess the impact of simulated microgravity stress, we conducted a comparison of ribosome profiling data between normal growth and simulated microgravity seedling roots. Our analysis revealed that the length of RPFs was about 32 nucleotides in both samples, a length consistent with findings in mammalian systems and rice (Figs. 2a and S2a) 11, 16. The observed read length was longer than the 28 nucleotides reported in yeast 11, 12. The distribution of read density by ribosomes showed an advancement of three nucleotides at a time during translation, leading to the characteristic triplet periodicity 12. Strong evidence of a three-nucleotide periodicity was identified when analyzing sequences in normal growth and simulated microgravity seedling roots (Figs. 2b and S2b). Meta-gene analysis revealed distinct peaks of read density surrounding the start and stop codons (Fig. 2c), indicating that translation initiation and termination processes may serve as the limiting factors for translation in Arabidopsis seedlings. Additionally, an elevated level of normalized read density was detected close to the start of the coding sequence (CDS), extending up to around 200 nucleotides. Tuller et al. (2010) reported that rare codons slowed-down the start of translation at the start of the CDS, so the next part of the sequence could be translated more quickly and efficiently.19
Read density distributions within the first 200 nucleotides of the CDS were found to be similar between normal growth and under simulated microgravity conditions in seedling roots (Fig. 2c). The findings with yeast under hydrogen peroxide stress differed from this. Stress led to more ribosomes at the start of translation. Additionally, small ORFs have been identified in the untranslated regions of various species, including yeast, mammals, and plants 20, 21. In the seedlings under normal growth, 1.9% and 1.3% of RPFs were observed in the 3’-UTR and 5’-UTR, respectively, whereas 96.8% of RPFs were located in the gene body (Fig. 2d). Under simulated microgravity conditions, the proportion of reads mapped to the gene body increased to 97.6%, whereas in the 3’-UTR and 5’-UTR decreased to 1.4% and 1.0%, respectively (Fig. 2d). The alterations observed in the ribosome-protected footprints (RPFs) mapping to the 5’- and 3’-untranslated regions (UTRs) indicate the potential for translation modifications in these regions, consistent with previous findings in Arabidopsis.
Simulated microgravity affects gene expression both transcriptionally and translationally.
It is reported that simulated microgravity affects plant growth and development.10, 22, 23 By the conduct of ribosome profiling coupled with RNA-seq methodologies, we analyzed the impact of simulated microgravity on seedling roots cultivated under normal growth and simulated microgravity conditions.
Our initial study focused on delineating the alterations in gene expression induced by simulated microgravity. At the transcriptional level, we found 360 genes up-regulated and 71 down-regulated, whereas, at the translational level, 331 genes were up-regulated and 49 were down-regulated (Fig. 3a). The genes with up-regulation expression level surpassed down-regulation at both translational and transcriptional levels, suggesting a widespread enhancement in gene expression. Polysome profiling was used to validate translational promotion, as depicted in Fig. 3b. In comparison to the control group of seedlings under normal growth conditions, the polysome fraction in seedlings subjected to simulated microgravity was up-regulated from 49.5–56.1%, whereas the monosome fraction was down-regulated from 32.1–25.3%. These findings unequivocally illustrated a global up-regulation of translation in simulated microgravity-exposed seedling roots (Fig. 3b).
Subsequently, a functional analysis was conducted on the differentially expressed genes (DEGs) through Gene Ontology (GO) analysis. A significant overlap in enriched pathways was found at both the transcriptional and translational levels. Specifically, pathways related to response to stimuli, response to stress, wounding, defense response, hormone signaling pathway, and response to oxygen levels, etc. (Figure S3). In addition, Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis revealed that genes were enriched at both transcriptional and translational levels in photosynthesis, hormone signaling pathway, plant-pathology interaction, nitrogen metabolism, and biosynthesis of secondary metabolites, etc (Figure S4).24
It is noteworthy that a considerable proportion of genes within enriched pathways displayed disparate responses at the two levels. As illustrated in Fig. 3c, only under one third of responsive genes (31.6% up-regulated and 20.8% down-regulated) were found to be shared between the two processes (Fig. 3c), which suggested discordant changes at the two levels. To assess the collective pattern at the two distinct levels concurrently, the fold-change of gene expression measured by FPKM (fragments per kilobase of transcript per million mapped reads) was computed. The findings indicated a moderate correlation between altered expression at the transcriptional and translational levels (Fig. 4). Subsequently, genes were categorized into nine groups based on the FPKM fold change. Subsequent analysis revealed that about 40% of the responsive genes fell into the (classes C and G) concordant groups, which exhibited coordinated regulation either increasing or decreasing to a comparable degree at both levels.
Approximately 60% of responsive genes were found within the other six discordantly groups. Further analysis was conducted on genes within four discordantly regulated groups. The classes D and F genes showed regulation without significant changes in translation. In class D, transcriptionally down-regulated genes were enriched in plant hormone signal transduction and MAPK signaling pathway. Additionally, in class F, 244 up-regulated genes were observed to be enriched in nitrogen and carbon metabolism at transcriptional level. Genes within classes B and H exhibited translational regulation without discernible alterations at the expression level. Specifically, there were 167 up-regulated genes at the translational level in class B, associated with pathways related to biosynthesis of secondary metabolites and plant-pathogen interaction. Class H, on the other hand, featured 66 genes with reduced translation, showing significant enrichment for sugar metabolism and plant hormone signal transduction.
The influence of simulated microgravity stimuli resulted in notable changes in the translational efficiencies of numerous genes.
Translational efficiency (TE) is a metric that directly assesses the effectiveness of RNA utilization and is a significant factor in the process of translation 12. Significantly, defense response and photosynthesis have been recognized as prominently enriched pathways in developing Arabidopsis seedlings under a microgravity environment, demonstrating altered at the mRNA levels 25. This changed mRNA level may be related to the altered TE of these pathway genes (Fig. 5a).
We aimed to investigate the likely mechanism of plants response to simulated microgravity through alterations in TE. Despite observing a high degree of similarity in global mRNA abundance and TE among simulated microgravity seedlings, our analysis revealed 552 genes with decreased TE and 529 genes with increased TE (z-score > 2) following simulated microgravity treatment (Fig. 5a). These findings suggested that modulating gene TE may be an effective mechanism in the response of plants to simulated microgravity stimuli. To further evaluate this phenomenon, we conducted a detailed analysis of the data.
Plants demonstrate a wide range of responses to stress. To better understand changes in TE, we analyzed the distribution of genes showing decreased and increased TE across nine predetermined responsive categories. Our findings indicated that modifications in transcriptional and translational abundance influence changes in TE, as illustrated in Figure S5. Nevertheless, a majority of the genes displaying significantly altered TE were found to be regulated exclusively at either the transcriptional or translational level. For example, genes belonging to Groups F and H were solely regulated at one of these levels. Collectively, these groups account for major of genes exhibiting a notable increase in TE, as indicated by a z-score above 2. Moreover, genes within groups B, D, and E, which are subject to both transcriptional and translational regulation, may experience a substantial reduction in TE (Figure S5).
The genes translational efficiencies are influenced by their specific sequence characteristics.
Gene sequence features influenced mRNAs associated with polysomes.26 The potential impact of typical characteristics (sequence length, normalized minimal free energy (NMFE), and GC content) on TE was evaluated. Our analysis revealed that genes with higher TE tended to have higher GC content, NMFE, and shorter lengths as shown in Fig. 6a-c in CDS region. Conversely, differences in these characteristics between genes with higher and lower TE in 3’-UTR were minimal. Specifically, only genes in the highest TE group exhibited shorter length and lower NMFE in the 3’-UTR compared to other groups, as depicted in Figures S6. Interestingly, for 5’-UTR, genes with higher TE exhibited a higher NMFE, shorter length, and lower GC content (Figure S6). The trends observed in the normal growth seedlings were similar to those observed in seedlings subjected to simulated microgravity (Figure S7).
Subsequently, an examination was conducted to determine whether genes exhibiting concordant and discordant expression patterns displayed differences in their sequence characteristics. The Kolmogorov-Smirnov test results demonstrated that genes within the concordant group, located in the overlapping regions of Fig. 3d, exhibited a higher GC content and lower NMFE in CDSs compared to genes within the discordant group (Fig. 6d, e).
Widespread upstream open reading frames
The prevalence of uORFs was found to decrease under simulated microgravity (Fig. 2d). This phenomenon mirrored previous findings in yeast subjected to starvation 12, 27, suggesting a potential role of uORFs in this stress response. These observations highlighted the significance of elements within the 5’-UTR in modulating cellular responses to stress, particularly through the regulation of uORFs. It is widely recognized that uORFs, which are small ORFs situated in the 5’-UTR, can inhibit the translation of main ORFs by reinitiation and leaky scanning. uORFs have been reported in various species 21. This study facilitated the comprehensive identification of uORFs in Arabidopsis seedling roots at a genome-wide level. To identify genes containing uORFs accurately, our study focused on all Arabidopsis genes. We revealed that approximately 20% of these genes, totaling 4023 genes, were predicted to contain uORFs. The figure S8 indicates the length distribution of all predicted uORFs, statistics for the number of predicted uORFs per gene, and the length distribution of translated uORFs in the normal growth and simulated microgravity seedling roots.
The translation of main ORFs was repressed by the translated upstream ORF.
To demonstrate the distinctions between untranslated and translated uORFs, three parameters related to the mORF reinitiation (including uORF length, 5’-UTR length, and NMFE) were compared 21. Our results suggested that translated uORFs exhibited greater length and higher folding potential compared to untranslated uORFs (Fig. 7a). Conversely, the shorter 5'-UTR genes more likely contribute to translated uORFs (Fig. 7b). Upon further examination of the impact of 5’-UTR length, it was observed that the distance from the uORF to the transcription start site (TSS) in translated uORFs were obviously shorter and the distance from the uORF to the start codon of the mORFs in translated uORFs were obviously longer (Fig. 7c-e).
The Kozak consensus sequence, denoted as "GCCA/GCCAUGG," is a conserved motif found in the mRNA of certain eukaryotes 28. ORFs containing Kozak sequences exhibit a greater likelihood of translation and increased translation efficiency. Statistical analysis should be conducted on all small open reading frames (sORFs) that contain Kozak sequence features. Motif analysis was conducted in the vicinity of the start codon for all full-length gene sequences. The sequences flanking the start codon in the genes exhibited a high degree of similarity to the Kozak sequence. Higher GC content was observed in the translated uORFs flanking sequence, as depicted in Figs. 7f.
To examine likely impact of translated uORFs on the downstream mORFs translation, we analyzed three groups TE: those predicted uORFs lacking translation, those with a single translated uORF, and those with multiple translated uORFs. Translated uORFs have been shown to significantly reduce the TE of corresponding genes, with this effect becoming more pronounced as the number of translated uORFs increases (Fig. 7g). Previous studies have indicated that uORFs can be influenced by sugar, light, and pathogenic infections 16, 29. Consequently, we performed an examination of the translation efficiency (TE) of upstream open reading frames (uORFs) in seedlings cultivated under simulated microgravity conditions. Our findings indicated that the TE of uORFs exhibited comparable levels in both normal growth and simulated microgravity environments (Fig. 7h).