Expression profiles of tsRNAs
A total of 4,021 tsRNAs mapped to 334 unique tRNAs, including tRNAs in the cytosol or mitochondria, were identified. Interestingly, some tsRNAs mapped to different tRNAs shared the same sequence. Based on this classification, we further analysed the proportion of tsRNAs subtypes in patients with sarcoidosis and healthy controls. The pie charts show the distribution of the number for each subtype of tsRNA with a threshold average CPM ≥ 20. A total of 231 tsRNAs for exact matches were identified in the serum (207 in sarcoidosis, 214 in healthy controls). In the sarcoidosis group, two tRF-2, 33 tRF-3a, 22 tRF-3b, 38 tRF-5a, 24 tRF-5b, 76 tRF-5c, and 12 tiRNA-5 were identified in the sarcoidosis group (Fig. 1A). In the control group, four tRF-1, two tRF-2, 34 tRF-3a, 26 tRF-3b, 38 tRF-5a, 23 tRF-5b, 72 tRF- 5c, and 15 tiRNA-5 were identified (Fig. 1B). In addition, the numbers of tsRNAs derived from the variable anticodon tRNAs are shown in the stacked plots (Fig. 1C-D).
Differentially expressed tsRNAs in patients with sarcoidosis vs. healthy controls were identified through RNA sequencing. All differentially expressed tsRNAs are shown in the cluster heatmap in Fig. 2A. The heatmap revealed systematic variations between the sarcoidosis and control groups in terms of tsRNA expression. Additionally, we identified 13 significantly dysregulated tsRNAs: four were upregulated, while nine were downregulated (FC > 1.5, P < 0.05; Fig. 2B and Table 4).
Table 4
Differential tsRNAs in sarcoidosis.
tsRNA | Type | Regulation | Length | Fold_Change | p_value | q_value |
tRF-Ser-GCT-109 | tRF-5b | up | 24 | 95.62724785 | 0.044775714 | 0.996329094 |
tRF-Thr-TGT-054 | tRF-3b | up | 19 | 94.37648341 | 0.04475833 | 0.996329094 |
tRF-iMet-CAT-001 | tRF-5c | up | 30 | 4.75526063 | 0.011536113 | 0.996329094 |
tRF-Gln-TTG-005 | tRF-5a | up | 16 | 3.600230008 | 0.035728136 | 0.996329094 |
tRF-Ser-TGA-037 | tRF-3b | down | 19 | 0.006946015 | 0.001563587 | 0.345742941 |
tRF-Pro-TGG-016 | tRF-5b | down | 24 | 0.01103809 | 0.019808701 | 0.996329094 |
tRF-Arg-ACG-017 | tRF-1 | down | 21 | 0.044988695 | 0.036103512 | 0.996329094 |
tiRNA-Val-CAC-002 | tiRNA-5 | down | 34 | 0.098833932 | 0.045891918 | 0.996329094 |
tRF-Ser-TGA-005 | tRF-1 | down | 17 | 0.176230811 | 0.033045783 | 0.996329094 |
tiRNA-Glu-TTC-001 | tiRNA-5 | down | 34 | 0.179020622 | 0.017044841 | 0.996329094 |
tiRNA-Lys-CTT-003 | tiRNA-5 | down | 34 | 0.191484844 | 0.001920794 | 0.345742941 |
tiRNA-Gly-GCC-002 | tiRNA-5 | down | 33 | 0.199464689 | 0.039743529 | 0.996329094 |
tRF-Ser-TGA-007 | tRF-1 | down | 19 | 0.228750737 | 0.015413429 | 0.996329094 |
Rt‑qpcr Validation Of Rna Sequencing Results
Although 13 significantly dysregulated tsRNAs were identified, only half of tsRNAs (tRF-iMet-CAT-001, tRF-Gln-TTG-005, tiRNA-Glu-TTC-001, tiRNA-Lys-CTT-003, tiRNA-Gly-GCC-002, and tRF-Ser-TGA-007) were selected to validate the RNA sequencing results using RT-qPCR in sarcoidosis and control samples. The main reasons for the above situation are as follows. First, the individual differences in the sarcoidosis group or the control group were large, resulting in a large P value of some tsRNA sequencing results, such as tRF-Ser-GCT-109, tRF-Thr-TGT-054, and tiRNA-Val-CAC-002. Therefore, the above tsRNAs were excluded. In addition, we found after reading the literature that there are also some tsRNAs that are also significantly different in other diseases, such as tiRNA-Glu-TTC in breast cancer [17] and tiRNA-Lys-CTT in prostate cancer [46]. Finally, we screened 6 more likely distinct tsRNAs for validation. The expression profiles of tiRNA-Glu-TTC-001, tiRNA-Lys-CTT-003, and tRF-Ser-TGA-007 agreed with results obtained from RNA sequencing. There were significant differences in tsRNA expression levels between patients with sarcoidosis and healthy controls (P < 0.05; Fig. 3A-C). These findings indicated that the RNA sequencing data were reliable enough to warrant further analysis.
Roc Analysis Of Differentially Expressed Tsrnas In Sarcoidosis
ROC analysis of differentially expressed tsRNAs in sarcoidosis
The results of qPCR were used to construct the ROC curve to evaluate the diagnostic value of tiRNA-Glu-TTC-001, tiRNA-Lys-CTT-003, and tRF-Ser-TGA-007 in sarcoidosis (Fig. 3D-F). Compared with healthy subjects, the AUC of tiRNA-Glu-TTC-001 in sarcoidosis was 0.705 (95% confidence interval [CI]: 0.531–0.878). The tiRNA-Lys-CTT-003 in sarcoidosis was 0.765 (95% CI: 0.611–0.920). The AUC of tRF-Ser-TGA-007 in sarcoidosis was 0.808 (95% CI: 0.672–0.944).
Correlation Between Differentially Expressed Tsrnas And The Clinical Parameters Of Sarcoidosis
The relative expression levels of tiRNA-Glu-TTC-001 were negatively correlated with age (P < 0.01; Fig. 4A). The relative expression levels of tiRNA-Lys-CTT-003 were significantly correlated with age and the number of affected systems (P < 0.01; Fig. 4B-C). There was a significant positive correlation between the relative expression of tRF-Ser-TGA-007 and blood calcium levels (P < 0.01; Fig. 4D).
Prediction Of Target Genes Of Differentially Expressed Tsrnas
Although the mechanism of tsRNA action is yet to be elucidated, recent evidence suggests a miRNA-like mode of action, and bioinformatic prediction of tsRNA (whether tRF or tiRNA) will still use similar miRNAs for prediction in some studies [46–50]. miRNAs can recognise their mRNA targets using their seed sequence (positions 2-7nt at their 5ʹ ends) and suppress global mRNA translational activities. Based on this rule, we used two types of algorithms, TargetScan and miRDB, to predict tsRNA targets. TargetScan and miRDB are two softwares for predicting ncRNA binding sites, which is very effective for predicting ncRNA (including miRNA) binding sites in mammals. Before predicting ncRNA target genes, the 3'UTR region of the transcript needs to be determined first. The databases use a sequencing technology called 3P-seq to determine the 3'UTR region corresponding to the transcript (some tsRNAs in mammals are transcribed by binding the 3'UTR region of this sequence plays a role in post-transcriptional regulation), and combined with the analysis results of this technology and the existing 3'UTR annotation in NCBI, a comprehensive 3'UTR region sequence is provided. Additionally, to reduce false-positive results, only genes predicted by all two software programs were considered targets of tsRNAs. In total, tsRNAs that target 646 mRNAs as predicted by miRDB, tsRNAs that target 228 mRNAs as predicted by TargetScan, and the two target a total of 96 mRNAs (Fig. 5B); tiRNA-Glu-TTC-001, tiRNA-Lys-CTT-003, and tRF-Ser-TGA-007 could target 637, 111, and 39 transcripts, respectively (Fig. 5A and Table S1).
Biological Function Analysis Revealing Potential Therapeutic Effects Of Differentially Expressed Tsrnas
The tsRNAs regulated mRNA translational activities, therefore we conducted bioinformatics analyses of the functions of target mRNAs to understand their biological functions. We used GO biological processes and KEGG pathways to explore the functions of target mRNAs. The target mRNA of tiRNA-Glu-TTC-001 was enriched in BP and involved in the regulation of transcription, DNA-templated, transcription, negative regulation of transcription from RNA polymerase II promoter, Ras protein signal transduction, heterophilic cell-cell adhesion via plasma membrane cell adhesion molecules, transport, ERK1 and ERK2 cascade, positive regulation of transcription from RNA polymerase II promoter, small GTPase-mediated signal transduction, brain development, ER to Golgi vesicle-mediated transport, and cell proliferation (Fig. 6A). The CC comprised intracellular, nucleus, cis-Golgi network, Golgi membrane, receptor complex, chromatin, early endosome, and actin cytoskeleton (Fig. 6B), while MF included protein binding, metal ion binding, nucleic acid binding, transcription factor activity, sequence-specific DNA binding, DNA binding, RNA polymerase II core promoter proximal region sequence-specific DNA binding, chromatin binding, protein serine/threonine kinase activity, and transcription corepressor activity (Fig. 6C).
The target mRNA of tiRNA-Lys-CTT-003 was enriched in BP, involved in transcription regulation, DNA-templated, regulation of cell cycle, DNA damage response, signal transduction by p53 class mediator resulting in cell cycle arrest, transcription from RNA polymerase II promoter, megakaryocyte development, platelet alpha granule organisation (Fig. 6D), in CC, comprising the nucleus (Fig. 6E), and MF, in RNA polymerase II core promoter proximal region sequence-specific DNA binding, metal ion binding, transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding (Fig. 6F).
The target mRNA of tRF-Ser-TGA-007 was enriched in BP involved in negative regulation of transcription from RNA polymerase II promoter, positive regulation of transcription from RNA polymerase II promoter, negative regulation of vascular endothelial growth factor receptor signalling pathway, positive regulation of IRE1-mediated unfolded protein response, activation of signalling protein activity involved in unfolded protein response, negative regulation of protein serine/threonine kinase activity, cellular response to vascular endothelial growth factor stimulus (Fig. 6G). The CC comprised cerebellar mossy fibre, cell-cell adherens junction (Fig. 6H), and MF in transcription factor activity, RNA polymerase II distal enhancer sequence-specific binding, transcription factor activity, RNA polymerase II distal enhancer sequence-specific binding, and protein phosphatase 2A binding (Fig. 6I).
KEGG analysis revealed that the target mRNA of tiRNA-Glu-TTC-001 was particularly enriched in pathways in cancer, chemokine signalling, cAMP signalling, Rap1 signalling, endocytosis, Ras signalling, cGMP-PKG signalling, retrograde endocannabinoid signalling, FoxO signalling, adrenergic signalling in cardiomyocytes, and inflammatory mediator regulation of TRP channels (Table 5), while no KEGG pathway was associated with the target mRNAs of tiRNA-Lys-CTT-003 and tRF-Ser-TGA-007 (P < 0.05).
Table 5
KEGG pathway analysis of the target mRNA of tiRNA-Glu-TTC-001.
Pathway ID | Name | Count | % | P Value | Genes |
hsa05200 | Pathways in cancer | 23 | 0.026752 | 0.001234 | NTRK1, MAP2K1, SMAD4, FZD5, TPM3, PTGER2, LAMB4, GNAI3, PIK3R3, ADCY1, IGF1, CBL, TGFBR2, HSP90B1, MAPK10, FGF5, FGF14, CXCL12, PLCB4, PAX8, GNB5, PLCB2, PRKACB |
hsa04062 | Chemokine signaling pathway | 14 | 0.016284 | 0.001914 | MAP2K1, GNAI3, PIK3R3, ADCY1, ARRB2, VAV2, RAP1B, CXCL12, PLCB4, GRK6, CCR6, GNB5, PLCB2, PRKACB |
hsa04024 | cAMP signaling pathway | 14 | 0.016284 | 0.003336 | RYR2, MAP2K1, ADCYAP1R1, PTGER2, GNAI3, ATP1B4, PIK3R3, ADCY1, VAV2, MAPK10, RAP1B, PRKACB, CNGB1, GRIA4 |
hsa04015 | Rap1 signaling pathway | 14 | 0.016284 | 0.005515 | VASP, MAP2K1, FLT1, GNAI3, PIK3R3, ADCY1, IGF1, MAPK13, FGF5, RAP1B, FGF14, PLCB4, PRKD1, PLCB2 |
hsa04144 | Endocytosis | 14 | 0.016284 | 0.016358 | ARF3, IQSEC2, WIPF2, CLTC, AP2B1, VPS26B, ARRB2, CBL, HLA-F, RAB22A, TGFBR2, EEA1, GRK6, SMAP2 |
hsa04014 | Ras signaling pathway | 13 | 0.01512 | 0.023119 | MAP2K1, FLT1, GAB1, PIK3R3, IGF1, MAPK10, FGF5, RAP1B, FGF14, ABL2, GNB5, RGL2, PRKACB |
hsa04022 | cGMP-PKG signaling pathway | 12 | 0.013957 | 0.004367 | VASP, MEF2A, ATF2, MAP2K1, PLCB4, IRS1, GNAI3, KCNMB3, ATP1B4, ADCY1, PLCB2, CNGB1 |
hsa04723 | Retrograde endocannabinoid signaling | 11 | 0.012794 | 4.61E-04 | MAPK10, PLCB4, DAGLA, GABRA4, GNAI3, GNB5, ADCY1, PLCB2, PRKACB, MAPK13, GRIA4 |
hsa04068 | FoxO signaling pathway | 11 | 0.012794 | 0.003971 | MAPK10, USP7, MAP2K1, SMAD4, HOMER1, IRS1, PIK3R3, IGF1, SIRT1, MAPK13, TGFBR2 |
hsa04261 | Adrenergic signaling in cardiomyocytes | 11 | 0.012794 | 0.004894 | RYR2, ATF2, PLCB4, PPP1R1A, TPM3, GNAI3, ATP1B4, ADCY1, PLCB2, PRKACB, MAPK13 |
Hub Gene Analysis
The top ten genes of each sub-plugin in the PPI network of the target mRNAs of tiRNA-Glu-TTC-001, tiRNA-Lys-CTT-003, and tRF-Ser-TGA-007 were identified using the cytoHubba plugin, respectively (Table 6). The ten genes with the highest frequency in all sub-plugins were classified as hub genes. KEGG analysis of the hub genes showed their involvement in the insulin signalling pathway (hsa04910), chemokine signalling pathway (hsa04062), regulation of lipolysis in adipocytes (hsa04923), bacterial invasion of epithelial cells (hsa05100), neurotrophin signalling pathway (hsa04722), AMPK signalling pathway (hsa04152), platelet activation (hsa04611), FoxO signalling pathway (hsa04068), cAMP signalling pathway (hsa04024), proteoglycans in cancer (hsa05205), and pentose phosphate pathway (hsa00030).
Table 6
The top 10 Hub genes of differential tsRNAs by cytoHubba
tsRNA | Top ten Hub genes |
tiRNA-Glu-TTC-001 | CLTC APP SIRT1 PRKACBCBL RAP1BARRB2IRS1 ACTR2 PIK3R3 |
tiRNA-Lys-CTT-003 | KIAA0101 CDT1GTSE1KMT2DOPA3 FBXL19 SPC24 PCGF3 KDM2A KDM6B |
tRF-Ser-TGA-007 | NR5A1 SPI1 TKTL2 FNBP1L GPI FMN1 MNDA NCOR2 SCARB1 NFIX |
We used AUC analysis to evaluate their potential for diagnosing sarcoidosis using the expression data from the GSE19314 database. The hub genes associated with tiRNA-Glu-TTC-001 were all expressed in the serum. The AUCs of CLTC, APP, PRKACB, RAP1B, ARRB2, IRS1, ACTR2, and PIK3R3 were all greater than 0.5, of which APP, PRKACB, and ARRB2 were 0.707, 0.728, and 0.714, respectively (Fig. 7A). Except for KIAA0101, the hub genes associated with tiRNA-Lys-CTT-003 were all expressed in the serum of patients with sarcoidosis. The AUCs of nine genes were all greater than 0.5 (Fig. 7B). The hub gene associated with tRF-Ser-TGA-007 was expressed in the serum. The AUCs of NR5A1, SPI1, TKTL2, FMN1, MNDA, and NCOR2 were all greater than 0.5, of which NR5A1 and MNDA were 0.712 and 0.726, respectively (Fig. 7C). The AUCs > 0.5 were considered statistically significant. These genes have higher diagnostic potential since they were all greater than 0.70.
RNAhybrid software was used to construct a base complementary pairing model between tiRNA-Glu-TTC-001 and APP, PRKACB, and ARRB2 (Fig. 8A-C), and between tRF-Ser-TGA-007 and NR5A1 and MNDA, respectively. There was no base complementary pairing model between tRF -Ser-TGA-007 and MNDA (Fig. 8D).