IVDD is a common degenerative disease with a complex etiology that poses many challenges to medicine and society. Exploring the potential mechanisms of IVDD and defining it with more specific and sensitive features for clinical diagnosis and early intervention is imperative [30]. As research progresses, we will not only be able to target disease markers based on sequencing data, but we can fully investigate their hidden mechanisms and operational characteristics. In our study, we used two independent GEO microarray datasets (GSE34095 and GSE147383) for the screening of DEGs and hundreds of DEGs were screened and identified. A large number of relevant entries were enriched using GO enrichment analysis. Among the overlapping GO terms were collagen-containing extracellular matrix, cohesin complex and endosome membrane. The extracellular matrix (ECM) [31,32] is defined as a complicated macromolecular network composed of collagen, proteoglycan, cohesin and several other glycoproteins that not only plays a supportive, protective and nutritional role for tissue cells, but is also closely related to basic life activities such as cell proliferation, differentiation, metabolism, recognition, adhesion and migration. Degradation of the intervertebral disc ECM is one of the main pathological changes of IVDD in several studies. [33,34] Yao M [35] et al. used Marein as a protective agent against intervertebral disc ECM degeneration and effectively resisted apoptosis of NP cells. Neidlinger-Wilke C [36] et al. revealed that ECM remodeling altered the mechanical microenvironment of the IVDD thereby compromising disc function. Endosomes are small vesicles produced by cells through endocytosis and this sorting function is essential for critical cellular functions [37]. Our enrichment results confirm that the biological processes of most DEGs are consistent with previous evidence and provide a theoretical basis for subsequent analysis.
When exploring the genetic biomarkers of diseases, previous studies often only used the microarray data from the disease group and the control group for simple difference comparison analysis. Future bioinformatics research needs to dig deeper into statistical results based on existing clinical and biologic theory. We used two other datasets (GSE23130 and GSE70362), that included pfirmann grades of IVDD to validate the DEGs. Pfirmann grading system is a common and well-established tool to evaluate degenerative abnormalities on T2-weighted sagittal MRI in daily clinical practice. It differentiates IVDD into 5 levels based on the signal of the central NP and peripheral NF and the height of the intervertebral space. We included pfirmann grades as outcome variables for assessing DEGs. On the one hand, the introduction of external data can avoid false positive or negative results. On the other hand, using pfirmann grades also makes the conclusions more relevant to clinical practice. And then, we employed the idea of variable screening of prediction models to further screen out genes with predictive value. With fewer variables included, we used a best subset selection regression and identified our final 3 key genes based on Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC). The ROC plots of prediction models incorporating these 3 genes show that the maximum AUC can reach 0.78, suggesting that the key genes in our screen are predictive of the progression of the disease course of IVDD.
We analyzed the key genes based on existing databases such as GeneCards (http://www.genecards.org/). leucine rich pentatricopeptide repeat containing (LRPPRC) encodes a kind of leucine-rich protein that has multiple pentatricopeptide repeats (PPR) [38]. The role of this protein is unknown and may play a role in cytoskeletal organization, vesicular transport, or in transcriptional regulation of both nuclear and mitochondrial genes. Various studies demonstrate that high expression of LRPPRC is associated with poor prognosis in a variety of cancers, such as bladder urothelial carcinoma [39], lung cancer [40] and pancreatic cancer [41]. Maimaiti [42] et al. defined LRPPRC as a N6-methyladenosine (m6A) modification for intracranial aneurysms. Ghavami S [43] et al. reported that LRPPRC was associated with multiple neurodegenerative diseases via autophagy and apoptosis. In our study, LRPPRC is considered to be upregulated in the degenerated disc tissue and is expressed in endothelial cells and fibroblasts in addition to different types of chondrocytes. Accordingly, we speculate that LRPPRC may also be involved in the pathological changes of IVDD in an autophagic or apoptotic pathway, with the involvement of multiple cells.
Gremlin1 (GREM1) is known as a member of the transforming growth factor-β (TGF-β) signalling family encoding antagonists to bone morphogenetic protein (BMP) which may play a part in regulating organogenesis, body patterns and tissue differentiation and its related pathways include mainly angiogenesis (CST) and BMP signalling [44,45]. HKišonaitė M [46] et al. concluded that GREM1 can inhibit BMP2-mediated osteoblast differentiation from in vitro studies. Kobayashi H [47] et al. identified GREM1 and Islr as CAF-specific genes involved in BMP signaling and drived Colorectal Carcinogenesis. Shunlun Chen [48] et al. found that GREM1 promoted myeloid apoptosis and IVDD through inhibition of TGF-β-mediated Smad2/3 phosphorylation. This is coherent with our findings. The role BMP plays in IVDD is unclear though and it is characterized by a wide range and versatility of BMP and its signal pathways and inhibitors. Hollenberg AM [49] et al. demonstrated a correlation between the expression level of BMP-2 and Pfirmann grades, while Haschtmann D [50] et al. concluded that BMP-2 resulted in an up-regulation of Col I and type II, and of aggrecan gene expression. Most current studies believe that BMP can promote the ossification of AF. However, it is not clear whether it can be a unique treatment for IVDD and the effect of BMP in overall disc tissue remains to be investigated. We learned from the prediction models that GREM1 was the strongest predictor among the 3 genes so it will also be the centre of our research and discussion afterwards. Our study firstly proved the potential of GREM1 as a gene marker for IVDD, which is also in line with several previous studies [51,52]. GREM1 can affect cellular function by multiple mechanisms in addition to acting as a type of BMP inhibitor [53,54,55]. Whether GREM1 could also ameliorate the development of IVDD is another worthwhile studying field.
SLC34A4 encodes a member of the zinc/iron-regulated transporter-like protein (ZIP) family which localizes to cell membranes and is mostly required for zinc uptake in the intestine. Among its related pathways are Metal ion SLC transporters and Transport of inorganic cations/anions and amino acids/oligopeptides (http://www.genecards.org/). The main diseases associated with SLC39A4 are Acrodermatitis Enteropathica [56], Zinc-Deficiency Type Acrodermatitis [57], other than that there are relatively few studies. Zinc is an essential element for human and is involved in the maintenance of many metabolic functions. Recent studies have linked metabolic changes to cell fate. Liang J [58] et al. discovered that type II alveolar epithelial cell ZIP8 deficiency in young mice results in reduced precursor cell function and impaired self-renewal. Chen PH [59] et al. identified an unexpected role for ZIP7 in iron death by maintaining endoplasmic reticulum homeostasis, and these findings may have therapeutic implications for diseases involving iron death and zinc dysregulation. Few studies have reported the relationship between genes related to zinc metabolism and IVDD. Staszkiewicz R [60] et al. investigated the relationship between the concentration of local metal ions in the patient's disc tissue and the degree of progression of IVDD and found that the strongest relationships were noted between the concentrations of zinc. Our study suggests that down-regulation of SLC39A4 may be another important feature of IVDD and we hypothesize that abnormalities in SLC39A4 lead to an imbalance in zinc metabolism inside and outside the disc tissue cells which may promote IVDD through several pathways such as programmed cell death. The exact mechanism and principle of this is subject to further experimental verification.
Current studies on miRNAs in IVDD have confirmed that a variety of miRNAs play critical roles in the process of IVDD through apoptosis, aberrant proliferation, inflammatory response and ECM degradation [61,62,63]. Another major research interest in miRNA is its important role in exosome therapy [64]. Exosome therapy is achieved by direct in vitro injection of extracellular vesicles (EVs) containing miRNAs or by building vectors to intervene in cellular metabolism using paracrine signaling regulation [65,66]. However, caution must be exercised when studying the application of miRNAs, as miRNA-mRNA and miRNA-miRNA regulatory pathways may not be the same in different tissues [67], and misuse may lead to imbalance in ceRNA regulatory network. In this study, we successfully screened meaningful mRNA-miRNA pairs (GREM1-mir-665, LRPPRC-mir-107, LRPPRC/SLC39A4-mir-484, LRPPRC/SLC39A4-miR-103a-3p) for these 3 key genes by combining the miRNA target prediction database miRnet and external miRNA microarray dataset GSE63492. The application and validation of these miRNAs need further experimental proof.
The pattern of immune infiltration is another crucial issue. Our results on immune infiltration by ssGSEA in patients under different pfirmann grades show a decreased infiltration of effector memeory CD4 T cell, pDC,hiDCs and a increased infiltration of TH1, TH17, MDSC, macrophage and pDCs in disc tissues of high degeneration grade relative to low degeneration grade. Interesting is the simultaneous association of GREM1 with pDCs, hiDCs and SLC39A4 with immature B cells. On the one hand, infiltration of immune cells in degenerated discs can further amplify the inflammatory cascade response [68,69], and on the other hand, immune cells may lead to vascular invasion and release of neurogenic factors [70]. Our study confirmed the infiltration of some immune cells in IVDD and the possible mechanisms [71,72]. However, the role of dendritic cells (DCs) in IVDD has been poorly studied. We hypothesize that the migration and antigen-presentation functions of DCs are critical for disc tissue inflammation initiation and tolerogenic immune responses [73]. However, this must also be tested in future experimental studies.
The current study still has many limitations. For example, we depended on our own clinical experience to devide pfirmann grades into low degeneration and high degeneration groups, which was a reluctant move limited by the sample size and made the outcome variables less detailed. Secondly, the analysis of single-cell transcriptome data lacked the corresponding control group. We only used p-values to screen for significant miRNAs, which may result in a degree of false positives. We relied on available data that also lacked other clinical phenotypes of the patients, such as comorbid underlying diseases, which may have led to errors in the results of the difference analysis. In addition, in the single-cell sequencing clustering step, we divided the cells mainly into chondrocytes and non-chondrocytes. Our future studies will incorporate other omic approaches that incorporate genomics, proteomics and metabolomics, complemented by appropriate experiments, to more fully address the relevant role of IVDD biomarkers.