IPF This is a group of diffuse parenchymal lung diseases of unknown cause, and the mechanisms responsible for the fibrotic process and structural disorders of lung tissue are still unclear[27]. The fibrotic process of IPF is irreversible and there is no effective treatment. Statistically, nearly one third of patients die within 3 years of disease diagnosis[28]. Due to its extremely high mortality rate, there is an urgent need for an effective biomarker for early diagnosis and treatment of the disease. The endoplasmic reticulum (ER), a key organelle in eukaryotic cells, can induce endoplasmic reticulum stress (ERS) when there is an accumulation of misfolded proteins in the endoplasmic reticulum. ERS can activate the unfolded protein response (UPR), a cytoprotective response that removes misfolded proteins to protect the cell, but it can also trigger cell death when the ERS is overloaded[29]. In addition, ERS can cause immune and inflammatory reactions through some signal pathways[30, 31]. It has been shown that IPF is closely associated with ERS. Therefore, finding ERS-related biomarkers and revealing their correlation with IPF could provide important information for early diagnosis and treatment.
In this study, we comprehensively evaluated the expression of ERS-related genes in normal and IPF tissues and identified a total of 65 ERS-related DEGs. we then explored the biological functions of the 65 ERS-related DEGs. GO analysis showed a strong link between DEGs and biological processes connected to ERS. KEGG analysis similarly demonstrated the involvement of ERS in the pathophysiological pathway of IPF. For example, prostacyclin (IP) receptor agonist (ACT-333679) is able to suppress YAP/TAZ-dependent fibrosis gene transcription by elevating cAMP[32], thereby inhibiting pulmonary fibrosis. In addition, local hypoxia can lead to ER stress, induced CHOP expression, increased apoptosis of type II AEC cells and enhanced pulmonary fibrosis[33].
Machine learning is an emerging discipline. It is able to find patterns in observed data and use these patterns to make predictions about unknown data. Machine learning has now been applied in a large number of clinical studies. It was reported that a random forest algorithm was used to screen for iron death-related biomarkers in IPF by He J, et al[34]. However, different machine learning algorithms can produce different computational results, so this study uses three algorithms, SVM-RFE, LASSO, and RF, together to screen genes with excellent predictive power. SVM-RFE is an SVM-based machine learning method that generates optimal choices by subtracting the feature vectors generated by SVM [35]. LASSO regression is another machine learning method that selects variables by picking the value of λ with the smallest classification error [36]. The random forest algorithm can create binary subtrees from the training samples generated by bootstrap, and then merge the prediction results of multiple decision trees to finally obtain the importance ranking of each predictor [37]. In this study, two key ERS-related DGEs (COMP, GPX8) were further screened by WGCNA analysis and the three machine learning algorithms mentioned above and validated in the validation set.
Cartilage oligomeric matrix protein (COMP) is an extracellular matrix (ECM) protein that has been found to be expressed in a variety of tissues: fibroblasts, tendon, cartilage, etc[38–41]. Currently, COMP has been shown to play a role in a variety of fibrotic diseases. In the liver, COMP increases type 1 collagen synthesis in hepatic stellate cells via CD36 receptor signaling and activation of the MEK1/2-per k1/2 pathway, promoting liver fibrosis[42]. And in IPF, COMP has the same fibrogenic effect[43]. COMP was highly expressed in IPF tissues, which is consistent with our findings. In an in vitro assay, stimulation of normal human lung fibroblasts with TGF-β1 induced an increase in COMP expression, and silencing of COMP reduced the expression of TGF-β1 target genes and inhibited fibroblast proliferation. In addition, Vuga LJ et al. found that COMP concentrations in the blood of IPF patients correlated with the severity of disease in IPF patients[43]. GPX8 is the last member of the glutathione peroxidase (GPx) protein family to be identified, a type II transmembrane protein located in the ER[44]. GPx is able to break down peroxides into alcohols and limit intracellular deposition of reactive oxygen species (ROS)[45]. However, GPX8 exhibits low GPx activity [44]. The physiological function of GPX8 is unknown. It was found that the expression level of GPX8 could affect Ca2 + homeostasis and signaling, in which the transmembrane structural domain (TMD) of GPX8 was critical for Ca2 + homeostasis[46]. Interestingly, disruption of Ca2 + homeostasis can promote pulmonary fibrosis[47]. In addition, in mesenchymal-like cells (MDA-MB-231), GPX8 knockdown was able to lead to epithelial-like morphological changes, downregulation of EMT characteristics[48]. This further reveals the potential connection between GPX8 and IPF.
Immune cells have been shown to be associated with the development and progression of IPF [49]. In this study, Plasma cells (PCs), Macrophages M0 were more infiltrated in IPF tissues, NK cells activated, Monocytes, Eosinophils were low infiltrated. Plasma cells, also known as effector B cells, specifically express CD138 on their surface. Prele CM et al. also found high infiltration of CD138 + cells in lung tissue from IPF patients and mouse models of pulmonary fibrosis, and the use of bortezomib reduced PCs infiltration and inhibited bleomycin-induced pulmonary fibrosis[50]. Macrophages are the highest proportion of lung immune cells (about 70%) and play an important role in the development of pulmonary fibrosis[51]. However, the role of M0 macrophage infiltration has not been fully demonstrated. This may provide a new direction for further study of pulmonary fibrosis. PF543, a specific sphingosine kinase 1 (SPHK1) inhibitor, was found to reduce lung fibrosis by reducing mtDNA damage and monocyte recruitment in lung epithelial cells[52]. Similarly, monocyte infiltration was found to be less in our study, which could be a potential pathogenic mechanism for IPF. Schwartz DA et al. found more eosinophils in bronchoalveolar lavage fluid (BALF) in the IPF patient group than in the control group[53]. This phenomenon differs from the results of the present study and may be due to differences in samples and detection techniques, which require further validation. In the analysis of the correlation between immune cells and key markers, Comp and GPX8 were negatively correlated with monocytes and neutrophils and positively correlated with plasma cells. It's interesting to note that these 2 genes did not always exhibit the same link with various immune cell subpopulations; for example, COMP was inversely connected with T cells CD4 memory resting. In contrast, GPX8 was negatively correlated with T cells CD8 and positively correlated with Macrophages M2. Although our findings on the correlation of COMP and GPX8 with immune infiltration suggest that these genes may be involved in the regulation of immune cells, the exact mechanisms remain unclear and need to be further investigated.
Our research still has certain shortcomings, however. First, in order to increase the dependability of the study's findings, the sample size must be increased further. Second, relevant in vivo and in vitro research are still needed to support the findings of this study.