Highly immune-related genes of breast cancer: potential diagnostic and prognostic biomarkers

doi:10.21203/rs.3.rs-2326101/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

Abstract

Although immune checkpoint inhibition (ICI) has shown therapeutic promise in breast cancer, its efficacy varies considerably among patients. This study therefore aimed to identify biomarkers for selecting the patients most likely to benefit from immunotherapy. Differentially expressed genes in the Cancer Genome Atlas (TCGA) breast cancer dataset were first identified using the R package limma and then intersected with the lists of immune-related genes obtained from the ImmPort and InnateDB databases, yielding 542 immune-related differentially expressed genes for breast cancer. Twelve immune-related hub genes and three independent prognostic genes (S100B, NPR3, and SDC1) were then identified by weighted gene coexpression network analysis (WGCNA) and multivariate Cox regression analysis, respectively. The accuracy of the prognosis prediction model built from these three genes, the immune-related gene risk score (IRGRS), was verified in four GEO datasets. In addition, we estimated the stromal and immune components in the high- and low-risk score groups and found that the low-risk score group had a higher Immune Score and a better prognosis. Drug response prediction also showed that the IC50 values of bleomycin, gemcitabine, lapatinib, and paclitaxel were lower in the low-risk score group than in the high-risk score group. The IRGRS constructed in this study may help differentiate the prognostic, molecular, and immunological features of breast cancer.

Keywords: breast cancer, immune-related genes, WGCNA, prognosis

Introduction

Breast cancer is the most common cancer worldwide. According to GLOBOCAN, there were approximately 2.3 million new diagnoses and 690,000 deaths worldwide in 2020, accounting for 11.7% of new cancer cases and 6.9% of cancer deaths (Sung et al. 2021). With the development of immune checkpoint inhibitors (ICIs) and validation of the immunogenicity of breast cancer, immunotherapy has been widely used in the treatment of breast cancer, and therapies targeting programmed cell death protein 1 (PD-1) and programmed death-ligand 1 (PD-L1) have been shown to be effective. In combination with standard neoadjuvant chemotherapy regimens for early-stage triple-negative breast cancer (TNBC), pembrolizumab and atezolizumab offer higher rates of pathologic complete response (pCR) than standard neoadjuvant chemotherapy alone, regardless of PD-L1 status (Nanda et al. 2020; Mittendorf et al. 2020). A phase III trial of pembrolizumab in combination with chemotherapy showed that this first-line treatment is well tolerated by patients with metastatic PD-L1-positive TNBC and improves progression-free survival (Cortes et al. 2020). Based on the findings of the IMpassion130 study, atezolizumab in combination with nab-paclitaxel has become the standard of care for locally advanced or metastatic PD-L1-positive TNBC (Schmid et al. 2020). Findings from several studies have also driven the development of immunotherapy in hormone receptor-positive/human epidermal growth factor receptor 2-negative breast cancer. Pembrolizumab combined with standard neoadjuvant chemotherapy regimens more than doubled the pCR rate in one study (Nanda et al. 2020), and anthracycline-based chemotherapy followed by nivolumab and endocrine therapy achieved a pCR rate of 16.3% (Dieci et al. 2022). Nevertheless, the effect size of ICIs in breast cancer remains controversial. Therefore, developing more refined biomarkers to identify the patients most likely to benefit from immunotherapy is a priority in the field.

In this study, we used bioinformatics methods to explore potential biomarkers and prognostic models related to breast cancer immunotherapy at the transcriptome level.

Materials And Methods

Patients and datasets

Ribonucleic acid sequencing (RNA-seq) and clinical data for 1164 breast tissue samples, comprising 1053 cancer and 111 paracancerous samples, were downloaded from the Cancer Genome Atlas (TCGA) database (HTTPS://portal.gdc.cancer.gov/projects/TCGA-BC). Gene expression data and survival information for 887 breast cancer samples (GSE17705, GSE58812, GSE22219, and GSE21653) were then downloaded from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/). The list of immune-related genes was downloaded from the ImmPort (https://www.immport.org/shared/home) and InnateDB (https://www.innatedb.com/) databases.

Identification Of Immune-related Hub Genes

Differentially expressed genes (P < 0.05, |log2(fold change [FC])| > 1) were identified from the RNA-seq data of TCGA breast cancer samples using the R package limma. They were then intersected with the lists of immune-associated genes obtained from the ImmPort and InnateDB databases to obtain immune-associated differentially expressed genes. Gene Ontology (biological process, cellular component, and molecular function) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses of the immune-associated differentially expressed genes were performed using the clusterProfiler package in R.
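The filtering and intersection step described above can be illustrated with a minimal sketch. It is written in plain Python rather than with limma, and it only mirrors the thresholding logic (P < 0.05, |log2FC| > 1) and the set intersection; the gene names, statistics, and the `select_degs` helper are hypothetical and are not taken from the study.

```python
# Illustrative sketch only: the study used the R package limma. This version
# reproduces just the cut-offs and the intersection on made-up toy data.

def select_degs(stats, p_cutoff=0.05, lfc_cutoff=1.0):
    """Split genes passing P < p_cutoff and |log2FC| > lfc_cutoff by direction."""
    up, down = set(), set()
    for gene, (log2fc, p_value) in stats.items():
        if p_value < p_cutoff and abs(log2fc) > lfc_cutoff:
            (up if log2fc > 0 else down).add(gene)
    return up, down

# Hypothetical per-gene results: gene -> (log2 fold change, P value)
toy_stats = {
    "GENE_A": (2.3, 0.001),
    "GENE_B": (-1.8, 0.010),
    "GENE_C": (0.4, 0.0001),  # significant but small effect: excluded
    "GENE_D": (1.5, 0.200),   # large effect but not significant: excluded
    "GENE_E": (-1.2, 0.030),
}

# Hypothetical stand-in for the immune-related gene list (ImmPort, InnateDB)
toy_immune_genes = {"GENE_A", "GENE_C", "GENE_E", "GENE_Z"}

up, down = select_degs(toy_stats)
immune_up = up & toy_immune_genes      # immune-related, up-regulated in tumor
immune_down = down & toy_immune_genes  # immune-related, down-regulated in tumor
print(sorted(immune_up), sorted(immune_down))
```

How the two database lists were combined before the intersection is not stated in the text, so the single set used here is an assumption.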

The hub genes were then identified using weighted gene coexpression network analysis (WGCNA), which divides the gene coexpression network of a complex biological process into modules of highly correlated genes and associates these modules with specific phenotypes, so that core genes performing key functions can be identified (Langfelder et al. 2008). An adjacency matrix was obtained by calculating the Pearson correlation coefficient between the expression profiles of each pair of genes and weighting the correlation coefficients. The topological overlap matrix (TOM) was then calculated; its values reflect the similarity of coexpression between two genes.
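As a rough illustration of the adjacency and topological overlap calculations, the sketch below uses plain Python on a three-gene toy matrix. It assumes an unsigned network, in which the adjacency is the absolute Pearson correlation raised to the soft-thresholding power (the text does not state the network type), and uses the standard topological overlap formula. The expression values are hypothetical; the actual analysis was run with the WGCNA R package.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def adjacency(expr, power):
    """Unsigned weighted adjacency a_ij = |cor_ij| ** power, zero diagonal."""
    g = len(expr)
    adj = [[0.0] * g for _ in range(g)]
    for i in range(g):
        for j in range(i + 1, g):
            adj[i][j] = adj[j][i] = abs(pearson(expr[i], expr[j])) ** power
    return adj

def topological_overlap(adj):
    """TOM_ij = (sum_u a_iu * a_uj + a_ij) / (min(k_i, k_j) + 1 - a_ij)."""
    g = len(adj)
    k = [sum(row) for row in adj]  # connectivity of each gene
    tom = [[1.0 if i == j else 0.0 for j in range(g)] for i in range(g)]
    for i in range(g):
        for j in range(i + 1, g):
            shared = sum(adj[i][u] * adj[u][j]
                         for u in range(g) if u != i and u != j)
            tom[i][j] = tom[j][i] = (shared + adj[i][j]) / (
                min(k[i], k[j]) + 1.0 - adj[i][j])
    return tom

# Hypothetical expression profiles (rows: genes, columns: samples)
toy_expr = [
    [1.0, 2.0, 3.0, 4.0, 5.0],
    [2.0, 4.0, 6.0, 8.0, 10.0],
    [5.0, 3.0, 4.0, 1.0, 2.0],
]

adj = adjacency(toy_expr, power=5)  # power 5 as reported in the Results
tom = topological_overlap(adj)
dissim = [[1.0 - t for t in row] for row in tom]  # 1-TOM, used for clustering
```

The dissimilarity matrix `dissim` corresponds to the 1-TOM quantity on which the genes are clustered in the next step.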

The genes were clustered based on the topological overlap-based dissimilarity, 1-TOM, and modules were identified with the dynamic tree cut method. Four modules were identified with the module merging threshold set to 0.25. Edges between two genes with a weight > 0.2 were used to construct the network from the genes of the most significantly related module (turquoise module). The 12 immune-related hub genes that significantly affected prognosis were then screened from the turquoise module genes by combining survival information with univariate Cox regression analysis in the R package survival (P < 0.05).

Constructing The IRGRS With Multivariate Cox Regression Analysis

Among the 12 immune-related hub genes, genes that independently and significantly affected overall survival (OS) were screened by multivariate Cox regression analysis. The multivariate Cox regression model for the immune-related gene risk score (IRGRS) was calculated according to the following formula: $\text{risk score} = \sum_{i=1}^{n} (\text{Expr}_i \times \text{coef}_i)$, where $\text{Expr}_i$ represents the expression value of gene $i$, and $\text{coef}_i$ represents the univariate Cox regression coefficient of gene $i$. TCGA breast cancer samples were divided into high- and low-risk groups using the median risk score. The GEO cohorts (GSE17705, GSE58812, GSE22219, and GSE21653) were used as validation sets to verify the reliability of the results.

Subtypes Analyses Of High- And Low-IRGRS Subgroups

The Immune Subtype Classifier package of R was used to analyze the differences in immune subtypes between the high- and low-IRGRS subgroups.

Prediction Of Stromal And Immune Components In High- And Low-IRGRS Subgroups

ESTIMATE is a tool for predicting the presence of infiltrating stromal/immune cells in tumor tissues using gene expression data. Higher Stromal, Immune, and Estimate Scores indicate larger stromal and immune components in the tumor microenvironment (TME) (Yoshihara et al. 2013). The “estimate” R package was used to estimate the proportions of immune and stromal components in the high- and low-IRGRS subgroups.

Drug Response Prediction

The half-maximal inhibitory concentration (IC50) reflects drug response. We used the “pRRophetic” package to estimate the IC50 of different drugs in the high- and low-IRGRS subgroups (Geeleher et al. 2014).

Statistical analysis

Statistical analyses were performed using R version 4.1.1. The independent t test was used to compare continuous variables between two groups, and the χ2 test was used for categorical data. Univariate survival analysis was performed using Kaplan-Meier (K-M) survival analysis and log-rank tests. Multivariate survival analysis was performed using Cox regression models. A two-sided P < 0.05 was considered statistically significant.
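For readers less familiar with Kaplan-Meier analysis, the product-limit estimator behind the survival curves can be sketched in a few lines of plain Python. The follow-up times and event indicators below are hypothetical toy values, and the `kaplan_meier` helper is an illustrative name; the study itself used R.

```python
def kaplan_meier(times, events):
    """Product-limit estimate of survival at each distinct event time.

    times  : follow-up time of each patient
    events : 1 if death was observed at that time, 0 if censored
    Returns a list of (time, survival probability) pairs.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Gather every patient whose follow-up ends at time t
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths > 0:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving  # deaths and censored patients leave the risk set
    return curve

# Hypothetical follow-up (months) and event indicators for five patients
toy_times = [5, 8, 12, 12, 20]
toy_events = [1, 0, 1, 1, 0]
km_curve = kaplan_meier(toy_times, toy_events)
print(km_curve)
```

Comparing two such curves (for example, high- versus low-risk groups) is what the log-rank test formalizes.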

Results

Differentially expressed Immune-related genes in breast cancer

A total of 5443 differentially expressed genes were obtained by differential expression analysis of 1053 tumor samples and 111 normal samples in the TCGA database. Of these, 2265 genes were up-regulated and 3178 genes were down-regulated in the tumor samples (Fig S1A). These differentially expressed genes were then intersected with the list of immune-related genes obtained from the ImmPort and InnateDB databases, yielding 542 immune-related differentially expressed genes, of which 283 were up-regulated and 259 were down-regulated in the tumor samples (Fig S1B). Functional enrichment analysis showed that these 542 immune-related differentially expressed genes were significantly associated with 2335 Gene Ontology terms and 108 KEGG pathways. The top 10 most significantly associated biological processes, molecular functions, cellular components, and signaling pathways are shown in Fig S2. The biological processes were mainly enriched in cell tropism, myeloid migration, regulation of chemotaxis, and granulocyte migration, while the signaling pathways were mainly enriched in cytokine-cytokine receptor interaction, JAK-STAT signaling, chemokine signaling, Ras signaling, neuroactive ligand-receptor interaction, IL-17 signaling, and tumor necrosis factor signaling (Fig S2A, S2B).

Immune-related Hub Genes In Breast Cancer

We constructed a coexpression network from the 542 immune-related differentially expressed genes using the WGCNA package in R (scale-free topology fit threshold of 0.9; optimal soft-thresholding power of 5) (Fig S3). Four modules were identified based on hierarchical clustering and the optimal threshold value (Fig S4A, S4B), and the 542 genes were assigned to these 4 modules. According to the Pearson correlation coefficients between each module and the sample features, the brown and turquoise modules were strongly associated with breast cancer; the genes of the turquoise module, which showed the highest significance, were selected for subsequent analysis (this module had 213 genes and 292 edges). The top 10 significantly enriched Gene Ontology terms and KEGG pathways in the module genes were identified. Gene Ontology terms were mainly enriched for signaling receptor activator activity, receptor ligand activity, cellular chemotaxis, regulation of chemotaxis, negative regulation of response to external stimuli, ERK1 and ERK2 cascade and its regulation, epithelial cell proliferation, migration of myeloid leukocytes, and regulation of epithelial cell proliferation (Fig S4C). Signaling pathways were mainly enriched in cytokine-cytokine receptor interaction, PI3K-Akt signaling, neuroactive ligand-receptor interaction, MAPK signaling, and JAK-STAT signaling, followed by the Rap1 signaling pathway, chemokine signaling pathway, axon guidance, Ras signaling pathway, and calcium signaling pathway (Fig S4D). With an edge weight threshold > 0.2, the turquoise module network comprised 65 genes and 292 edges (Fig S4E).

Univariate Cox Regression Analysis And Identification Of Prognostic Factors

To identify prognostic features among the 213 genes in the turquoise module, univariate Cox regression models and Kaplan-Meier curves were constructed. Univariate Cox regression analysis showed that age, risk score, and stage were significantly associated with breast cancer prognosis (Fig S5A). The Kaplan-Meier curves showed that the 12 immune-related hub genes (S100B, JUN, TPM2, VIM, SAA1, NPR3, SDC1, TACRA, ICAM2, RAD21, SERPING1, and STAT5A) correlated strongly with the prognosis of breast cancer patients (P ≤ 0.05) (Fig S5B, S5C).

Multivariate Cox Regression Analysis And Establishment Of Prognostic Risk Model

Multivariate Cox regression analysis was performed to identify independent prognostic genes and to confirm that the risk score remained an independent prognostic factor after adjustment for other clinicopathological factors (Fig. 1A, 1B); only three genes (S100B, NPR3, and SDC1) were found to significantly affect OS in breast cancer patients (Fig. 1C, 1D, 1E). The IRGRS was calculated with the following formula: risk score = expression level of S100B*(-0.16) + expression level of NPR3*(0.24) + expression level of SDC1*(0.24). With the median risk score as the cut-off value, patients with low risk scores had a better prognosis (Fig. 2A). The prognostic ability of the risk score was verified using four GEO breast cancer datasets (GSE17705, GSE58812, GSE22219, and GSE21653), whose results were consistent with each other and with those of the TCGA dataset (Fig. 2B, 2C, 2D, 2E). Finally, the Human Protein Atlas (https://www.proteinatlas.org/) database showed increased protein expression of two of these genes (SDC1 and S100B) in breast cancer tissues (Fig. 2F).
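The risk score formula and median split can be written out as a short sketch. The three coefficients are the ones reported above; the patient identifiers and expression values are hypothetical, and assigning scores equal to the median to the low-risk group is an assumption, since the text does not say how ties were handled.

```python
from statistics import median

# Coefficients reported in the text for the three-gene IRGRS
COEFS = {"S100B": -0.16, "NPR3": 0.24, "SDC1": 0.24}

def risk_score(expression):
    """Weighted sum of expression values: sum over genes of coef * expression."""
    return sum(coef * expression[gene] for gene, coef in COEFS.items())

# Hypothetical expression levels for four patients (toy values only)
toy_patients = {
    "P1": {"S100B": 5.0, "NPR3": 1.0, "SDC1": 2.0},
    "P2": {"S100B": 1.0, "NPR3": 3.0, "SDC1": 4.0},
    "P3": {"S100B": 2.0, "NPR3": 2.0, "SDC1": 2.0},
    "P4": {"S100B": 4.0, "NPR3": 0.5, "SDC1": 1.0},
}

scores = {pid: risk_score(expr) for pid, expr in toy_patients.items()}
cutoff = median(scores.values())

# Assumption: scores strictly above the median are labeled high risk
groups = {pid: ("high" if s > cutoff else "low") for pid, s in scores.items()}
print(cutoff, groups)
```

Because S100B carries a negative coefficient, higher S100B expression lowers the score, while higher NPR3 or SDC1 expression raises it.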

A heatmap was plotted to evaluate the predictive significance of the clinical prognostic factors, combined with the results of the multivariate Cox regression analysis. As shown in Fig. 3, there was no significant difference in age or stage between the high-risk-score and low-risk-score groups. As shown in Fig. 4, the areas under the curve (AUC) of the risk score prognostic model at 1, 2, and 3 years were 0.661, 0.675, and 0.665, respectively.

Relationship Between Different IRGRS Subgroups And Other Immune Subtypes

Thorsson et al. performed a large-scale immunogenomic analysis of over 10,000 tumor samples from TCGA, covering 33 different cancer types. Across tumor types, they identified six immune subtypes: wound healing (C1), interferon (IFN)-γ dominant (C2), inflammatory (C3), lymphocyte depleted (C4), immunologically quiet (C5), and TGF-β dominant (C6) (Thorsson et al. 2018). We used the R package Immune Subtype Classifier to assign each sample to an immune subtype based on its gene expression profile. As shown in Fig. 5, the low-IRGRS group consisted of 28% C1, 39% C2, 25% C3, 3% C4, and 4% C6, while the high-IRGRS group consisted of 40% C1, 35% C2, 10% C3, 12% C4, and 3% C6. The C2 and C6 subtypes were almost evenly distributed between the two groups. The C3 subtype was more frequent in the low-IRGRS group, and the C1 and C4 subtypes were more frequent in the high-IRGRS group (P < 0.001, χ² test).

Immunotherapy And Chemotherapy Of Different IRGRS Subgroups

To assess the prognostic value of the immune and stromal components in the IRGRS subgroups, we used the ESTIMATE algorithm to estimate the proportions of immune and stromal components in the TME of each sample. The Immune Score and Estimate Score of patients in the high-IRGRS group were lower than those in the low-IRGRS group, suggesting that patients in the low-IRGRS group were more likely to benefit from immunotherapy (Fig. 6A). The Stromal Score did not differ significantly between the high-IRGRS and low-IRGRS groups. We then evaluated the associations of the Immune Score and Estimate Score with overall survival, using the median of each score as the cut-off value to divide breast cancer patients into high- and low-score groups. A high Immune Score was associated with prolonged survival, while the Estimate Score was not associated with overall survival (Fig. 6B, 6C).

In addition, we compared the IC50 values of four drugs (bleomycin, gemcitabine, lapatinib, and paclitaxel) between the high- and low-risk score groups. Patients in the low-IRGRS group had lower IC50 values than those in the high-IRGRS group, suggesting that patients in the low-IRGRS group were more likely to benefit from these four drugs (Fig. 6D).

Discussion

The use of ICIs has revolutionized patient care and improved survival outcomes for a wide range of malignancies, including breast cancer (Kruger et al. 2019). In combination with chemotherapy, PD-1/PD-L1 monoclonal antibodies increased pCR rates after neoadjuvant treatment for TNBC or hormone receptor-positive/human epidermal growth factor receptor 2-negative breast cancer. This combination is also the standard first-line treatment for locally advanced or metastatic TNBC. Biomarkers that better predict the efficacy of breast cancer immunotherapy and support more accurate, individualized treatment therefore deserve further study.

WGCNA is a method that analyzes gene expression patterns across multiple samples, clusters genes with similar expression patterns into modules, and relates the modules to phenotypes or traits and to the core genes in the network. We screened 12 immune-related core (hub) genes associated with OS using WGCNA and identified three genes (S100B, NPR3, and SDC1) that independently affected OS by univariate and multivariate Cox regression analysis. With the median risk score as the cut-off value, the prognosis of patients in the low-risk score subgroup was significantly better than that of patients in the high-risk score subgroup. The prognostic ability of the risk score model was then validated using expression data from four breast cancer microarray datasets in the GEO database, covering both early and advanced TNBC and estrogen receptor-positive breast cancers and including more than 800 cases with a maximum follow-up of 10 years.

The risk score prognostic model consists of three genes: S100B, NPR3, and SDC1. S100 calcium binding protein B (S100B) is a member of the S100 superfamily, which is localized in the cytoplasm and/or nucleus of a wide range of cells and is involved in the regulation of cell proliferation, differentiation, apoptosis, and immune responses (Hua et al. 2020). One study found that abnormal expression of S100B in glioblastoma led to changes in the immune microenvironment that promote the proliferation and migration of tumor cells (Hu et al. 2021). In breast cancer, however, high S100B expression has been found to predict good OS in estrogen receptor-negative patients and good distant metastasis-free survival in all patients (Yen et al. 2018). A study using a machine learning approach to analyze the prognosis of TNBC also found that high S100B expression predicted a good prognosis (Thalor et al. 2022). Natriuretic peptide receptor 3 (NPR3) encodes the natriuretic peptide clearance receptor, whose basic function is to mediate natriuretic peptide degradation (Ehret et al. 2011). However, previous studies have shown that abnormal expression of NPR3 may act as a tumor promoter and lead to cancer growth (Gu et al. 2018; Martinez-Romero et al. 2018). There is also evidence that NPR3 can change the immune microenvironment of male gastric cancer patients and affect tumor development (Xu et al. 2021). Syndecan-1 (SDC1) is mainly expressed in epithelial cells and plasmacytes in adult human tissues and plays a significant role in cell-to-cell and cell-to-matrix interactions (Palaiologou et al. 2014). Recent studies have confirmed that SDC1 also plays a crucial role in carcinogenesis. SDC1 may mediate the phosphorylation of very late antigen-4, which suppresses the influx of tumor-inhibiting NK cells and cytotoxic T cells and facilitates the escape of tumor cells from immune surveillance (Jung et al. 2019). Several studies have shown that high SDC1 expression in breast cancer is significantly associated with more aggressive characteristics and poor prognosis (Qiao et al. 2019; Cui et al. 2017; Okolicsanyi et al. 2015; Yeh et al. 2018; Zhao et al. 2020; Jung et al. 2019); this is consistent with our results. In line with the coefficients of the risk score formula, our IRGRS was negatively correlated with S100B and positively correlated with NPR3 and SDC1; this is consistent with the characteristics of each gene.

Thorsson et al. performed a large-scale immunogenomic analysis of more than 10,000 tumor samples from 33 different cancer types in TCGA, including breast cancer, to characterize the immune landscape of cancer. They identified wound healing (C1), IFN-γ dominant (C2), inflammatory (C3), lymphocyte depleted (C4), immunologically quiet (C5), and TGF-β dominant (C6) subtypes as stable and reproducible immune subtypes associated with differences in prognosis, genotype, and immune regulation (Thorsson et al. 2018). In our study, the IFN-γ dominant (C2) and inflammatory (C3) subtypes were more frequent in the low-risk group, while the wound healing (C1) and lymphocyte depleted (C4) subtypes were more frequent in the high-risk group; this is in line with the characteristics and outcomes reported in the literature. Among the subtypes, C3, with elevated Th17 and Th1 genes, had the best OS, while C1 and C2 had less favorable outcomes despite their substantial lymphocyte infiltration. C4, a more mixed subtype characterized by Th1 suppression and a high M2 response, had the least favorable outcome (Thorsson et al. 2018).

The TME is related not only to the occurrence, development, and metastasis of tumors but also plays a significant role in tumor immunotherapy. In this study, we used the ESTIMATE algorithm to obtain Immune, Stromal, and Estimate Scores. The Immune Score and Estimate Score of patients in the high-IRGRS group were lower than those in the low-IRGRS group, suggesting that the immune components of the TME were higher in patients with a low risk score, which supports the view that they are more likely to benefit from immunotherapy. This was further explored by examining the impact of the Immune Score and Estimate Score on overall survival: the group with a high Immune Score had a higher overall survival rate. This finding is consistent with previous results showing that the Immune Score significantly affects survival in many tumors, including liver cancer and lung adenocarcinoma (Xiang et al. 2021; Wu et al. 2021). Chemotherapy combined with immunotherapy has been shown to be a promising treatment option for breast cancer. However, unlike for PD-1 monoclonal antibodies, clinical trials of PD-L1 monoclonal antibodies have not provided sufficient evidence that combination with chemotherapy is superior to monotherapy (Garrido-Castro et al. 2019). It therefore remains necessary to explore new combinations of immune drugs and chemotherapeutic drugs. Our study also examined the predicted sensitivity of the high-IRGRS and low-IRGRS groups to multiple drugs. Because chemotherapy is the cornerstone of systemic therapy, these differences in predicted sensitivity support the risk stratification as an indicator of treatment response. Taken together, these findings support IRGRS as a candidate biomarker for predicting the efficacy of immunotherapy.

Conclusions

We constructed an IRGRS for breast cancer whose high- and low-score subgroups had different immune characteristics. The IRGRS is a potential biomarker for predicting the efficacy of breast cancer immunotherapy. Our findings need to be validated in future clinical studies.

Declarations

Conflict of Interest

The authors declare that the study was conducted in the absence of any business or financial relationships that could be interpreted as potential conflicts of interest.

Author Contributions

BY, XC, and LZ designed and conducted the study. BY and XC drafted and edited the manuscript and figures. WZ helped improve the manuscript. LZ reviewed, revised, and supervised the work.

Funding

This work was supported by the Incubation Project of the West China Hospital, Sichuan University (2019HXFH053).

References

Cortes J, Cescon DW, Rugo HS et al (2020) Pembrolizumab plus chemotherapy versus placebo plus chemotherapy for previously untreated locally recurrent inoperable or metastatic triple-negative breast cancer (KEYNOTE-355): a randomised, placebo-controlled, double-blind, phase 3 clinical trial. Lancet 396:1817–28.
Cui X, Jing X, Yi Q, Long C, Tian J, and Zhu J (2017) Clinicopathological and prognostic significance of SDC1 overexpression in breast cancer. Oncotarget 8:111444–55.
Dieci MV, Guarneri V, Tosi A et al (2022) Neoadjuvant Chemotherapy and Immunotherapy in Luminal B-like Breast Cancer: Results of the Phase II GIADA Trial. Clin Cancer Res 28:308–17.
Ehret GB, Munroe PB, Rice KM et al (2011) Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature 478:103–9.
Garrido-Castro AC, Lin NU, and Polyak K (2019) Insights into Molecular Classifications of Triple-Negative Breast Cancer: Improving Patient Selection for Treatment. Cancer Discov 9:176–98.
Geeleher P, Cox N, and Huang RS (2014) pRRophetic: an R package for prediction of clinical chemotherapeutic response from tumor gene expression levels. PLoS One 9:e107468.
Gu L, Lu L, Zhou D, and Liu Z (2018) Long Noncoding RNA BCYRN1 Promotes the Proliferation of Colorectal Cancer Cells via Up-Regulating NPR3 Expression. Cell Physiol Biochem 48:2337–49.
Hu Y, Song J, Wang Z et al (2021) A Novel S100 Family-Based Signature Associated with Prognosis and Immune Microenvironment in Glioma. J Oncol 2021:3586589.
Hua X, Zhang H, Jia J, Chen S, Sun Y, and Zhu X (2020) Roles of S100 family members in drug resistance in tumors: Status and prospects. Biomed Pharmacother 127:110156.
Jung O, Beauvais DM, Adams KM, and Rapraeger AC (2019) VLA-4 phosphorylation during tumor and immune cell migration relies on its coupling to VEGFR2 and CXCR4 by syndecan-1. J Cell Sci 132.
Kruger S, Ilmer M, Kobold S et al (2019) Advances in cancer immunotherapy 2019 - latest trends. J Exp Clin Cancer Res 38:268.
Langfelder P, and Horvath S (2008) WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9:559.
Martinez-Romero J, Bueno-Fortes S, Martín-Merino M, Ramirez de Molina A, and De Las Rivas J (2018) Survival marker genes of colorectal cancer derived from consistent transcriptomic profiling. BMC Genomics 19:857.
Mittendorf EA, Zhang H, Barrios CH et al (2020) Neoadjuvant atezolizumab in combination with sequential nab-paclitaxel and anthracycline-based chemotherapy versus placebo and chemotherapy in patients with early-stage triple-negative breast cancer (IMpassion031): a randomised, double-blind, phase 3 trial. Lancet 396:1090–100.
Nanda R, Liu MC, Yau C et al (2020) Effect of Pembrolizumab Plus Neoadjuvant Chemotherapy on Pathologic Complete Response in Women With Early-Stage Breast Cancer: An Analysis of the Ongoing Phase 2 Adaptively Randomized I-SPY2 Trial. JAMA Oncol 6:676–84.
Okolicsanyi RK, Buffiere A, Jacinto JM et al (2015) Association of heparan sulfate proteoglycans SDC1 and SDC4 polymorphisms with breast cancer in an Australian Caucasian population. Tumour Biol 36:1731–8.
Palaiologou M, Delladetsima I, and Tiniakos D (2014) CD138 (syndecan-1) expression in health and disease. Histol Histopathol 29:177–89.
Qiao W, Liu H, Guo W, Li P, and Deng M (2019) Prognostic and clinical significance of syndecan-1 expression in breast cancer: A systematic review and meta-analysis. Eur J Surg Oncol 45:1132–37.
Schmid P, Rugo HS, Adams S et al (2020) Atezolizumab plus nab-paclitaxel as first-line treatment for unresectable, locally advanced or metastatic triple-negative breast cancer (IMpassion130): updated efficacy results from a randomised, double-blind, placebo-controlled, phase 3 trial. Lancet Oncol 21:44–59.
Sung H, Ferlay J, Siegel RL et al (2021) Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin 71:209–49.
Thalor A, Kumar Joon H, Singh G, Roy S, and Gupta D (2022) Machine learning assisted analysis of breast cancer gene expression profiles reveals novel potential prognostic biomarkers for triple-negative breast cancer. Comput Struct Biotechnol J 20:1618–31.
Thorsson V, Gibbs DL, Brown SD et al (2018) The Immune Landscape of Cancer. Immunity 48:812–30.e14.
Wu J, Li L, Zhang H et al (2021) A risk model developed based on tumor microenvironment predicts overall survival and associates with tumor immunity of patients with lung adenocarcinoma. Oncogene 40:4413–24.
Xiang S, Li J, Shen J et al (2021) Identification of Prognostic Genes in the Tumor Microenvironment of Hepatocellular Carcinoma. Front Immunol 12:653836.
Xu X, Lu Y, Wu Y et al (2021) A signature of seven immune-related genes predicts overall survival in male gastric cancer patients. Cancer Cell Int 21:117.
Yeh MH, Tzeng YJ, Fu TY et al (2018) Extracellular Matrix-receptor Interaction Signaling Genes Associated with Inferior Breast Cancer Survival. Anticancer Res 38:4593–605.
Yen MC, Huang YC, Kan JY, Kuo PL, Hou MF, and Hsu YL (2018) S100B expression in breast cancer as a predictive marker for cancer metastasis. Int J Oncol 52:433–40.
Yoshihara K, Shahmoradgoli M, Martínez E et al (2013) Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun 4:2612.
Zhao B, Xu Y, Zhao Y, Shen S, and Sun Q (2020) Identification of Potential Key Genes Associated With the Pathogenesis, Metastasis, and Prognosis of Triple-Negative Breast Cancer on the Basis of Integrated Bioinformatics Analysis. Front Oncol 10:856.

No competing interests reported.

Fig S1. Analysis of differentially expressed genes between tumor and normal tissues. (A) Heat map of all differentially expressed genes (DEGs) between 1053 breast cancer samples (red) and 111 paracancerous samples (blue) (p < 0.05, |logFC| > 1); (B) Heat map of immune-related DEGs between 1053 breast cancer samples (red) and 111 paracancerous samples (blue).
Fig S2. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis of immune-related differential genes. (A) GO enrichment analysis of the immune-related DEGs (p < 0.05). (B) KEGG pathway analysis of the immune-related DEGs (p < 0.05).
Fig S3. Determination of the soft-thresholding power in the WGCNA. In the left graph, the horizontal line indicates the threshold value at 0.9. As can be seen from the graph, the optimal soft-thresholding power of WGCNA is 5.
Fig S4. WGCNA of immune-related differentially expressed genes. (A) Dynamic tree cutting based on WGCNA. (B) Gene modules related to breast cancer obtained by WGCNA. (C) Gene Ontology (GO) enrichment analysis of the genes of the turquoise module (p < 0.05). (D) Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of the genes of the turquoise module (p < 0.05). (E) The network of genes in the turquoise module (weight of edge > 0.2).
Fig S5. Breast cancer immune-related hub genes and survival analysis. (A) Univariate Cox analysis of clinicopathologic factors and the IRGRS score. (B) Univariate Cox analysis of 12 immune-related hub genes. (C) Kaplan-Meier curves of 12 immune-related hub genes (P < 0.05, log-rank test).
