Identification of potential crucial genes associated with breast cancer using bioinformatics analysis and experimental verification

doi:10.21203/rs.3.rs-2457642/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

In this study, we identified a total of 492 DEGs, including 176 up-regulated and 316 down-regulated DEGs. GO analysis showed that the up-regulated DEGs were mainly involved in cell division, nucleus and protein binding, whereas the down-regulated DEGs were mainly involved in immune response, extracellular exosome and calcium ion binding. The top five enriched pathways in the KEGG pathway analysis were pathways in cancer, cytokine-cytokine receptor interaction, focal adhesion, the PI3K-Akt signaling pathway and ECM-receptor interaction. The top 10 up-regulated hub genes identified from the PPI network were AURKA, CDC6, CCNA2, CDCA8, NUSAP1, CDK1, CCNB1, CCNB2, UBE2C and HMMR. The top 10 down-regulated hub genes were IGF1, JUN, FGF2, CXCL12, KIT, PTGS2, LEP, EGF, EGR1 and FOS. Survival analysis showed that the expression levels of WIF1 (P = 0.019) and HMMR (P = 0.027) were correlated with the prognosis of patients with breast cancer. In addition, gene expression and methylation analysis showed that, in breast cancer, COL11A1 was highly expressed and hypermethylated, MMP1 was highly expressed and hypomethylated, and SFRP1 and WIF1 were expressed at low levels and hypermethylated. In the analysis of tumor purity and immune cell infiltration, HMMR expression was significantly associated with B cells, CD8⁺ T cells, neutrophils and dendritic cells (P < 0.05), and MMP1 was negatively associated with tumor purity. Bioinformatics can effectively analyze gene chip data, extract the underlying biological information, and provide a basis for subsequent experiments. This study identifies key genes and pathways in breast cancer that will advance our understanding of its molecular mechanisms.

Breast cancer

Integrated bioinformatics

Differentially expressed genes

Biological pathways

Worldwide, breast cancer is the most common cancer affecting women, and its morbidity and mortality are expected to increase significantly in the coming years. Despite tremendous advances in human cancer research, breast cancer is still a major health issue and represents the highest priority of biomedical research(1). Various approaches are currently used in the diagnosis and treatment of breast cancer, including precision medicine, which aims to address the challenges associated with cancer care(2). There is ample evidence that lifestyle (high-fat diet, drinking, lack of physical exercise) and environmental factors have an impact on the development of breast cancer. Eliminating these factors (primary prevention) may help reduce morbidity and mortality. Secondary prevention, including diagnostic tests (such as mammography, ultrasound, magnetic resonance imaging, breast screening, and modern and more accurate imaging methods), can help with early detection of tumors or of lesions predisposed to becoming tumors(3). Previous studies have shown that the onset of breast cancer may be linked to genetic, environmental and other factors. In recent years, research has focused on the molecular mechanism of its onset, but the specific etiology is still unclear. Recently, some updates on breast cancer screening recommendations have been released internationally. In addition, advances in genomics have made it possible to establish new molecular classifications of breast cancer(4).

Microarray analysis is a high-throughput method to study tumor genes, find molecular targets for tumor drug therapy and monitor prognosis. However, because of the heterogeneity of experimental samples, the use of different detection platforms and data processing methods can produce inconsistent results. The Robust Rank Aggregation (RRA) method is suitable for comparing multiple ranked gene lists(5). It examines the rank of each gene in each list under the null hypothesis that the genes in each experiment are randomly ordered. RRA then compares the observed ranks with those expected under this random ordering, so that a gene ranked consistently higher than expected receives a lower P value. RRA integrates the results of multiple gene expression datasets, thereby enhancing the understanding of the molecular mechanism of tumor genes. Our research aims to provide reliable molecular markers and effective therapeutic targets for breast cancer.

Acquisition of microarray data

The GSE20711(6), GSE61304(7), GSE139038(8), GSE124646(9), GSE33447(10) and GSE5764(11) gene expression profile matrix files were downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/geo/). The platform of the GSE20711 dataset is the GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array, and this dataset contains 2 normal breast tissues and 88 breast cancer tissues. The platform of the GSE61304 dataset is also the GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array, and this dataset contains 4 normal breast tissues and 58 breast cancer tissues. The platform of the GSE139038 dataset is GPL27630 (Print_1437), and this dataset contains 24 normal breast tissues and 41 breast cancer tissues. The platform of the GSE124646 dataset is GPL96, and this dataset contains 10 normal breast tissues and 10 breast cancer tissues. The platform of the GSE33447 dataset is GPL14550, and this dataset contains 8 normal breast tissues and 8 breast cancer tissues. The platform of the GSE5764 dataset is GPL570, and this dataset contains 20 normal breast tissues and 12 breast cancer tissues.

Identification of robust DEGs

We downloaded series matrix files of datasets from GEO. The R package “limma”(12) was utilized to normalize the data and find DEGs. We then used RRA to integrate the results of those 6 datasets to find the most significant DEGs(13). The P value of each gene indicated its ranking in the final gene list, and genes with adjusted P < 0.05 were regarded as significant DEGs in the RRA analysis.
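The rank-aggregation step can be illustrated with a minimal Python sketch of the rho score described by Kolde et al.(5). This is not the code used in this study: the function names, the toy gene lists, and the choice to assign a normalized rank of 1.0 to genes missing from a list are assumptions made for illustration only.

```python
from math import comb


def beta_order_pvalue(x, k, n):
    """P(k-th smallest of n independent uniform(0, 1) ranks <= x).

    Under the null model of random ordering this is a binomial upper tail.
    """
    return sum(comb(n, j) * x**j * (1 - x) ** (n - j) for j in range(k, n + 1))


def rho_score(norm_ranks):
    """Rho score for one gene from its normalized ranks (one per list)."""
    ordered = sorted(norm_ranks)
    n = len(ordered)
    best = min(beta_order_pvalue(x, k, n) for k, x in enumerate(ordered, start=1))
    # Multiply by n to correct for taking the minimum over n order statistics.
    return min(1.0, best * n)


def aggregate(ranked_lists):
    """Return (gene, rho score) pairs sorted from most to least significant."""
    genes = set().union(*ranked_lists)
    scores = {}
    for gene in genes:
        norm = [
            # Assumption: a gene absent from a list gets the worst rank, 1.0.
            (lst.index(gene) + 1) / len(lst) if gene in lst else 1.0
            for lst in ranked_lists
        ]
        scores[gene] = rho_score(norm)
    return sorted(scores.items(), key=lambda kv: kv[1])


# Toy example with hypothetical gene labels, best-ranked gene first in each list.
toy_lists = [["A", "B", "C", "D"], ["A", "C", "B", "D"], ["B", "A", "D", "C"]]
ranking = aggregate(toy_lists)
```

In this toy input, gene A is ranked near the top of all three lists and therefore receives the smallest score, which is the behavior the method relies on when integrating the six datasets.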

Functional enrichment analyses

The DAVID 6.8 database (https://david.ncifcrf.gov/) is a commonly used resource for gene enrichment and functional annotation analysis. It integrates biological data and analysis tools to provide systematic and comprehensive annotations of biological functions for large-scale gene or protein lists. We used DAVID to perform GO and KEGG pathway enrichment analysis on the identified DEGs and downloaded the GO and KEGG pathway enrichment results for subsequent analysis. We then used Cytoscape 3.6.1 software to conduct a visual network analysis of the KEGG results. Results with P < 0.05 were considered statistically significant.
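The idea behind such an over-representation test can be sketched as a hypergeometric upper-tail probability. The Python sketch below is illustrative only: DAVID reports a modified Fisher exact P value (the EASE score), which is more conservative than the plain test shown here, and the function name and the numbers in the example are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(k, n, K, N):
    """P(X >= k) for X ~ Hypergeometric(N, K, n).

    N: genes in the background, K: background genes annotated to the term,
    n: genes in the DEG list, k: DEGs annotated to the term.
    """
    total = comb(N, n)
    upper = min(n, K)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / total


# Hypothetical toy numbers: 10 background genes, 4 annotated to the term,
# 3 genes in the list, 2 of which carry the annotation.
p_value = hypergeom_upper_tail(2, 3, 4, 10)
```

A term would be called enriched when this probability falls below the chosen cutoff, here P < 0.05.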

PPI network analysis

Studying the interaction network between proteins helps to identify core regulatory genes; what we are actually interested in here is "gene interaction". The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING; https://string-db.org/) is an online tool for analyzing interactions between proteins. Using STRING to analyze the PPI network of the DEGs can help us understand the relationships between different genes. Cytoscape software was used to screen hub genes according to degree.
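Ranking by degree means counting, for each node, the number of distinct interaction partners in the network. A minimal Python sketch of that ranking is given below, assuming the PPI network is available as a list of undirected gene pairs; the function name and the toy edges are hypothetical and do not reproduce the Cytoscape workflow used here.

```python
from collections import Counter


def top_hubs(edges, top_n=10):
    """Rank nodes of an undirected network by degree (number of partners)."""
    # Drop self-loops and collapse duplicate or reversed pairs.
    unique = {tuple(sorted(edge)) for edge in edges if edge[0] != edge[1]}
    degree = Counter()
    for a, b in unique:
        degree[a] += 1
        degree[b] += 1
    # Sort by descending degree, then by name so ties are deterministic.
    return sorted(degree.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


# Toy network with hypothetical node labels; ("B", "A") repeats ("A", "B").
toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("B", "A")]
hubs = top_hubs(toy_edges, top_n=2)
```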

Prognosis analysis and Methylation analyses

UALCAN is a comprehensive, user-friendly and interactive web resource for analyzing cancer OMICS data. It provides easy access to published cancer OMICS data (TCGA and MET500) and enables users to identify biomarkers or perform in silico validation of potential genes of interest. It provides graphs and plots describing gene expression and patient survival information based on gene expression, allows evaluation of gene expression in molecular subtypes of breast and prostate cancer, and allows evaluation of epigenetic regulation of gene expression by promoter methylation and its correlation with gene expression. UALCAN also supports pan-cancer gene expression analysis. These resources allow researchers to collect valuable information and data about genes/targets of interest(14). We used this website to compare methylation levels of hub genes between breast cancer and paracancerous normal tissues.

Analysis of gene expression and tumor-infiltrating immune cells

Ethical statement

The study was approved by the Ethics Committee of Zhuzhou Central Hospital and conducted in accordance with the Declaration of Helsinki. Prior to the start of the study, all participants gave written informed consent.

Tissue samples and clinical data

A total of 51 breast cancer tissues (age, 45±0.26 years; male/female patient ratio, 1/60) and 32 non-tumor breast tissues (age, 47±0.73 years; n=31 female patients) were collected from the Zhuzhou Central Hospital (Hunan, China) between February 2015 and July 2019. Patients with diabetes, nephritis, or cardiovascular disease were excluded. Patient information was obtained from medical records. The present study was approved by the Ethics Committee of Zhuzhou Central Hospital. Written informed consent was obtained from all of the participants.

Cell culture

The human breast cancer cell line MCF-7 and the breast epithelial cell line MCF10A were obtained from the American Type Culture Collection (Manassas, VA, USA). Cells were cultured in high-glucose Dulbecco's modified Eagle's medium (Invitrogen; Thermo Fisher Scientific, Inc., Waltham, MA, USA) containing 10% fetal bovine serum (FBS; Gibco; Thermo Fisher Scientific, Inc.) and maintained in a humidified atmosphere of 5% CO2 in air at 37℃.

RNA extraction, real-time PCR and RT-PCR

Total RNA was isolated from tissue samples using TRIzol reagent (Invitrogen) according to the manufacturer’s protocol. The cDNA was synthesized from the total RNA using a Reverse Transcription System (Fermentas, Glen Burnie, MD, USA) according to the manufacturer’s instructions. GAPDH was amplified in parallel as an internal control. The expression level of each gene was quantified by measuring the cycle threshold (Ct) values and normalized relative to that of GAPDH using the 2^(-ΔΔCt) method. The primers used in the reaction were as follows:

COL11A1 (forward, 5’-TAACATCGCTGACGGGAAGTG-3’; reverse, 5’-CCGTGATTCCATTGGTATCAACA-3’).

SFRP1 (forward, 5’-ACGTGGGCTACAAGAAGATGG-3’; reverse, 5’-CAGCGACACGGGTAGATGG-3’).

MMP1 (forward, 5’-CTCTGGAGTAATGTCACACCTCT-3’; reverse, 5’-TGTTGGTCCACCTTTCATCTTC-3’).

WIF1 (forward, 5’-CTGATGGGTTCCACGGACC-3’; reverse, 5’-AGAAACCAGGAGTCACACAAAG-3’).
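The relative quantification used for the qPCR data, normalized to GAPDH, can be written out as a short Python sketch. The function name and the Ct values in the example are hypothetical and serve only to show the arithmetic of the 2^(-ΔΔCt) calculation.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change of a target gene by the 2^(-ΔΔCt) method.

    ΔCt = Ct(target) - Ct(reference gene, e.g. GAPDH), computed separately
    for the sample of interest and for the control sample;
    ΔΔCt = ΔCt(sample) - ΔCt(control).
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    delta_delta = delta_sample - delta_control
    return 2.0 ** (-delta_delta)


# Hypothetical Ct values: the target crosses threshold 2 cycles earlier in
# the tumor sample than in the control, with identical reference gene Ct.
fold_change = relative_expression(24.0, 18.0, 26.0, 18.0)
```

With these illustrative values ΔΔCt is -2, which corresponds to a four-fold higher expression in the sample than in the control.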

Western blot

Protein was extracted from the indicated cells using RIPA lysis buffer. Protein concentrations were determined using a BCA Protein Assay kit (Thermo Fisher Scientific, Rockford, IL, USA). A total of 60 μg of protein was separated on a 10% SDS-PAGE gel and transferred onto polyvinylidene difluoride membranes (Millipore, Billerica, MA, USA), which were blocked in 5% nonfat milk for 1 h and then incubated with the corresponding primary antibodies overnight at 4℃. The antibodies used in this study were rabbit polyclonal anti-COL11A1 (ab64883; 1:500 dilution), rabbit polyclonal anti-SFRP1 (ab4193; 1:500 dilution), rabbit polyclonal anti-MMP1 (ab137332; 1:500 dilution), rabbit polyclonal anti-WIF1 (ab186845; 1:500 dilution), and rabbit polyclonal anti-β-Tubulin (1:3000 dilution) from Proteintech (Wuhan, China). After washing with 1×TBST three times for 8 min each, the membranes were incubated with the corresponding secondary antibodies for 1 h at 37℃ and then washed three more times with 1×TBST. Finally, the bands were visualized using an ECL kit (Millipore). Signals were quantified with ImageJ software and normalized to β-tubulin.

3.1 Identification of differentially expressed genes in breast cancer

The dataset information is presented in Table 1. Because the dataset information was initially heterogeneous, it had to be standardized. The breast cancer chip expression datasets GSE20711, GSE61304, GSE139038, GSE124646, GSE33447, and GSE5764 were normalized, and the results are shown in Fig. S1 (Supplementary Figure 1). We screened DEGs using the limma R package (adjusted P < 0.05 and |fold change (FC)| > 1).
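The screening rule can be illustrated with a short Python sketch that applies a Benjamini-Hochberg adjustment (the default adjustment method in limma) and then filters on both thresholds. This is not the limma workflow itself: the function names and toy values are hypothetical, and applying the fold-change cutoff on a log2 scale is an assumption, since the scale is not stated above.

```python
def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_degs(genes, log_fc, pvalues, p_cut=0.05, fc_cut=1.0):
    """Split genes into up- and down-regulated sets.

    Assumption: log_fc holds log2 fold changes and fc_cut is applied to
    their absolute value.
    """
    adj = benjamini_hochberg(pvalues)
    up = [g for g, fc, q in zip(genes, log_fc, adj) if q < p_cut and fc > fc_cut]
    down = [g for g, fc, q in zip(genes, log_fc, adj) if q < p_cut and fc < -fc_cut]
    return up, down


# Hypothetical toy input with four genes.
toy_genes = ["G1", "G2", "G3", "G4"]
toy_log_fc = [2.0, -1.5, 0.4, 3.0]
toy_p = [0.001, 0.002, 0.03, 0.5]
up_genes, down_genes = select_degs(toy_genes, toy_log_fc, toy_p)
```

In the toy input, G3 passes the adjusted P value cutoff but not the fold-change cutoff, and G4 shows the reverse, so neither is selected.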

Table 1. Details of the GEO breast cancer data.
Sample	Dataset ID	Number of samples	GPL ID
Breast	GSE20711	88T 2N	GPL570
Breast	GSE61304	58T 4N	GPL570
Breast	GSE139038	41T 24N	GPL27630
Breast	GSE124646	10T 10N	GPL96
Breast	GSE33447	8T 8N	GPL14550
Breast	GSE5764	12T 20N	GPL570

Note: GSE, Gene Expression Omnibus Series; T, tumor samples; N, paracancerous normal samples

3.2 Identification of robust DEGs by RRA analysis

The RRA method is based on the null assumption that the genes in each dataset are randomly ordered. If a gene ranks high in all datasets, its associated P value is lower and the probability that it is differentially expressed is greater. We identified DEGs in breast cancer using integrated bioinformatics; the top 20 up-regulated and top 20 down-regulated DEGs are shown in the heatmap (Fig. 1). Based on the results of the RRA analysis, a total of 176 up-regulated and 316 down-regulated significant DEGs were identified.

3.3 GO term analysis of DEGs

GO functional analysis is divided into three parts: biological processes (BP), molecular functions (MF), and cellular components (CC). We used the DAVID database and its online analysis tools to annotate GO functions for the integrated DEGs. The results were considered statistically significant if P < 0.05, and the three parts of the GO results are shown in Figs. 2 and 3. The up-regulated genes were mainly enriched in cell division (ontology: BP), nucleus (ontology: CC), and protein binding (ontology: MF), and the down-regulated genes were mainly enriched in immune response (ontology: BP), extracellular exosome (ontology: CC) and calcium ion binding (ontology: MF).

3.4 KEGG pathway analysis of DEGs

KEGG is a database for the systematic analysis of gene function and genome information, which helps to analyze genes as a whole network. KEGG pathway analysis of the integrated DEGs was performed using the DAVID database, and the results are shown in Fig. 4. The integrated DEGs were mainly enriched in pathways in cancer, cytokine-cytokine receptor interaction, focal adhesion, the PI3K-Akt signaling pathway, and ECM-receptor interaction.

3.5 Integration of protein-protein interaction (PPI) network analysis

The STRING online database was used to analyze the 492 integrated DEGs and to construct a PPI network. The PPI network can guide the study of breast cancer target genes and proteins and is expected to be important for future studies of breast cancer. The results were downloaded and analyzed using Cytoscape software. Based on the STRING database, the top 10 interacting proteins among the up-regulated genes were AURKA, CDC6, CCNA2, CDCA8, NUSAP1, CDK1, CCNB1, CCNB2, UBE2C and HMMR (Fig. 5). The top 10 interacting proteins among the down-regulated genes were IGF1, JUN, FGF2, CXCL12, KIT, PTGS2, LEP, EGF, EGR1 and FOS (Fig. 6).

3.6 Association between prognostic significance and methylation of hub genes

In the prognostic analysis, two of the hub genes were found to be associated with the prognosis of breast cancer patients (Fig. 7): WIF1 (P = 0.019) and HMMR (P = 0.027). We then explored the correlation between the expression levels of four hub genes and their methylation status to elucidate the underlying mechanism of their abnormal expression in breast cancer tissues. In breast cancer, COL11A1 was highly expressed and hypermethylated, MMP1 was highly expressed and hypomethylated, and SFRP1 and WIF1 were expressed at low levels and hypermethylated (Figs. 8 and 9).

3.7 Analysis of tumor-infiltrating immune cells

To investigate the correlation between the expression of selected hub genes and tumor-infiltrating immune cells (B cells, CD4⁺ T cells, CD8⁺ T cells, neutrophils, macrophages, and dendritic cells), we applied the online tool TIMER (https://cistrome.shinyapps.io/timer/)(17, 18), which contains 10,897 samples from diverse cancer types available in the TCGA database. Interestingly, HMMR was positively associated with tumor purity. HMMR was also significantly associated with B cells, CD8⁺ T cells, neutrophils and dendritic cells (P < 0.05). MMP1 was negatively associated with tumor purity and was significantly associated with B cells, CD8⁺ T cells, neutrophils, CD4⁺ T cells, macrophages and dendritic cells (P < 0.05). COL11A1 was negatively associated with tumor purity (Fig. 10).

3.8 The validation of differential genes by q-PCR and Western blot

To verify the results, we analyzed the expression levels of COL11A1, SFRP1, MMP1 and WIF1 in breast cancer tissues and non-tumor breast tissue samples by qRT-PCR assay. COL11A1 and MMP1 were found to be upregulated, and SFRP1 and WIF1 were found to be downregulated, in breast cancer tissues compared with non-tumor breast tissues. Consistently, western blot showed that WIF1 and SFRP1 were downregulated, and COL11A1 and MMP1 were upregulated, in MCF-7 cells compared with MCF10A cells (Fig. 11).

Breast cancer is the most common malignant tumor in women worldwide, and about 70-80% of patients with early non-metastatic disease can be cured. Advanced breast cancer with distant organ metastases is considered incurable by currently available treatments(19). In 2018, an estimated 2.1 million women were newly diagnosed with breast cancer, corresponding to approximately one new case every 18 seconds, and 626,679 women with breast cancer died(20).

In this study, we identified important DEGs between cancerous and normal samples, and conducted a series of bioinformatics analyses to screen for key genes and pathways closely related to breast cancer. Analysis of the GSE20711, GSE61304, GSE139038, GSE124646, GSE33447 and GSE5764 datasets using the RRA method yielded 492 integrated DEGs, including 176 up-regulated and 316 down-regulated significant DEGs. The up-regulated genes were mainly enriched in cell division (ontology: BP), nucleus (ontology: CC), and protein binding (ontology: MF), and the down-regulated genes were mainly enriched in immune response (ontology: BP), extracellular exosome (ontology: CC) and calcium ion binding (ontology: MF). These results suggest that these DEGs are involved in the proliferation and migration of breast cancer cells. KEGG pathway analysis found five significantly enriched pathways: pathways in cancer, cytokine-cytokine receptor interaction, focal adhesion, the PI3K-Akt signaling pathway, and ECM-receptor interaction. Defective components in DNA damage and repair mechanisms are the root causes of the occurrence and development of different types of cancer, and breast cancer is no exception(21, 22). The cytokine-cytokine receptor interaction pathway appears to be a key factor in triple-negative breast cancer drug resistance(23). Interestingly, Wang L et al. found that calcium signaling was significantly enriched in breast cancer(24). Gkretsi V et al. found that the silencing of growth differentiation factor-15 promotes breast cancer cell invasion by down-regulating focal adhesion genes(25). Costa RLB et al. consider the development of PI3K/AKT/mTOR pathway drugs for the treatment of breast cancer to be an evolving field in which, in addition to the interaction of new drugs with altered cancer pathways, their efficacy and toxicity should also be considered(26).

The top 10 hub genes among the up-regulated DEGs were HMMR, AURKA, CDC6, CCNA2, CDCA8, NUSAP1, CDK1, CCNB1, CCNB2, and UBE2C. Combinations of polymorphic sites in the HMMR, MDM2 and PALB2 genes appear to be candidate markers of genetic predisposition to breast cancer in the Kyrgyz population(27). Incomplete inhibition of AURKA was a common source of therapy failure, and combinations of PI3K, AKT or mTOR inhibitors with the AURKA inhibitor MLN8237 were highly synergistic and durably suppressed mTOR signaling, resulting in apoptosis and tumor regression in vivo(28). Cdc6 is a potential prognostic marker and therapeutic target in breast cancer patients(29, 30). The protein encoded by CCNA2 belongs to the highly conserved cyclin family, whose members act as cell cycle regulators. The protein binds and activates cyclin-dependent kinase 2, thereby facilitating the transition through G1/S and G2/M(31, 32). When normal tissue and tumor samples are compared by microarray analysis, the biggest difference most often occurs in the expression level of genes that control cell proliferation(33). The top 10 hub genes among the down-regulated DEGs were IGF1, JUN, FGF2, CXCL12, KIT, PTGS2, LEP, EGF, EGR1 and FOS. De Vincenzo et al. reported paracrine recruitment and activation of fibroblasts by c-Myc-expressing breast epithelial cells through the IGFs/IGF-1R axis(34). Sahores et al. showed that HMW-FGF2 isoforms are PRB targets that confer endocrine resistance and are localized in the nuclei of breast cancer samples(35). JUN is the putative transforming gene of avian sarcoma virus 17. It encodes a protein highly similar to the viral protein, which directly interacts with specific target DNA sequences to regulate gene expression. The gene is intron-free and is located at 1p32-p31, a chromosomal region involved in translocations and deletions in human malignancies(36).

We performed a prognostic analysis of the hub genes using UALCAN(37). Two genes were found to be associated with the prognosis of breast cancer patients: WIF1 (P = 0.019) and HMMR (P = 0.027). Liu S et al. found that the expression of WIF1, DKK2, SFRP2 and AXIN2 was positively correlated with patient survival(38). HMMR was significantly associated with metastasis and overall survival in patients with lung adenocarcinoma(39). The protein encoded by HMMR is involved in cell motility. It is expressed in breast tissue and, together with other proteins, forms a complex with BRCA1 and BRCA2, so it may increase the risk of breast cancer. Splice transcript variants encoding different isoforms of this gene have been noted(40-42). Alvarez et al. reported silencing of the tumor suppressor genes RASSF1A, SLIT2, and WIF1 by promoter hypermethylation in hereditary breast cancer(43). The protein encoded by WIF1 inhibits WNT proteins, which are extracellular signaling molecules that play a part in embryonic development, and promoter methylation of WNT inhibitory factor-1 may be associated with the pathogenesis of multiple human tumors(44, 45). Moreover, Veeck J et al. found prognostic relevance of WIF1 and Dickkopf-3 (DKK3) promoter methylation in human breast cancer(46). We also used UALCAN to explore DNA methylation patterns that could account for the abnormal expression of the above hub genes in breast cancer. COL11A1 was highly expressed and hypermethylated in breast cancer, MMP1 was highly expressed and hypomethylated, and SFRP1 and WIF1 were expressed at low levels and hypermethylated. Coordinated over-expression of particular collagens, mainly COL11A1, has been reported, and the composition of the overexpressed genes indicates invasion-facilitating altered proteolysis in the extracellular matrix(47).

Ameku T et al. identified MMP1 as a novel risk factor for intracranial aneurysms in ADPKD using iPSC models(48). Lim JP et al. found that silencing Y-box binding protein-1 inhibits triple-negative breast cancer cell invasiveness via regulation of MMP1 and beta-catenin expression(49). SFRP1 expression is strongly correlated with triple-negative breast cancer at the protein level. Associations with age and tumor grade support the role of SFRP1 as a biomarker for chemotherapy response in triple-negative breast cancer(50). Loss of SFRP1 expression was a significant regulator of the transcriptional profiles of typical breast hyperplasias, driving previously unidentified changes affecting responses to estrogen and possibly other pathways(51).

In summary, the purpose of this study is to improve our understanding of the molecular mechanisms of breast cancer development through comprehensive bioinformatics analysis aimed at identifying DEGs and breast cancer progression related pathways. Our research also identified some key candidate genes and biological pathways that can help find biomarkers and therapeutic targets for breast cancer. However, further molecular biology experiments are needed to verify the results of this study.

Author Contributions

Chao Liu conceived and supervised the study; Chao Liu designed experiments; Xiaoyu Ni and Haibing Yang performed experiments; Xiaoyu Ni and Haibing Yang analysed data; Xiaoyu Ni and Haibing Yang wrote the manuscript; Xiaoyu Ni and Haibing Yang made manuscript revisions. All authors reviewed the manuscript.

Compliance with ethical standards

Conflict of interest: The authors declare that they have no conflict of interest.

Ethical approval: This article does not contain any studies with human participants or animals performed by any of the authors.

Anastasiadi Z, Lianos GD, Ignatiadou E, Harissis HV and Mitsis M: Breast cancer in young women: an overview. Updates Surg 69: 313–317, 2017.
Odle TG: Precision Medicine in Breast Cancer. Radiol Technol 88: 401M-421M, 2017.
Kolak A, Kaminska M, Sygit K, Budny A, Surdyka D, Kukielka-Budny B and Burdan F: Primary and secondary prevention of breast cancer. Ann Agric Environ Med 24: 549–553, 2017.
Merino BJ, Torres TM and Ros ML: Breast cancer in the 21st century: from early detection to new therapies. Radiologia 59: 368–379, 2017.
Kolde R, Laur S, Adler P and Vilo J: Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics 28: 573–580, 2012.
Dedeurwaerder S, Desmedt C, Calonne E, Singhal SK, Haibe-Kains B, Defrance M, Michiels S, Volkmar M, Deplus R and Luciani J, et al.: DNA methylation profiling reveals a predominant immune component in breast cancers. Embo Mol Med 3: 726–741, 2011.
Aswad L, Yenamandra SP, Ow GS, Grinchuk O, Ivshina AV and Kuznetsov VA: Genome and transcriptome delineation of two major oncogenic pathways governing invasive ductal breast cancer development. Oncotarget 6: 36652–36674, 2015.
Deva MRA, Patel K, Korivi JS, Meenakumari B, Sundersingh S, Sridevi V, Rajkumar T, Pandey A, Chatterjee A and Gowda H, et al.: Identification of lncRNAs associated with early-stage breast cancer and their prognostic implications. Mol Oncol 13: 1342–1355, 2019.
Sinn BV, Fu C, Lau R, Litton J, Tsai TH, Murthy R, Tam A, Andreopoulou E, Gong Y and Murthy R, et al.: SETER/PR: a robust 18-gene predictor for sensitivity to endocrine therapy for metastatic breast cancer. NPJ Breast Cancer 5: 16, 2019.
Lian ZQ, Wang Q, Li WP, Zhang AQ and Wu L: Screening of significantly hypermethylated genes in breast cancer using microarray-based methylated-CpG island recovery assay and identification of their expression levels. Int J Oncol 41: 629–638, 2012.
Turashvili G, Bouchal J, Baumforth K, Wei W, Dziechciarkova M, Ehrmann J, Klein J, Fridman E, Skarda J and Srovnal J, et al.: Novel markers for differentiation of lobular and ductal invasive breast carcinomas by laser microdissection and microarray analysis. Bmc Cancer 7: 55, 2007.
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W and Smyth GK: limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43: e47, 2015.
Kolde R, Laur S, Adler P and Vilo J: Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics 28: 573–580, 2012.
Chandrashekar DS, Bashel B, Balasubramanya S, Creighton CJ, Ponce-Rodriguez I, Chakravarthi B and Varambally S: UALCAN: A Portal for Facilitating Tumor Subgroup Gene Expression and Survival Analyses. Neoplasia 19: 649–658, 2017.
Li B, Severson E, Pignon JC, Zhao H, Li T, Novak J, Jiang P, Shen H, Aster JC and Rodig S, et al.: Comprehensive analyses of tumor immunity: implications for cancer immunotherapy. Genome Biol 17: 174, 2016.
Li T, Fan J, Wang B, Traugh N, Chen Q, Liu JS, Li B and Liu XS: TIMER: A Web Server for Comprehensive Analysis of Tumor-Infiltrating Immune Cells. Cancer Res 77: e108-e110, 2017.
Li B, Severson E, Pignon JC, Zhao H, Li T, Novak J, Jiang P, Shen H, Aster JC and Rodig S, et al.: Comprehensive analyses of tumor immunity: implications for cancer immunotherapy. Genome Biol 17: 174, 2016.
Li T, Fan J, Wang B, Traugh N, Chen Q, Liu JS, Li B and Liu XS: TIMER: A Web Server for Comprehensive Analysis of Tumor-Infiltrating Immune Cells. Cancer Res 77: e108-e110, 2017.
Grassmann F, He W, Eriksson M, Gabrielson M, Hall P and Czene K: Interval breast cancer is associated with other types of tumors. Nat Commun 10: 4648, 2019.
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA and Jemal A: Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68: 394–424, 2018.
Majidinia M and Yousefi B: DNA repair and damage pathways in breast cancer development and therapy. DNA Repair (Amst) 54: 22–29, 2017.
Soysal SD, Tzankov A and Muenst SE: Role of the Tumor Microenvironment in Breast Cancer. Pathobiology 82: 142–152, 2015.
Shaheen S, Fawaz F, Shah S and Busselberg D: Differential Expression and Pathway Analysis in Drug-Resistant Triple-Negative Breast Cancer Cell Lines Using RNASeq Analysis. Int J Mol Sci 19, 2018.
Wang L, Li J, Liu E, Kinnebrew G, Zhang X, Stover D, Huo Y, Zeng Z, Jiang W and Cheng L, et al.: Identification of Alternatively-Activated Pathways between Primary Breast Cancer and Liver Metastatic Cancer Using Microarray Data. Genes (Basel) 10, 2019.
Gkretsi V, Stylianou A, Kalli M, Louca M, Voutouri C, Zaravinos A and Stylianopoulos T: Silencing of Growth Differentiation Factor-15 Promotes Breast Cancer Cell Invasion by Down-regulating Focal Adhesion Genes. Anticancer Res 40: 1375–1385, 2020.
Costa R, Han HS and Gradishar WJ: Targeting the PI3K/AKT/mTOR pathway in triple-negative breast cancer: a review. Breast Cancer Res Treat 169: 397–406, 2018.
Isakova JT, Vinnikov D, Kipen VN, Talaibekova ET, Aldashev AA, Aldasheva NM, Makieva KB, Semetei KA, Bukuev NM and Tilekov EA, et al.: Gene-to-gene interactions and the association of TP53, XRCC1, TNFalpha, HMMR, MDM2 and PALB2 with breast cancer in Kyrgyz females. Breast Cancer-Tokyo, 2020.
Donnella HJ, Webber JT, Levin RS, Camarda R, Momcilovic O, Bayani N, Shah KN, Korkola JE, Shokat KM and Goga A, et al.: Kinome rewiring reveals AURKA limits PI3K-pathway inhibitor efficacy in breast cancer. Nat Chem Biol 14: 768–777, 2018.
Mahadevappa R, Neves H, Yuen SM, Bai Y, McCrudden CM, Yuen HF, Wen Q, Zhang SD and Kwok HF: The prognostic significance of Cdc6 and Cdt1 in breast cancer. Sci Rep 7: 985, 2017.
Booher K, Lin DW, Borrego SL and Kaiser P: Downregulation of Cdc6 and pre-replication complexes in response to methionine stress in breast cancer cells. Cell Cycle 11: 4414–4423, 2012.
Zhang S, Tischer T and Barford D: Cyclin A2 degradation during the spindle assembly checkpoint requires multiple binding modes to the APC/C. Nat Commun 10: 3863, 2019.
Ben YK, Doghri R, Mrad K, Ben RN and Ben AF: Cyclin A2 as a potential differential marker of splenic diffuse red pulp small B-cell lymphoma: a report of the first case. Ann Hematol 96: 511–512, 2017.
Whitfield ML, George LK, Grant GD and Perou CM: Common markers of proliferation. Nat Rev Cancer 6: 99–106, 2006.
De Vincenzo A, Belli S, Franco P, Telesca M, Iaccarino I, Botti G, Carriero MV, Ranson M and Stoppelli MP: Paracrine recruitment and activation of fibroblasts by c-Myc expressing breast epithelial cells through the IGFs/IGF-1R axis. Int J Cancer 145: 2827–2839, 2019.
Sahores A, Figueroa V, May M, Liguori M, Rubstein A, Fuentes C, Jacobsen BM, Elia A, Rojas P and Sequeira GR, et al.: Increased High Molecular Weight FGF2 in Endocrine-Resistant Breast Cancer. Horm Cancer 9: 338–348, 2018.
Trusca VG, Fuior EV, Kardassis D, Simionescu M and Gafencu AV: The Opposite Effect of c-Jun Transcription Factor on Apolipoprotein E Gene Regulation in Hepatocytes and Macrophages. Int J Mol Sci 20, 2019.
Chandrashekar DS, Bashel B, Balasubramanya S, Creighton CJ, Ponce-Rodriguez I, Chakravarthi B and Varambally S: UALCAN: A Portal for Facilitating Tumor Subgroup Gene Expression and Survival Analyses. Neoplasia 19: 649–658, 2017.
Liu S, Wang Z, Liu Z, Shi S, Zhang Z, Zhang J and Lin H: miR-221/222 activate the Wnt/beta-catenin signaling to promote triple-negative breast cancer. J Mol Cell Biol 10: 302–315, 2018.
Zhang L, Zhang Z and Yu Z: Identification of a novel glycolysis-related gene signature for predicting metastasis and survival in patients with lung adenocarcinoma. J Transl Med 17: 423, 2019.
Choi S, Wang D, Chen X, Tang LH, Verma A, Chen Z, Kim BJ, Selesner L, Robzyk K and Zhang G, et al.: Function and clinical relevance of RHAMM isoforms in pancreatic tumor progression. Mol Cancer 18: 92, 2019.
Buttermore ST, Hoffman MS, Kumar A, Champeaux A, Nicosia SV and Kruk PA: Increased RHAMM expression relates to ovarian cancer progression. J Ovarian Res 10: 66, 2017.
Purnell MC: Bio-electric field enhancement: the influence on hyaluronan mediated motility receptors in human breast carcinoma. Discov Med 23: 259–267, 2017.
Alvarez C, Tapia T, Cornejo V, Fernandez W, Munoz A, Camus M, Alvarez M, Devoto L and Carvallo P: Silencing of tumor suppressor genes RASSF1A, SLIT2, and WIF1 by promoter hypermethylation in hereditary breast cancer. Mol Carcinog 52: 475–487, 2013.
Terry R, Chintanaboina J, Patel D, Lippert B, Haner M, Price K, Tracy A, Lalos A, Wakeley M and Gutierrez LS: Expression of WIF-1 in inflammatory bowel disease. Histol Histopathol 34: 149–157, 2019.
Zhou Y, Li Z, Ding Y, Zhang P, Wang J, Zhang J and Wang H: Promoter methylation of WNT inhibitory factor-1 may be associated with the pathogenesis of multiple human tumors. J Cancer Res Ther 14: S381-S387, 2018.
Veeck J, Wild PJ, Fuchs T, Schuffler PJ, Hartmann A, Knuchel R and Dahl E: Prognostic relevance of Wnt-inhibitory factor-1 (WIF1) and Dickkopf-3 (DKK3) promoter methylation in human breast cancer. BMC Cancer 9: 217, 2009.
Kim H, Watkinson J, Varadan V and Anastassiou D: Multi-cancer computational analysis reveals invasion-associated variant of desmoplastic reaction involving INHBA, THBS2 and COL11A1. BMC Med Genomics 3: 51, 2010.
Ameku T, Taura D, Sone M, Numata T, Nakamura M, Shiota F, Toyoda T, Matsui S, Araoka T and Yasuno T, et al.: Identification of MMP1 as a novel risk factor for intracranial aneurysms in ADPKD using iPSC models. Sci Rep 6: 30013, 2016.
Lim JP, Nair S, Shyamasundar S, Chua PJ, Muniasamy U, Matsumoto K, Gunaratne J and Bay BH: Silencing Y-box binding protein-1 inhibits triple-negative breast cancer cell invasiveness via regulation of MMP1 and beta-catenin expression. Cancer Lett 452: 119–131, 2019.
Schafer SA, Hulsewig C, Barth P, von Wahlde MK, Tio J, Kolberg HC, Bernemann C, Blohmer JU, Kiesel L and Kolberg-Liedtke C: Correlation between SFRP1 expression and clinicopathological parameters in patients with triple-negative breast cancer. Future Oncol 15: 1921–1938, 2019.
Gregory KJ, Roberts AL, Conlon EM, Mayfield JA, Hagen MJ, Crisi GM, Bentley BA, Kane JJ, Makari-Judson G and Mason HS, et al.: Gene expression signature of atypical breast hyperplasia and regulation by SFRP1. Breast Cancer Res 21: 76, 2019.

No competing interests reported.

FigS1.tif
Fig.S1: Normalization of gene expression. (A-B) Normalization of the GSE5764 data set. (C-D) Normalization of the GSE33447 data set. (E-F) Normalization of the GSE124646 data set. (G-H) Normalization of the GSE139038 data set. (I-J) Normalization of the GSE61304 data set. (K-L) Normalization of the GSE20711 data set. Blue represents data before normalization, and red represents data after normalization.
