Gene networks and expression quantitative trait loci associated with platinum-based chemotherapy response in high-grade serous ovarian cancer

doi:10.21203/rs.2.21098/v2

Download PDF

Research article

Gene networks and expression quantitative trait loci associated with platinum-based chemotherapy response in high-grade serous ovarian cancer

https://doi.org/10.21203/rs.2.21098/v2

This work is licensed under a CC BY 4.0 License

Journal Publication

published 13 May, 2020

Read the published version in BMC Cancer →

You are reading this latest preprint version

Background A major impediment in the treatment of ovarian cancer is the relapse of platinum-resistant tumors, which occurs in approximately 25% of patients. A better understanding of the biological mechanisms underlying platinum-based chemotherapy response will improve treatment efficacy through genetic testing and novel therapies.

Methods Using data from high-grade serous ovarian carcinoma (HGSOC) patients in the Cancer Genome Atlas (TCGA), we classified those who remained progression-free for 12 months following platinum-based chemotherapy as “chemo-sensitive” (N=160) and those who had recurrence within six months as “chemo-resistant” (N=110). Univariate and multivariate analysis of expression microarrays identified a differentially expressed gene and co-expression gene networks associated with chemotherapy response. Moreover, we integrated genomics data to determine expression quantitative trait loci (eQTL).

Results Differential expression of the Valosin-containing protein ( VCP ) gene and five co-expression gene networks were significantly associated with chemotherapy response in HGSOC. VCP and the most significant co-expression network module contribute to protein processing in the endoplasmic reticulum, which has been implicated in chemotherapy response. Both univariate and multivariate findings were successfully replicated in an independent ovarian cancer cohort. Furthermore, we identified 192 cis-eQTLs associated with the expression of genes in the co-expression networks and 4 cis-eQTLs associated with BRCA2 expression.

Conclusion This study implicates both known and novel genes as well as biological processes underlying response to platinum-based chemotherapy among HGSOC patients.

Cancer Biology

Oncology

ovarian cancer

relapse

platinum-resistant tumors

treatment

high-grade serous ovarian carcinoma

HGSOC

Ovarian cancer is the most lethal gynecological malignancy and the 8^th leading cause of cancer death in women around the world.¹ According to the Global Cancer Observatory report in 2012, ovarian cancer accounts for 3.6% of all cancer cases and 4.3% of all cancer related deaths worldwide.² High-grade serous ovarian carcinoma (HGSOC) is the most malignant form of ovarian cancer that accounts for up to 70% of all cases.³ Routine diagnosis is often difficult due to the lack of mass screening methods and heterogenous manifestations of the cancer symptoms, which result in approximately 75% of patients diagnosed with advanced stages.⁴ The average 5-year survival rates are 39% for Stage 3 and 17% for Stage 4 cancers.¹

The current standard of care for ovarian cancer is aggressive cytoreductive surgery followed by platinum-based chemotherapy.⁴ However, this standard of care is not effective for all patients, with approximately 25% experiencing relapse within six months following platinum-based therapy, likely due to the development of antineoplastic resistance.⁵ The median survival time for recurrent ovarian cancers range from 12-24 months.⁶^,⁷ Treatment options for patients with recurrent ovarian cancer include non-platinum-based chemotherapy regimens, immunotherapy, and molecular targeted therapy.⁷^,⁸

Ovarian cancer has a multifactorial etiology that includes genetic and non-genetic risk factors. An estimated 23% of cases are hereditary, but the majority are sporadic with multiple reported risk factors such as history of gravidity, infertility, and late age menopause.⁹^,¹⁰ A better understanding of the etiology of ovarian cancer, as well as the genetic mechanisms underlying variable response to platinum-based chemotherapy, is needed for improved diagnosis and treatment. For example, previous studies reported that the BRCA1 and BRCA2 genes, which are associated with increased risk of ovarian cancer, harbor mutations associated with platinum drug sensitivity and survival.¹¹ Similarly, tumor suppressor genes such as RB1, NF1, RAD51B, PTEN have been associated with acquired chemotherapy resistance.¹² Earlier studies have also highlighted the importance of the immune system in the treatment of ovarian cancer. For example, loss of chemokines and disruptions to the IFN-γ pathway have been associated with poor treatment outcomes in HGSOC paients¹³ whereas the NFκB signaling pathway and elevated expression of STAT1 were associated with increased response to platinum therapy.¹⁴^,¹⁵^,¹⁶ However, these known genetic variations do not account for all of the variability in chemotherapy response among HGSOC patients and there is currently no screening method to accurately predict prognosis prior to start of chemotherapy. Thus, further studies are necessary to determine additional modulators of chemotherapy response, which can be used as biomarkers for genetic testing.

The majority of earlier studies of chemotherapy response in ovarian cancer patients used univariate analysis of gene expression data known as differential gene expression (DGE) analysis. For example, DGE analysis identified genes correlated with ovarian cancer subtypes in the TCGA cohort ¹⁷, which have also been associated with differential response to platinum-based chemotherapy¹⁸. Moreover, similar univariate methods have been applied to investigate gene expression differences in cisplatin sensitive vs. resistant ovarian cancer cell lines after cisplatin exposure.¹⁹ A limitation of DGE analysis is that it assumes each gene functions in isolation within the genome, which fails to capture the effects of complex gene-gene interactions. Our study of chemotherapy response in HGSOC patients applies a multivariate approach to identify groups of co-expressed genes, which may contribute to common biological pathways. These genes may each have modest effects that are not detected by conventional univariate analysis. Specifically, we applied Weighted Gene Co-expression Network Analysis²⁰ (WGCNA), which uses an unsupervised machine-learning algorithm to identify clusters of highly correlated or co-expressed genes. Moreover, we correlated sequence variations with co-expressed gene network expression to identify expression Quantitative Trait Loci (eQTLs), which are potentially regulatory variants associated with gene expression. In addition, our study used gene expression data profiled from whole patient tumors, which were obtained during the initial cytoreductive surgery. This allows us to examine the tumor microenvironment and tumor cell intrinsic events which are difficult to be studied in cell-line derived expression datasets. A better understanding of the biological mechanisms regulating chemotherapy response will enable more effective treatment by improving the accuracy of genetic testing and identifying novel therapies for HGSOC patients.

Patient Classification:

From the Cancer Genome Atlas (TCGA) Genomic Data Commons (GDC) portal²¹, we retrieved 587 high-grade serous ovarian carcinoma (HGSOC) patients with available clinical data using the TCGAbiolinks R/Bioconductor package.²² We selected for patients who received platinum-based adjuvant chemotherapy, the majority of which (96%) also received taxane treatment (see Supplemental Table 1 for characteristics of the cohort). A small percentage of the cohort has received additional adjuvant therapies in combination with platinum-based compounds, such as gemcitabine (9%), doxorubicin (2.6%), topotecan (2.6%), bevacizumab (2.2%), and tamoxifen (2.2%) (Supplemental Table 2). The interval between a patient’s last primary platinum treatment and the onset of a recurrent tumor or progression of an existing tumor was used as a metric for determining chemotherapy sensitivity. Patients who developed a new tumor in less than six months following their last platinum treatment were defined as resistant (N=110). In contrast, those who did not have a recurrent tumor event for over a year after their last primary platinum treatment were defined as sensitive (N=160). Individuals who had a recurrent tumor event between six months to one year following chemotherapy were excluded from the study. This strategy for dichotomizing resistant from sensitive patients was used to enrich for the genetic differences.

Transcriptomics Data Processing and Analysis:

Expression Microarrays: Of the 270 HGSOC subjects classified as sensitive or resistant to chemotherapy, 238 (138 sensitive, 100 resistant) had primary tumor microarray expression data available (Affymetrix ht_hg_u133a chip) in the GDC portal. The robust multi-array average (RMA) method²³ in the affy package from Bioconductor²⁴ was used for background correction, log-transformation, and quantile normalization of the probe intensities. Two potential outliers and two duplicated samples were removed from the study during the quality control step using the arrayQualityMetrics²⁵ package (see Supplemental Data 1 for steps of pre-processing), resulting in 135 sensitive and 99 resistant HGSOC subjects in the expression set. Next, probes were filtered using the median absolute deviation (MAD) whereby the top 50% with highest variation (n=11,107) were selected for analysis. This non-specific filtering step removed probes with low variability in expression across the cohort, which are not likely to be differentially expressed between sensitive and resistant patients, reducing the number of multiple testing corrections and therefore, the likelihood of false positives.

Covariates: We assessed multiple potential confounders for correlation with therapeutic outcome including age, race, surgery (cytoreductive) outcome, cancer grade, and cancer stage (Supplemental Table 1). With the exception of age (p = 0.0041), all factors showed no significant difference between chemo-sensitive and chemo-resistant patients. For this reason, age at diagnosis was included as a covariate in all subsequent analyses.

Differential Gene Expression Analysis: The Limma²⁶ package in Bioconductor²⁷ was used to identify differentially expressed genes between chemo-sensitive and resistant groups using linear models. The false discovery rate (FDR) method was employed as a measure for multiple testing correction to control for type I error.

Weighed Gene Co-expression Network Analysis (WGCNA): We performed hierarchical clustering of genes using the R package WGCNA²⁰, which groups genes based on their similarity in expression. This was achieved by first creating a similarity matrix using Pearson correlations of expression among all genes. The resulting matrix was raised to a power of 9, as suggested by the soft-thresholding power estimation plot (Supplemental Figure 1). Raising the correlation matrix to a power enriches for differences between weak and strong signals, allowing for better quantification of gene-gene interactions. The similarity matrix was transformed to a Topological Overlap Matrix (TOM), where the strength of association between a pair of genes is reinforced by the common neighbors shared by them. To avoid excessive splitting of genes into smaller modules, minimum module size was set to 30, split sensitivity (deep split) was set to 4, and modules with similar expression profiles were merged at a height of 0.5 (Supplemental Figure 2). Using principal component analysis, we calculated the module eigengene for each co-expression cluster to summarize module gene expression with a single measure. Each module eigengene was tested for association with platinum chemotherapy response using generalized linear models. Finally, we used Cytoscape²⁸, an open source bioinformatics platform, to visualize significant gene co-expression networks.

Gene function and pathway annotations: The Database for Annotation, Visualization and Integrated Discovery (DAVID) ²⁹ was employed to identify biological pathways and functions that were enriched in each significant gene co-expression module. We also screened significant genes in the GeneMANIA³⁰ database to identify for functional connections reported in published literature. Next, we searched the UCSC transcription factor binding site (TFBS) conservation sites track with DAVID to identify enriched motifs of transcription factors that may co-regulate genes within each cluster. Finally, we used the Drug–Gene Interaction database (DGIdb)³¹, a public database with curation of data describing relationships between genes, chemicals, drugs, and pathological phenotype, to identify genes with prior reported associations with chemotherapeutic agents.

Validation of differentially expressed gene: The Kaplan–Meier plotter tool was used to cross-validate the differential expression of VCP in an independent ovarian cancer cohort (Geo accession identifier: GSE9891).³² This replication cohort included gene expression profiling (Affymetrix Human Genome U133 Plus 2.0 Array) of 285 ovarian tumor samples. Patients were filtered to include those with cancer histology of serous carcinoma and who received chemotherapy containing a platinum compound to allow close comparison with the TCGA ovarian cancer cohort. This step omitted a total of 60 subjects from analysis, which included 21 with endometrioid carcinoma cases and 43 who did not receive platinum therapy (4 overlapping subjects). Thus, 225 patients remained for replication analysis. Patient survival was evaluated using a Cox proportional hazards model and progression-free survival (PFS) was the primary outcome used in the replication analysis.³³

Validation of co-expression networks:

For validation of co-expression networks, the SurvExpress database was used, which allows users to validate the combined effect of multiple gene expression measures with a target trait.³⁴ The same cohort and filtering steps were used for validation (GSE9891, N = 225). The survival curve was evaluated using a Cox proportional hazards model and PFS was the primary outcome used in the replication analysis.

Genomics Data Processing and Analysis:

Genomics Data: Single nucleotide polymorphisms (SNPs) data from germline tissues (DNA extracted from blood or solid non-tumor ovarian tissue) were obtained from the TCGA legacy database. The Affymetrix Genome-Wide Human SNP Array 6.0 was used to capture genetic variations, which detected 906,600 SNPs. Of the 270 subjects from TCGA classified as resistant or sensitive to platinum-based chemotherapy, 266 (157 sensitive and 109 resistant) had genotype data available.

Imputations: The imputation of autosomal chromosomes was performed using the Michigan imputation server pipeline³⁵. We used the 1000 Genome Project phase 3 sequencing data (version 5)³⁶ reference panel for the imputation of missing genotypes. We then used Eagle v.2.3³⁷ for phasing of the genotypes to their respective chromosomes. For the imputation of variants on the X chromosome, SHAPEIT³⁸ was used for phasing in combination with the 1000 genomes project phase 3 (version 5) reference panel (Supplemental Data 2).

Quality control:

Subject level: Two pairs of individuals had a relatedness coefficient (pi-hat) > 0.9, which are likely duplicated samples. One subject from each pair was randomly removed from the dataset. Next, inbreeding coefficients (F) were computed for each subject using PLINK³⁹. A total of 18 subjects with high homozygosity (F>0.05) or heterozygosity (F<-0.05) rates were excluded. Moreover, genetic sex was estimated based on heterozygosity rates (F) of the X chromosome, and four subjects who had undefined genetic sex (F>0.2) were removed from the study.

SNP level: SNPs with minor allele frequencies (MAF) less than 1% or with genotyping call rate less than 90% were removed. This step removed 38,430,595 SNPs with MAF < 0.01, resulting in 9,528,963 SNPs to be used for further analysis.

Genome-wide Association Study: After imputations and quality control, 240 subjects (N= 142 sensitive, 98 resistant) and a total of 9,528,963 SNPs (MAF > 0.1) remained available for analysis (Supplemental Data 3). Plink (v.1.90) was used to compute genome wide and BRCA1/2 targeted association analysis using a logistic regression model. We pruned variants in strong (r² > 0.8) linkage disequilibrium (LD) within the BRCA1/2 loci to determine independent association signals.

Variant annotations: Variant Effect Predictor (VEP)⁴⁰ was used to predict the functional consequence of the identified variants. Similarly, the database of Genome-Wide Repository of Associations Between SNPs and Phenotypes (GRASP)⁴¹ and Clinvar⁴² were used to identify variants with known phenotype associations.

Expression Quantitative Trait Loci (eQTL) Analysis:

Common SNPs (MAF > 0.01) were tested for association with gene expressions of BRCA1, BRCA2, and co-expression networks using the matrixeQTL R package⁴³. The correlation of a genotype with nearby gene expression indicates potential regulatory function of the SNP on the corresponding gene. These regulatory SNPs known as cis-expression Quantitative Trait Loci (cis-eQTL). Cis-eQTLs are defined as correlated SNPs found within 1 Mb from the gene transcriptional start site (TSS).

Univariate or differential gene expression (DGE) analysis tested the association of 11,107 probes with chemo-therapy response in HGSOC patients from TCGA. This identified that low expression of a probe (208648_at) mapping to the Valosin Containing Protein (VCP) gene was significantly associated with resistance to chemotherapy (FDR adjusted p-value < 0.05; Figure 1). Replication analysis in an independent ovarian serous cancer cohort following treatment with platinum antineoplastic agents using the Kaplan–Meier survival curve plotter demonstrated that low expression of VCP is associated with poor progression-free survival (p = 0.015) and shorter median survival time (Figure 2A). In addition to VCP, DGE yielded 628 probes mapping to 534 unique genes that were nominally correlated with chemotherapy response (unadjusted p-value < 0.05). We report these findings in Supplemental Table 3.

The hierarchical clustering of genes using WGCNA resulted in 86 unique modules of co-expressed genes (Supplemental Table 4). Each module was assessed for association with chemotherapy response (results shown in Figure 3B). Five gene clusters (honeydew1, lightcyan1, lightpink3, orangered4, and skyblue3) were significantly co-downregulated in platinum-resistant patients (p < 0.05) (Figure 3A). These were validated in an independent ovarian cancer cohort by Tothill et al.³² using SurvExpress, which demonstrated that the downregulation of the genes in the five modules was significantly associated with reduced patient survival (Figure 2B). These five significant modules were annotated using DAVID, which identified gene enrichment for biological pathways including protein processing in the endoplasmic reticulum, apoptosis, negative regulation of the Wnt signaling pathway, transcription, immune responses, and DNA double-strand break processing involved in repair via single-strand annealing. GeneMANIA analysis showed that genes in these modules were previously reported in 49 publications, some of which documented associations with oncogenic pathways and chemotherapeutic outcomes (Supplemental Data 4).

We performed a search of network module genes in the gene-drug interaction database (DGIdb) and found that 35 genes were associated with chemotherapeutic agents. These include: carboplatin and paclitaxel, which are often used as a first-line chemotherapy option for ovarian cancer patients; gemcitabine and bevacizumab, which are approved agents for the treatment of ovarian cancer; and various tyrosine kinase inhibitors (TKI), which are a type of targeted therapy commonly used for the treatment of chronic myeloid leukemia and other malignancies.⁴⁴

Furthermore, we identified common transcription factor binding sites located within genes from each module. For example, we identified that over 96% of genes (49/53 genes) found in the honeydew1 module have a matching motif for the human organic cation transporter 1 transcription factor (OCT1). Similarly, we report that the acute myeloid leukemia 1 (AML1) motif maps to over 45% of genes found in orangered4 module. Both of these transcription factors are associated with oncogenic processes and therapeutic outcome.^45,46 A detailed list of functional annotations, transcription factors and pathways related to gene modules can be found in Supplemental Data 4.

Our GWAS of SNPs did not identify any variants correlated with platinum chemotherapy response after multiple testing correction. The Manhattan plot (Supplemental Figure 3) demonstrates that none of the SNPs meet the genome-wide significance threshold (p<5x10-8), as indicated by the red horizontal line. This is likely due to insufficient statistical power resulting from the low number of subjects in the TCGA-OV cohort. Next, we performed a targeted association analysis of two well-known genes associated with ovarian cancer and chemotherapeutic outcomes: BRCA1 and BRCA2. Of the 238 SNPs in BRCA1 and 256 in BRCA2, we identified 56 independent variants in BRCA1 and 86 such variants in BRCA2 after pruning for LD (r² > 0.8). Association analysis determined that 8 SNPs in BRCA2 and 1 SNP in BRCA1 were significantly associated with chemotherapy response. GRASP analysis identified that half of the identified BRCA2 variants (rs11571686, rs7337574, rs10492397, rs1207952) have been previously associated with varied Low and High Density Lipoprotein (LDL/HDL) cholesterol levels. Similarly, annotation analysis using Clinvar database reported that 4 of the associated variants in BRCA2 (rs11571584, rs11571686, rs9567600, rs7337574) are linked with an increased risk of developing breast and ovarian cancer at an earlier age (Supplemental Table 5).

Next, SNPs were tested for correlation with the expression of the 5 network modules. This identified 192 cis-eQTLs associated with gene expression in co-expression networks. (Supplemental Data 5). Moreover, of the 8 significant SNPs found in BRCA2, 6 were identified as cis-eQTLs for nearby genes, including 4 that were specifically associated with BRCA2 gene expression (Supplemental Table 5).

In this manuscript, we identified known and novel genes and gene networks correlated with variable response to platinum-based chemotherapy in HGSOC patients. Using a univariate analysis approach, we identified a differentially expressed gene encoding the valosin-containing protein (VCP) associated with sensitivity to platinum-based chemotherapy. In addition, we applied a multivariate co-expression network analysis method which identified five clusters of co-expressed genes correlated with chemo-response. Genes in these modules were enriched for biological pathways such as protein processing in the endoplasmic reticulum, apoptosis, transcription, immune response, negative regulation of the Wnt signaling pathway and DNA double-strand break processing involved in repair via single-strand annealing. Moreover, we identified potentially regulatory variants (i.e. eQTLs) correlated with the expression of known oncogenic genes associated with chemotherapy outcome such as BRCA1/2. Our study contributes to a better understanding of the biological processes underlying chemo-therapy response in HGSOC, which could help facilitate genetic testing and novel therapies.

The most significantly associated probe identified in the DGE analysis was for a gene encoding Valosin-containing protein (VCP, p = 3.91E-06). We have confirmed that this signal is replicated in an independent ovarian cancer cohort with statistical significance (p=0.015). VCP plays a critical role in disintegrating large polypeptide cellular structures for further degradation by proteolytic enzymes. It functions to regulate important pathways of DNA repair, replication and cell cycle progression by removing faulty polypeptide structures from chromatin material, ribosomes, endoplasmic reticulum and mitochondria. VCP is an ovarian cancer-specific essential gene as demonstrated by a pooled short hairpin RNA (shRNA) screen in 25 ovarian cancer cell lines⁴⁷, and is also essential in cyclin E1 overexpressing cisplatin-resistant ovarian cancer cells.⁴⁸ In alignment with these findings, VCP has been investigated as a drug target for ovarian cancer therapy. For example, Bastola et. al. (2016) reported that VCP inhibitors induce cell death in ovarian cancer cell lines through the endoplasmic reticulum stress pathway.⁴⁹ In addition, this study reports an association between low VCP expression and poor response to platinum-based chemotherapy in multiple ovarian cancer cohorts. VCP has also been previously identified as a potential biomarker for predicting the success of platinum-based chemotherapy in lung cancer patients.⁵⁰

In our co-expression network analysis, the gene module “honeydew1” showed the most significant correlation with chemotherapy response (p = 6.53e-05). This association signal was validated with statistical significance (p = 5.88e-07) in an independent ovarian cancer replication cohort. This module includes two probes that map to VCP, a gene that was associated with platinum-based chemotherapy response in our DGE analysis. Genes in this module were associated with positive regulation of mitochondrial membrane potential, protein ubiquitination, mitosis, alternative splicing, and apoptotic processes. Pathway analysis showed that this module is involved in protein processing in the endoplasmic reticulum. A prior study found that VCP plays a crucial role in ovarian cancer cell survival through extraction and degradation of unfolded proteins in endoplasmic reticulum, and noted that lower expression of VCP was associated with poor response to platinum‐based chemotherapy.⁴⁹ In alignment with this finding, genes co-expressed in the honeydew1 module were co-downregulated in chemo-resistant patients.

The honeydew1 module is composed of 76 probes mapping to 53 unique genes, and of these, 45 genes are located in chromosome 9, demonstrating the importance of chromosome 9 in the regulation of chemo-resistance in ovarian cancer. These findings support previous studies, where genetic imbalance and alterations in chromosome 9 have been associated with progression of ovarian cancer and increased cisplatin resistance.⁵¹ Analysis of overrepresented transcription factor binding sites demonstrated that genes in this module may be co-regulated by a common transcription factor known as organic cation transporter 1 (OCT1). We found that over 96% of genes in this module (49/53 genes) contain a nucleotide motif bound by OCT1. Prior studies have reported that silencing OCT1 impaired cisplatin-induced apoptosis in esophageal cancer cells, and that cisplatin-resistant cells were already expressing significantly reduced levels of OCT1.⁴⁵ Taken together, these findings characterize a network of co-expressed genes that is associated with platinum-resistance in ovarian cancer. Genes within this module may be co-regulated by the OCT1 transcription factor, which may be used as a novel potential target for ovarian cancer therapies.

The other four co-expression modules, which were also replicated in the independent cohort, include genes known to be involved in oncogenic process and drug response outcomes. For example, the orangered4 module, which was downregulated in resistant patients, consists of genes associated with regulation of the immune response. Genes in this module are associated with functional annotation terms including immunoglobulin receptor binding, antigen binding, B cell receptor signaling pathway, and phagocytosis. The repression of patient immune response is a well-known cancer survival mechanism, which has been shown to play a role in chemotherapy resistance in HGSOC.¹⁴^,¹⁵ In addition, 10 of the 22 genes in this module are enriched for a common transcription factor binding site: acute myeloid leukemia 1 protein (AML1). This transcription factor is involved in the haematopoiesis process and immune functions such as thymic T-cell development. AML1 expression was found to be associated with cancer cell proliferation, migration and invasion in ovarian cancer.⁴⁶ In addition, we found that the lightpink3 module is strongly associated with the transcription regulation process, which plays a pivotal role in cancer progression⁵². Finally, genes in the lightcyan1 and skyblue3 modules are target regions of well-known chemotherapeutic agents for ovarian cancer such as carboplatin, paclitaxel, bevacizumab and gemcitabine. Many of these genes are also target regions for various TKIs, which have been reported to enhance the efficacy of cisplatin treatment and progression free survival in ovarian cancer.^53,54 For instance, our DGIdb search showed that the expression of the non-receptor tyrorine kinase YES1 (YES Proto-Oncogene 1, Src Family Tyrosine Kinase) in the skyblue3 module and the serine/threonine kinase MAPK1 (Mitogen-activated protein kinase 1) in the lightcyan1 module is inhibited when TKIs are introduced (Dasatinib, Ibrutinib, Rebastinib, Ulixertinib, etc.) (Supplemental Data 4).

Targeted analysis of BRCA1 and BRCA2 SNPs demonstrated that 6 out of 9 variants associated with chemotherapy response were also cis-acting eQTLs, correlated with the expression of BRCA2 as well as neighboring genes N4BP2L1, N4BP2L2, FRY, and STARD13 (nominal p-value <0.05). Both BRCA2 and STARD13 are well known tumor-suppressors, and upregulation of N4BP2L1 and N4BP2L2 has been associated with positive prognosis in ovarian cancer cases.⁵⁵ The majority of cis-eQTLs in BRCA2 was associated with the upregulation of BRCA2 in chemotherapy resistant patients (Supplemental Table 5). The downregulation of BRCA2 reduces the expression of the homologous recombination (HR) pathway-associated RAD51 protein and suppresses DNA repair in ovarian cancer cells, sensitizing them to cisplatin ⁵⁶. In addition, BRCA2 upregulation has been shown to promote HR DNA repair and radioresistance in pancreatic cancer cells ⁵⁷. This finding indicates that the potential regulation of BRCA2 expression by the cis-eQTLs we identified may enhance the HR pathway function in resistant patients. However, functional experiments are needed to confirm this finding. Finally, our annotation results show that half of platinum-response associated variants in BRCA2 are linked with LDL/HDL cholesterol levels (Supplemental Table 5). Prior studies of lung and ovarian cancers consistently reported that cholesterol levels may affect the efficacy of platinum-based chemotherapeutic agents.⁵⁸^,⁵⁹ Our findings indicate a new link between genetic variants in BRCA2 and platinum-based chemotherapy response through cholesterol level regulation.

One limitation of our study is that 96% of patients who received platinum-based chemotherapy also received taxane therapy. Thus, it is impossible to distinguish whether our results reflect platinum sensitivity, or sensitivity to the combinatorial therapy of platinum and taxane. Further studies are needed to validate the association signal between identified genes and platinum-specific resistance. Despite the successful replication of study findings through Kaplan Meier analysis, another limitation is that our analysis and validation results are in silico-based predictions. Further studies are therefore required to identify and ultimately validate the functional effects of gene networks and their association to chemotherapy response through experimental validation.

In this study, we identified genes and gene networks correlated with platinum-based chemotherapy response in high-grade serous ovarian cancer patients, which implicate both known and novel biological mechanisms. Specifically, we identified that reduced expression of VCP is associated with platinum-resistance. This gene is critical for removing unfolded proteins from the endoplasmic reticulum and has been known to be associated with cancer cell survival and response to platinum‐based chemotherapy. In addition, we identified a group of genes associated with platinum sensitivity that are co-expressed with VCP on chromosome 9. Genes from this module are involved in the protein processing in the endoplasmic reticulum pathway, which has been previously implicated in chemotherapy resistance and cancer cell survival. Finally, we report potentially cis-acting regulatory variants in BRCA2 gene that are associated with varied expression of BRCA2. In summary, our study contributes to a better understanding of the biological mechanisms underlying chemotherapy response in high-grade serous ovarian cancer. Our findings could help improve future patient screening and therapeutics for ovarian cancer through the identification of gene signatures that may predict chemotherapy response, as well as the potential discovery of novel drug targets.

DGE: Differential Gene Expression

eQTL: Expression Quantitative Trait Loci

FDR: False Discovery Rate

GDC: Genomic Data Commons

GEO: Gene Expression Omnibus

GWAS: Genome-Wide Association Study

HDL: High Density Lipoprotein

HGSOC: High-Grade Serous Ovarian Carcinoma

HR: Homologous Recombination

KM: Kaplan-Meier

LD: Linkage Disequilibrium

LDL: Low Density Lipoprotein

MAD: Median Absolute Deviation

PFS: Progression-Free Survival

RMA: Robust Multiarray Average

shRNA: pooled short hairpin RNA

SNP: Single Nucleotide Polymorphism

TCGA: The Cancer Genome Atlas

TFBS: Transcription Factor Binding Site

TKI: Tyrosine Kinase Inhibitor

TOM: Topological Overlap Matrix

TSS: Transcriptional Start Site

WGCNA: Weighted Gene Co-expression Network Analysis

Ethics approval and consent to participate

Controlled access to datasets from The Cancer Genome Atlas (TCGA) was provided by the National Institute of Health (NIH). The terms of access are outlined in the Data Use Certification Agreement: https://dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?view_pdf&wlid=10654&tlsid=274. Details of the Human subject protection and data access policies implemented by TCGA are also described: https://www.cancer.gov/about- nci/organization/ccg/research/structural-genomics/tcga/history/policies/tcga-human-subjects-data-policies.pdf. Moreover, local ethics clearance for human subject research was granted by Queen’s University.

Consent for publication

Not applicable.

Availability of data and material

Transcriptomics, genomics, and clinical data used for the analysis of TCGA-OV cohort can be accessed/downloaded from the Genomic Data Commons (GDC) Data Portal (https://portal.gdc.cancer.gov/). Gene expression and clinical data of the replication cohort can be accessed/downloaded from Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE9899). Initial version of this manuscript have been deposited in a non-commercial preprint repository (bioRxiv accession number: 740696).⁶⁰

Competing interests

The authors declare no conflicts of interest.

Funding

J.C. is funded by Queen's University, Faculty of Health Sciences Dean's Doctoral Award and Ontario Graduate Scholarship. M.K. receives funding from the Canadian Institutes of Health Research and Ontario Ministry of Research Innovation and Science, Early Researcher Award. Q.L.D. receives funding from the Canadian Institutes of Health Research and Queen’s University.

Author contribution

J.C. performed the data analyses and drafted the manuscript. D.G.T assisted in literature review and manuscript editing. D.G.T., S.N. and A.T. assisted in the data analyses. Q.L.D. designed the research project, supervised data analyses and assisted in the writing of the manuscript. M.K. assisted in the study design and editing of the manuscript.

Acknowledgements

Computations in this manuscript were performed on resources and with support provided by the Centre for Advanced Computing (CAC) at Queen's University in Kingston, Ontario. The CAC is funded by: the Canada Foundation for Innovation, the Government of Ontario, and Queen's University.

Reid, F. World Ovarian Cancer Coalition 2018. World Ovarian Cancer Coalit. (2018).
International Agency for Research on Cancer Website. Globocan 2012 - Home. Webpage (2012). doi:NO:11
Brett M., R. et al. Epidemiology of ovarian cancer: a review. Cancer Biol. Med. (2017). doi:10.20892/j.issn.2095-3941.2016.0084
Cannistra, S. A. Cancer of the Ovary. N. Engl. J. Med. 351, 2519–2529 (2004).
Miller, D. S. et al. Phase II Evaluation of Pemetrexed in the Treatment of Recurrent or Persistent Platinum-Resistant Ovarian or Primary Peritoneal Carcinoma: A Study of the Gynecologic Oncology Group. J. Clin. Oncol. 27, 2686–2691 (2009).
Armstrong, D. K. Relapsed ovarian cancer: challenges and management strategies for a chronic disease. Oncologist 7 Suppl 5, 20–28 (2002).
Foley, O. W., Rauh-Hain, J. A. & del Carmen, M. G. Recurrent epithelial ovarian cancer: an update on treatment. Oncology (Williston Park). 27, 288–94, 298 (2013).
Ozols, R. F. Recurrent ovarian cancer: Evidence-based treatment. Journal of Clinical Oncology 20, 1161–1163 (2002).
Walsh, T. et al. Mutations in 12 genes for inherited ovarian, fallopian tube, and peritoneal carcinoma identified by massively parallel sequencing. Proc. Natl. Acad. Sci. (2011). doi:10.1073/pnas.1115052108
Booth, M., Beral, V. & Smith, P. Risk factors for ovarian cancer: A case-control study. Br. J. Cancer (1989). doi:10.1038/bjc.1989.320
Vencken, P. M. L. H. et al. Chemosensitivity and outcome of BRCA1- and BRCA2-associated ovarian cancer patients after first-line chemotherapy compared with sporadic ovarian cancer patients. Ann. Oncol. (2011). doi:10.1093/annonc/mdq628
Patch, A.-M. et al. Whole-genome characterization of chemoresistant ovarian cancer. Nature 521, 489–94 (2015).
Hao, D. et al. Immunogenomic analyses of advanced serous ovarian cancer reveal immune score is a strong prognostic factor and an indicator of chemosensitivity. Clin. Cancer Res. 24, 3560–3571 (2018).
Koti, M. et al. A distinct pre-existing inflammatory tumour microenvironment is associated with chemotherapy resistance in high-grade serous epithelial ovarian cancer. Br. J. Cancer 112, 1215–1222 (2015).
Au, K. K. et al. STAT1-associated intratumoural TH1 immunity predicts chemotherapy resistance in high-grade serous ovarian cancer. J. Pathol. Clin. Res. (2016). doi:10.1002/cjp2.55
Koti, M. et al. Identification of the IGF1/PI3K/NF κB/ERK gene signalling networks associated with chemotherapy resistance and treatment response in high-grade serous epithelial ovarian cancer. BMC Cancer 13, 549 (2013).
Cancer, T. & Atlas, G. Integrated genomic analyses of ovarian carcinoma. Nature 474, 609–15 (2011).
Sun, J. et al. Large-scale integrated analysis of ovarian cancer tumors and cell lines identifies an individualized gene expression signature for predicting response to platinum-based chemotherapy. Cell Death Dis. 10, 1–12 (2019).
Li, J., Iii, W. H. W., Becker, K. G., Weeraratna, A. T. & Morin, P. J. Gene expression response to cisplatin treatment in drug-sensitive and drug-resistant ovarian cancer cells. 2860–2872 (2007). doi:10.1038/sj.onc.1210086
Langfelder, P. & Horvath, S. WGCNA : an R package for weighted correlation network analysis. BMC Bioinformatics 9, 559 (2008).
Grossman, R. L. et al. Toward a shared vision for cancer genomic data. New England Journal of Medicine (2016). doi:10.1056/NEJMp1607591
Colaprico, A. et al. TCGAbiolinks: An R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 44, e71 (2016).
Irizarry, R. A. et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4, 249–264 (2003).
Gautier, L., Cope, L., Bolstad, B. M. & Irizarry, R. A. Affy - Analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20, 307–315 (2004).
Kauffmann, A., Gentleman, R. & Huber, W. arrayQualityMetrics — a bioconductor package for quality assessment of microarray data. Bioinformatics 25, 415–416 (2009).
Smyth, G. limma: Linear Models for Microarray Data. in Bioinformatics and Computational Biology Solutions Using R and Bioconductor 397–420 (2005). doi:citeulike-article-id:5722720
Gentleman, R. et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 5, R80 (2004).
Shannon, P. et al. Cytoscape: A software Environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57 (2009).
Warde-Farley, D. et al. The GeneMANIA prediction server: Biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 38, (2010).
Cotto, K. C. et al. DGIdb 3.0: a redesign and expansion of the drug–gene interaction database. Nucleic Acids Res. (2017). doi:10.1093/nar/gkx1143
Tothill, R. W. et al. Novel Molecular Subtypes of Serous and Endometrioid Ovarian Cancer Linked to Clinical Outcome. Clin. Cancer Res. 14, 5198–5208 (2008).
Gyorffy, B., Lánczky, A. & Szállási, Z. Implementing an online tool for genomewide validation of survival-associated biomarkers in ovarian-cancer using microarray data from 1287 patients. Endocr. Relat. Cancer 19, 197–208 (2012).
Aguirre-Gamboa, R. et al. SurvExpress: An Online Biomarker Validation Tool and Database for Cancer Gene Expression Data Using Survival Analysis. PLoS One 8, (2013).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
Loh, P. R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. (2016). doi:10.1038/ng.3679
Delaneau, O. et al. Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel. Nat. Commun. (2014). doi:10.1038/ncomms4934
Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biol. (2016). doi:10.1186/s13059-016-0974-4
Leslie, R., O’Donnell, C. J. & Johnson, A. D. GRASP: Analysis of genotype-phenotype results from 1390 genome-wide association studies and corresponding open access database. Bioinformatics (2014). doi:10.1093/bioinformatics/btu273
Landrum, M. J. et al. ClinVar: Public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. (2014). doi:10.1093/nar/gkt1113
Shabalin, A. A. Matrix eQTL : ultra fast eQTL analysis via large matrix operations. 28, 1353–1358 (2012).
Jiao, Q. et al. Advances in studies of tyrosine kinase inhibitors and their acquired resistance. Molecular Cancer 17, 1–12 (2018).
Lin, R. et al. Long-term cisplatin exposure promotes methylation of the OCT1 gene in human esophageal cancer cells. Dig. Dis. Sci. (2013). doi:10.1007/s10620-012-2424-9
Keita, M. et al. The RUNX1 transcription factor is expressed in serous epithelial ovarian carcinoma and contributes to cell proliferation, migration and invasion. Cell Cycle (2013). doi:10.4161/cc.23963
Cheung, H. W. et al. Systematic investigation of genetic vulnerabilities across cancer cell lines reveals lineage-specific dependencies in ovarian cancer. Proc. Natl. Acad. Sci. U. S. A. 108, 12372–12377 (2011).
Etemadmoghadam, D. et al. Synthetic lethality between CCNE1 amplification and loss of BRCA1. doi:10.1073/pnas.1314302110
Bastola, P., Neums, L., Schoenen, F. J. & Chien, J. VCP inhibitors induce endoplasmic reticulum stress, cause cell cycle arrest, trigger caspase-mediated cell death and synergistically kill ovarian cancer cells in combination with Salubrinal. Mol. Oncol. (2016). doi:10.1016/j.molonc.2016.09.005
Peng, J. et al. VCP gene variation predicts outcome of advanced non-small-cell lung cancer platinum-based chemotherapy. Tumor Biol. 34, 953–961 (2013).
Devlin, J., Elder, P. A., Gabra, H., Steel, C. M. & Knowles, M. A. High frequency of chromosome 9 deletion in ovarian cancer: Evidence for three tumour-suppressor loci. Br. J. Cancer (1996). doi:10.1038/bjc.1996.75
Ell, B. & Kang, Y. Transcriptional control of cancer metastasis. Trends in Cell Biology 23, 603–611 (2013).
Brands, R. C. et al. Multi-kinase inhibitors and cisplatin for head and neck cancer treatment in vitro. Oncol. Lett. 18, 2220—2231 (2019).
Katopodis, P. et al. Kinase Inhibitors and Ovarian Cancer. Cancers (Basel). 11, (2019).
Koussounadis, A., Langdon, S. P., Harrison, D. J. & Smith, V. A. Chemotherapy-induced dynamic gene expression changes in vivo are prognostic in ovarian cancer. Br. J. Cancer 110, 2975–2984 (2014).
Wan, B. et al. Knockdown of BRCA2 enhances cisplatin and cisplatin-induced autophagy in ovarian cancer cells. Endocr. Relat. Cancer 25, 69–82 (2018).
Xia, F. et al. Deficiency of human BRCA2 leads to impaired homologous recombination but maintains normal nonhomologous end joining. Proc. Natl. Acad. Sci. U. S. A. 98, 8644–8649 (2001).
Wu, Y. et al. Cholesterol reduces the sensitivity to platinum-based chemotherapy via upregulating ABCG2 in lung adenocarcinoma. Biochem. Biophys. Res. Commun. (2015). doi:10.1016/j.bbrc.2015.01.035
Kim, S., Lee, M., Dhanasekaran, D. N. & Song, Y. S. Activation of LXRɑ/$β$ by cholesterol in malignant ascites promotes chemoresistance in ovarian cancer. BMC Cancer 18, 1232 (2018).
Choi, J. et al. Gene networks and expression quantitative trait loci associated with platinum-based chemotherapy response in high-grade serous ovarian cancer. bioRxiv 740696 (2019).

Supplemental Figure 1. Selection of soft-thresholding power for weighted gene coexpression network analysis (WGCNA). Scale independence plot on the left shows the change of scale free fit index (r2) per every increment of power. The mean connectivity plot on the right shows the change of average connectivity between genes for each power change. These two plots give guidance in choosing the optimal power in transforming the similarity matrix. Results from both plots indicate that at power 9, network reaches optimal scale free fit index.

Supplemental Figure 2. Module dendrogram retrieved from hierarchical clustering of module eigengenes. The figure shows the dendrogram (tree diagram) of modules identified from co-expression clustering analysis of WGCNA pipeline. We merged modules showing high similarity to reduce excessive split of genes into many small sized clusters. Red horizontal line shows the threshold we used to merge modules with high similarity.

Supplemental Figure 3. Manhattan plot of genome-wide SNP association study (GWAS). The figure shows the association between each individual SNP and status of chemoresistance. Each dot in Manhattan plot represents an individual SNP, x-axis displays the chromosomes which the variants are from and y-axis shows -log10 transformed p-value. Blue horizontal line shows genome-wide suggestive significance threshold (10e-5) and red horizontal line shows the genome-wide significance threshold (5e-8).

Download PDF

Journal Publication

published 13 May, 2020

Read the published version in BMC Cancer →

Review #1 received at journal
09 Apr, 2020
Reviewer #2 agreed at journal
31 Mar, 2020
Reviewers invited by journal
24 Mar, 2020
Reviewer #1 agreed at journal
24 Mar, 2020
Editor assigned by journal
23 Mar, 2020
Submission checks completed at journal
22 Mar, 2020
Editor invited by journal
22 Mar, 2020

You are reading this latest preprint version

Gene networks and expression quantitative trait loci associated with platinum-based chemotherapy response in high-grade serous ovarian cancer

Status:

Journal Publication

Version 2

Abstract

Figures

Introduction

Methods

Results

Discussion

Conclusion

Abbreviations

Declarations

References

Supplemental Figure Information

Supplementary Files

Status:

Journal Publication

Version 2