Genetic architecture of pancreatic cyst
To our knowledge the genetic architecture of pancreatic cysts has not yet been characterized. The main goals of this study were to determine 1) the genetic architecture of benign pancreatic cysts using GWAS data, and 2) if this was associated with PC.
First, a discovery GWAS was performed to identify genetic associations in individuals with benign pancreatic cysts compared to healthy controls (707 individuals with benign pancreatic cysts (cases) and 14,000 healthy controls of European origin from the UK Biobank (UKBB). Additionally, a pairwise GWAS was performed to determine the genetic associations in individuals with PC compared to those with benign pancreatic cysts (584 individuals diagnosed with PC (cases) versus 707 individuals with benign pancreatic cysts (controls), supplementary Table S1). Individuals diagnosed with malignant cysts were excluded from both the discovery and pairwise GWAS analyses.
Analyses of the discovery GWAS identified rs142409042 genetic variant near the opioid binding protein/cell adhesion molecule like (OPCML) gene on chromosome 11 (Fig. 2A) as having genome-wide significance association (p < 10− 08) with benign pancreatic cysts versus healthy controls (odd ratio OR (95% confidence interval (CI) = 2.583 (1.835–3.636)). Similarly, rs7190458 genetic variant near the BCAR1 scaffold protein, Cas family member (BCAR1) and chymotrypsinogen B1 (CTRB1) on chromosome 16 (Fig. 2B) showed a genome-wide significant association with PC versus benign pancreatic cysts (OR (95%CI) = 0.152 (0.094–0.244)). Notably, there was no overlap of genome-wide significant SNPs p < 10− 08 between the discovery GWAS (benign pancreatic cysts versus healthy controls) and pairwise GWAS (PC versus benign pancreatic cysts) (Fig. 2, supplementary Table S2, S3).
Polygenic risk score (PRS) for prediction of the risk of developing benign pancreatic cyst
We next aimed to develop a PRS to predict the risk of benign pancreatic cysts. We refer to this PRS as benign pancreatic cyst PRS. Genetic variants identified in the discovery GWAS (benign pancreatic cyst versus healthy controls) were used to calculate PRS for each individual across the four groups: healthy controls, benign pancreatic cysts, PC, PC with cyst. To construct the PRS, 32 genetic variants from the discovery GWAS (p < 10− 05) were weighted by their effect sizes. PRS calculation was performed for each individual, and percentiles were established based on the PRS of the healthy control group. Each individual from the four groups was assigned a percentile based on her or his PRS score. The logistic regression model and ANOVA were performed to compare benign pancreatic cysts, PC with cyst and PC with healthy controls. PRS showed a significant association with the risk to develop benign pancreatic cysts versus healthy controls (OR (95%CI) = 2.159 (2.017–2.311), supplementary Table S4). 65% of individuals with benign pancreatic cysts were represented in the upper percentile (Fig. 2C). In the lower percentile, the PC with cyst and PC were represented (ANOVA p < 2 × 10− 16). A similar distribution pattern within the percentiles was noticed when analyzing females and males separately in supplementary Figure S1 and Table S4. This is consistent with the benign pancreatic cyst PRS being in a different range from PC and PC with cyst. Thus, benign pancreatic cyst PRS constructed from discovery GWAS were associated with benign cysts but not with PC or PC with cyst (Fig. 2C).
However, since some pancreatic cysts become malignant, we next investigated whether the genes associated with benign cysts interacted with or overlapped with the genes or gene products associated with PC. For this purpose, we mapped the discovery and pairwise GWAS genetic variants to the corresponding genes. For the next section, discovery GWAS mapped genes are referred to as GWAS-genes. These discovery GWAS-genes were used to test for overlap or molecular interactions with differentially expressed genes/proteins (DEGs/DEPs, respectively) from bulk and single cell transcriptomics or proteomics data. Interactions between GWAS-genes and DEGs/DEPs were sought based on the protein-protein interaction network based on STRING database.
Arp2/3 complex as a common molecular mechanism between pancreatic cancer and pancreatic cyst
We next tested if there was any association between the GWAS-genes and mRNAs/proteins in PC. To define such mRNAs/proteins, we analyzed pancreatic tumor tissues and normal tissues from The Cancer Genome Atlas (TCGA) and Clinical Proteomic Tumor Analysis Consortium (CPTAC) to identify DEGs and DEPs, respectively (Tumor versus normal, adjusted p value (p-adjusted) < 0.05, absolute log fold change > 0.25). We identified 2,169 DEGs/DEPs that were consistently upregulated or downregulated in tumors across TCGA and CPTAC. Of them, six genes were the GWAS-genes, namely NTM, ARPC3, CDCP1, FEZ2, FXYD2, and MYO1D. All the corresponding mRNAs and proteins were upregulated in tumors, except FXYD2 (supplementary Figure S2). We further hypothesized that the protein products of these six GWAS-genes interacted with these similarly upregulated or downregulated DEGs/DEPs within the tissue at a local level, potentially driving progression from pancreatic cyst to PC.
To test this hypothesis, we performed a PPI network analysis for six GWAS-genes based on the STRING database. We utilized PPIs with a confidence score (> 0.95) to exclude spurious interactions random (a score of 1 represents the highest possible confidence in the STRING database). Additionally, we also considered co-expression of the proteins based on the STRING database to identify the co-expressed interaction partners. We found that only one GWAS-gene protein product, ARPC3, was highly connected in PPI network (confidence score > 0.95, Fig. 3A, supplementary Table S5). We further identified ARPC3 protein as primary neighbors, directly interacted with ARPC2, ARPC1B, ARPC4, ARPC5, and ACTR3 (confidence score > 0.95 based on PPI and co-expression) in PC. These DEPs were part of the (Arp 2/3 complex. This complex, in turn, interacted with the effector downstream proteins including ACTG1 and ACTB DEPs with the high possible confidence score (confidence score > 0.95) (Fig. 3B). The Arp2/3 complex genes mentioned above as well as their downstream genes, referred to Arp2/3 complex-associated genes, were linked with the actin-related pathways, including regulation of actin dynamics for phagocytic cup formation 20.
Next, we hypothesized that Arp2/3 complex-associated genes have vital roles in the development of PC. So, we further investigated the expression patterns of their corresponding proteins in PC at both bulk level and the expression patterns of their corresponding mRNA in PC at cellular level.
Arp 2/3 complex-associated genes differed in gene expression and may be promising biomarkers in PC
To investigate the regulation of Arp2/3 complex-associated genes/proteins in PC, we analyzed the DEGs and DEPs of PC versus normal from TCGA and CPTAC. All eight genes and proteins associated with the Arp2/3 complex were upregulated significantly in PC patients of both TCGA and CPTAC datasets (absolute log fold change > 0.25 and p-adjusted < 0.05, Fig. 4A, supplementary Table S6). Upregulation of the Arp2/3 complex which is related to actin cytoskeleton and cellular movement indicated the importance of this complex in PC.
To examine if this pattern was generalizable, we analyzed protein data from 191 PC tumor and 90 paired tumor-adjacent tissues21. The analysis showed that all of the eight proteins were significantly upregulated in the PC patients (absolute log fold change > 0.25, p-adjusted < 0.05, Fig. 4B).
Following this, we extended our analysis to single-cell RNA sequencing (scRNA-seq) data to determine the presence of Arp2/3 complex-associated genes in specific cell types within the PC tumor microenvironment. We analyzed scRNA-seq data from 24 primary pancreatic tumors and 11 normal pancreases, subjecting 41,148 cells from tumor tissues and 15,444 cells from normal tissues to quality control and clustering. This process identified nine distinct cell types, each featuring 272 to 1,982 DEGs between tumor and normal tissues. Among these DEGs, genes ACTB, ARPC2, ARPC3, ARPC4, ARPC5, and ACTG1 of the Arp2/3 complex-associated genes were notably upregulated across multiple cell types, including acinar, fibroblast, endocrine, and epithelial cells (Fig. 4A).
Taken together, these analyses supported that the Arp2/3 complex-associated genes may play an important role in the progression from pancreatic cysts to PC at cellular levels.
Arp2/3 complex-associated genes as potential biomarkers for malignant transformation in pancreatic cysts
To systematically assess Arp2/3 complex-associated genes as potential biomarkers for malignant transformation from low-risk PC cysts to high-risk PC cysts, we explore whether the similar expression pattern of these genes was present in different cell types in HGD IPMN. We analyzed available scRNA-seq data from pancreatic cyst biopsies with HGD IPMN, LGD IPMN and primary pancreatic tumors10. After quality filtering, we analyzed 7,986 cells for cell typing and subsequent analyses. We identified two cell types: epithelial cell and macrophages (supplementary Figure S3). Due to low cell number/counts, we only focused on epithelial cells (n = 4,892 cells). Within this subset, we identified 327 DEGs for HGD IPMN (HGD versus LGD, absolute log fold change > 0.25, p-adjusted < 0.05) and 882 DEGs for PC (PC versus LGD) (supplementary Tables S7, S8). In epithelial cells, six genes associated with Arp2/3 were identified as DEGs in both PC and HGD IPMN when compared to LGD IPMN (Fig. 4C). These genes included ARPC3, ACTB, ACTR3, ARPC2, ARPC5, and ACTG1. In addition, ARPC4 and ARPC1B were found to be differentially expressed in PC epithelial cells compared to LGD IPMN (Fig. 4C, supplementary Table S8).
The further pathway enrichment analyses revealed that Arp2/3-associated genes, including ARPC3, ACTB, ACTR3, ARPC2, ARPC5, and ACTG1, were significantly enriched (p-adjusted < 0.05) in integrin signaling and signaling by Rho family GTPases in both epithelial cells of PC and HGD IPMN (Fig. 4D). These pathways, which are crucial for cellular adhesion and motility, are potentially relevant to malignant transformation in pancreatic cysts22,23.