Label-free quantification
In total, the LFQ method resulted in the identification of 49,146 unique peptides corresponding to 5,464 protein groups with a protein FDR level < 1% (Supplementary Table S1 online). For our data, we considered only a subset of 4,204 proteins with quantitative valid values across at least 70% of the samples. For a functional view of the proteomic data, we used volcano plots to compare expression differences between the ACP and PCP samples. Considering t-test results for pairwise comparisons and employing a filtering strategy based on a p-value < 0.05 and fold-change > 1.5, we identified 1223 proteins that showed significant differential expression, of which 458 were upregulated in the ACP samples and 765 were upregulated in the PCP samples (Fig, 2a, 2b, and Supplementary Table 2 online). To assess the quantitative reproducibility between biological replicates, we calculated the average Pearson correlation coefficients within and between groups. Interestingly, the biological correlation data indicated higher diversity among the ACP samples (average Pearson correlation = 0.76) compared with the PCP samples (average Pearson correlation = 0.88) (Supplementary Fig. 1 online). Hierarchical clustering and principal component analysis revealed tight clustering of the ACP and PCP samples and their corresponding biological replicates, indicating distinct protein expression patterns within each subtype (Fig. 2b and 2c). To analyze the functional differences between the ACP and PCP samples, we performed 1D annotation enrichment analysis based on GO using Perseus software15, and the results revealed 81 statistically significant features with a Benjamini-Hochberg FDR < 0.05 (Supplementary Table 3 online). The terms related to locomotory behavior, RNA processing, gene expression, the mitotic cell cycle, neuron projection, and microtubule-related process were mainly enriched in ACP subtype, while the terms associated with membrane, extracellular matrix disassembly, response to lipopolysaccharide or bacterium, and mitochondrial activity were significantly enriched in the PCP subtype (Fig. 2d).
Functional enrichment analysis was performed by combining two quantification strategies
To select DEPs with consistent results based on these two different quantitative methods, we analyzed the overlap of the DEP sets in each subtype. The overlap between LFQ and TMT in the ACP group was relatively small (17% of all DEPs in the ACP group). In contrast, DEP overlap in the LFQ and TMT data of the PCP group was 35% (426 proteins). We found a strong correlation (R = 0.89) between fold-changes of DEPs identified by both LFQ and TMT quantification strategies, suggesting that our quantitative analysis is highly reproducible and reliable (Fig. 4). To validate the results of our quantitative proteomic analysis, we performed Western blot analysis with a set of ACP samples (n = 4) and PCP samples (n = 4). The expression levels of EPCAM and P4HB were increased in the ACP and PCP sample sets, respectively (Fig. 4d and Supplementary Fig. 3 online), indicating that Western blot substantially verified the expressional differences first obtained by MS.
Enrichment analysis based on GO using the overlapping DEPs (total 574 proteins) was performed to identify the common cellular processed and pathways between the proteins identified in the TMT and label-free quantification experiments. Fischer’s exact test was used to identify numerous biological processes enriched among the DEPs (p < 0.05; Fig. 5 and Supplementary Table 7 online). This analysis revealed that DEPs representing distinct biological processes were significantly enriched in the different CP subtypes. Proteins that were upregulated in ACP were associated with nucleotide-excision repair, RNA splicing, the cell cycle, cell migration, and neuron development. In the case of the upregulated proteins in PCP, mitochondrial organization, oxidation-reduction process, fatty acid metabolic process, cell adhesion, apoptotic signaling pathway, exocytosis, and inflammatory response were significantly enriched.
To explore the collective functions of the DEPs according to subtype, we constructed a network model to describe the interactions among the DEPs involved in the GO BPs specifically enriched in each group (Fig. 6). First, the network model showed the upregulation of several processes that are known to be associated with the nucleus, including the cell cycle (DYNC1H1, GPS1, TUBA1A, RBBP4, BCAT1, CDK9, SAE1, SKP1, RPA1, RPA2, RPA3, HCFC1, NUP210, and CAMK2G) and RNA splicing (CSTF3, SNRPF, HNRNPU, HNRNPM, HNRNPH2, DDX, TARDBP, POLR2A, RTRAF, and PPIL3). The network model further showed that the upregulation of dendrite development (MAP1B and MECP2), neuron development (PPT, NCAM1, and SYT1), and glial cell development (CLU, DAG1, and ILK) were upregulated in ACP. Finally, the network model revealed upregulation of cell migration (PRKCA, TNS1, THBS1, RHOB, and PFN2) and regulation of transmembrane proteins (PARP1, STRAP, and RPS27A) in ACP compared with PCP.
In contrast, the network model shows an upregulation of pathways related to mitochondrial functions, including mitochondrial organization and fatty acid metabolism, in PCP compared with ACP (Fig. 6). Consistent with the upregulation of these pathways, the network model further showed that exocytosis (RAB27B, ECM1, STYL1, SDC1, ITIH3, S100A13, F13A1, TMED10, MYH10, LAMP2, APLP2, and STX4) and inflammation (S100A8, S100A9, NMI, C1QBP, IL1RAP, IL1RN, NFKB1, ANXA1, PRDX5, and LYZ) were also upregulated in PCP. Additionally, the proteins associated with apoptosis were largely upregulated in PCP, suggesting complex alteration patterns in the mitochondrial pathway of apoptosis16. Moreover, the network model showed the upregulation of many proteins involved in cell adhesion. Collectively, the upregulation of these key mitochondrial-associated processes in the network model suggests changed mitochondrial function and activity in PCP compared with ACP.
To identify transcription factors (TFs) that can drive the subtype-related changes in protein expression observed in our proteomic analysis, we predicted transcription factor binding sites (TFBSs) in DEPs and selected candidate regulators. The upregulated proteins in ACP showed TF BS enrichment with USF1, MYC, E2F1, ARNT, and NRF1 in both quantitative proteomic datasets (Supplementary Table 8 and Supplementary Fig. 4 online). For example, several MYC target proteins (HNRNPH2, ANXA6, ARL3, SNTB2, ILK, HNRNPM, FXYD6, NCL, GNAS, HNRNPDL, RBBP4, GPS1, SAE1, and RPA1) were significantly enriched in ACP (Fig. 7a). In addition, NCL, FXYD6, SNTB2, SAE1, PURA, RBBP4, HNRNPH2, and GNAS were enriched in ACP in both datasets as ARNT downstream targets. The proteins in PCP that were upregulated compared with those in ACP showed TFBS enrichment, including SF1, ERR1, NRF1, NRF2, ER, and ELK1 (Supplementary Table 9 online). In the case of ELK1, TOMM40, STOML2, MBP2K, MRPL13, TUFM, UQCRH, DIABLO, ZDHHC5, CLDN7, SEC24C, ACTR2, and TBC1D17 were observed to be significantly enriched with ELK1 targets in both proteomic datasets (Fig. 7b). Moreover, ERR1 was predicted to have 44 downstream target proteins.