KIF20A Expression Level Across Cancers
First, the Oncomine database was used to examine the levels of KIF20A mRNA across a wide variety of cancers. Compared to the expression in the corresponding control groups, the expression of KIF20A in cancers, including bladder, brain and central nervous system (CNS), cervical, colorectal, esophageal, gastric, head and neck, kidney, liver, lung, lymphoma, ovarian, and pancreatic cancers and sarcoma, was higher. Furthermore, two leukemia datasets and one breast cancer dataset revealed lower expression of KIF20A in cancer (Fig. 1A).
Then, we used R (v.3.6.3) to investigate RNA sequencing data from The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx) datasets to further assess KIF20A expression across cancers. As shown in Figure 1B, the KIF20A expression in tumor tissues was significantly higher than that in the corresponding normal tissues in adrenocortical carcinoma (ACC), bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC), cholangiocarcinoma (CHOL), colon adenocarcinoma (COAD), diffuse large B-cell lymphoma (DLBCL), esophageal carcinoma (ESCA), glioblastoma multiforme (GBM), head and neck squamous cell carcinoma (HNSC), kidney chromophobe (KICH), kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP), brain lower grade glioma (LGG), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), ovarian serous cystadenocarcinoma (OV), pancreatic adenocarcinoma (PAAD), prostate adenocarcinoma (PRAD), rectum adenocarcinoma (READ), skin cutaneous melanoma (SKCM), stomach adenocarcinoma (STAD), testicular germ cell tumors (TGCT), thyroid carcinoma (THCA), thymoma (THYM), uterine corpus endometrial carcinoma (UCEC), uterine carcinosarcoma (UCS) (all P<0.001) and pheochromocytoma and paraganglioma (PCPG) (P<0.01). However, KIF20A expression was significantly lower in acute myeloid leukemia (LAML) tissue (P<0.001) than in control tissue.
Furthermore, the Clinical Proteomic Tumor Analysis Consortium (CPTAC) dataset revealed that KIF20A protein expression was higher in primary tissues than in normal controls in clear cell renal cell carcinoma (RCC), LUAD and UCEC (Fig. 1C, P<0.001). Analysis of Human Protein Atlas (HPA) data also showed that the expression of KIF20A was higher in clear cell RCC, LUAD, and UCEC than in the respective normal tissues. Representative images are presented in Figure 1D.
In addition, we used the Gene Set Cancer Analysis (GSCA) (Fig. 1E) and GEPIA2 (Fig. 1F) websites to assess the relationship between KIF20A expression and pathological stage in cancers, including ACC, BRCA, CESC, ESCA, KICH, KIRC, KIRP, LIHC and LUAD (all P<0.05), but we did not assess this relationship in the other cancers mentioned above (data not shown).
Prognostic Value of KIF20A Expression in Cancers
We divided the cancer samples into two groups based on KIF20A expression: a high-expression-group and a low-expression-group. The prognostic value of KIF20A expression across cancers was investigated with the GEPIA website. As shown in Figure 2A-J, K and L, KIF20A overexpression was associated with poor overall survival (OS) in the ACC (P=1.1e-05), KIRC (4.9e-05), KIRP (9.3e-05), LGG (P=1e-09), LICH (P=0.0034), LUAD (P=0.00046), MESO (6.3e-07), PAAD (P=0.00019), SARC (P=0.022), USC (P=0.02) and UVM (P=0.011) TCGA datasets. However, low expression of the KIF20A gene was linked to poor OS prognosis for THYM (P=0.0072). Disease-free survival (DFS) analysis (Figure 2M-X) revealed that KIF20A expression was significantly correlated with DFS in twelve cancer types: ACC (P=0.004), BLCA (P=0.012), KIRC (P=7.5e-05), KIRP (P=2.6e-05), LGG (P=6.9e-05), LIHC (P=0.00065), LUAD (P=0.023), MESO (P=0.029), PAAD (P=0.0037), PRAD (P=0.001), SARC (P=0.0014) and UVM (P=0.012). According to these findings, KIF20A expression is correlated with the prognosis of patients with various tumors.
Genetic Alterations of KIF20A Across Cancers
Human malignancies arise as a result of a build-up of genetic mutations. As such, we wanted to examine the genetic alterations of KIF20A in human tumor tissues. Renal clear cell carcinoma showed the highest incidence of KIF20A amplification, with a frequency of 5%. Endometrial carcinoma had the highest rate of KIF20A alteration (>3%), with mutation being the most common alteration type (Fig. 3A). It is worth mentioning that all cases of ocular melanoma with genetic changes had KIF20A deletion (Figure 3A). The types, locations, and case numbers of the KIF20A genetic alterations are further detailed in Figure 3B. We discovered that KIF20A missense mutations were the most common form of genetic alteration, and R445H/C mutations were observed in one case of READ and two cases of UCEC (Fig. 3b). The location of the R445H/C mutation within the KIF20A protein 3D structure is shown in Figure 3C.
Immune Cell Infiltration Analysis
As important components of the tumor microenvironment (TME), tumor-infiltrating immune cells have been found to be related to cancer formation, development, and metastasis. We initially analyzed the relationships of KIF20A expression with the levels of several types of infiltrating immune cells, including B cells, T cells (including T helper 1 (Th1) cells, Th2 cells, CD8+ T cells and regulatory T (Treg) cells), endothelial cells, macrophages and neutrophils. KIF20A expression was linked with the levels of these infiltrating immune cells to varying degrees, as shown in Figure 4. Cancer-associated fibroblasts (CAFs) have been found to influence the function of different tumor-infiltrating immune cells in the stroma of the TME. Herein, we examined the potential relationship between the level of infiltrating CAFs and KIF20A gene expression in different TCGA cancer datasets using the TIMER2, MCP-COUNTER, and EPIC algorithms. We discovered a statistically significant positive association between KIF20A expression and the predicted level of infiltrating CAFs in ESCA, human papilloma virus-negative (HPV-) HNSC, KIRC, KIRP, LGG, LIHC, MESO, PCPG, THCA, UCS, and UVM and a significant negative correlation of KIF20A expression with the predicted levels of infiltrating CAFs in TGCT (Fig. 5A). Figure 5B shows a scatterplot of the data for the above tumors produced using one algorithm. For example, according to the TIDE algorithm, KIF20A expression was positively correlated with the level of infiltrating CAFs in ESCA (Rho=0.27, P=1.64e-04).
Analysis of KIF20A-related Genes
To further elucidate the molecular mechanisms by which the KIF20A gene participates in tumorigenesis, we identified genes associated with KIF20A expression and KIF20A-binding proteins and performed pathway enrichment analyses. Based on the findings, we obtained a total of 100 KIF20A-binding proteins using the STRING tool. The interaction network of the above proteins is shown in Figure 6A. Then, the TCGA pan-cancer KIF20A expression data were analyzed by the GEPIA2 tool. As seen in Figure 6B, KIF20A expression was positively correlated with that of the KIF2C (R=0.81), CDC25 (R=0.81), KIF23 (R=0.8), CCNB1 (R=0.8) and DLGAP5 (R=0.79) genes (all P <0.001). In the majority of the described cancer types, heatmap analysis revealed a positive correlation between KIF20A expression and the expression of the five genes with the highest correlation values (Fig. 6C).