Datasets
All data, including colon cancer gene expression profiles and clinical data, were downloaded from the TCGA website through the Genomic Data Commons Data Portal (https://portal.gdc.cancer.gov/) up to March 2021 [19].Patients with missing survival information were not included in the subsequent steps of the analysis.
Identification Of M6a Related Lncrnas
We extracted the expression matrix of 23 m6A RNA methylation regulators based on a previous study, including writers (METTL3, METTL14, METTL16, WTAP, VIRMA, RBM15, RBM15B, and ZC3H13), erasers (FTO and ALKBH5), and readers (YTHDC1, YTHDC2, IGF2BP1, IGF2BP2, IGF2BP3, YTHDF1, YTHDF2, YTHDF3, HNRNPC, LRPPRC, HNRNPA2B1, FMR1, and RBMX)[20].N 6-methyladenosine-related gene expression data were extracted with the limma package in R software. Next, co-expression analysis (with | Pearson’s R| > 0.6 and p < 0.001) was carried out to identify the correlation among N 6-methyladenosine-related gene expression and lncRNAs. By this method, we acquired the expression data of N 6-methyladenosine-related lncRNAs through the limma package in R software. Moreover, a network plot was depicted via the igraph package to visualize the correlation. Expression data of N 6-methyladenosine-related lncRNAs were combined with clinical survival data via the limma package. Prognostic-related lncRNAs were extracted, and the confidence interval and hazard ratio were calculated by the survival package. Visualization of univariate Cox regression analysis was achieved by drawing forest plots. The differential expression data of prognosis-related N 6-methyladenosine-related lncRNAs between tumor and normal tissues were acquired through the limma package, pheatmap package, reshape2 package and ggpubr package in R software. P value < 0.05 was considered statistically significant. We drafted heatmaps and boxplots.
The Role Of N 6-methyladenosine-related Lncrnas
First, we sorted prognosis-related N 6-methyladenosine-related lncRNAs into two subtypes [21–22], cluster 1 and cluster 2, via ConsensusClusterPlus and limma packages according to the expression of lncRNAs. The calculation method was clusterAlg = “km” and clusterNum = “2”. We executed survival analysis according to lncRNA subtypes via survminer and survival packages to evaluate the prognostic value of N 6-methyladenosine-related lncRNAs. To identify differentially expressed prognosis-related lncRNAs in each cluster and analyze the relationship between lncRNAs and clinicopathological parameters, the pheatmap package was utilized, and a heatmap was depicted. Differential expression of target genes in related subtypes and different types of tissue was identified via the limma package. To clarify the correlation between the target genes and prognostic N 6-methyladenosine-related lncRNAs in colon cancer, gene correlation analysis was conducted via the limma package. P value < 0.05 was considered statistically significant.
The Role Of Immune Cell Infiltration And The Tumor Microenvironment
To explore and calculate the infiltration level of different immune cell types in the samples, the preprocessCore, limma and e1071 packages were utilized, and the relative number infiltrating immune cells was obtained. The analysis of the tumor microenvironment was carried out via the estimate and limma packages, and the stromal score, immune score and estimation core were determined. The above scores were inversely related to tumor purity. Differential analysis of immune cell infiltration in different clusters was carried out by the limma package, and vioplot was used to visualize the differences. Additionally, the infiltration level of each type of immune cell in different clusters of colon cancer was analyzed via the limma package, and a boxplot was drawn. Immediately, we conducted differential analysis of the tumor microenvironment in different subtypes of samples by the limma package according to the immune score, estimate core and stromal score, and corresponding boxplots were generated to further explore the purity of tumor cells in different types.
N 6-methyladenosine-related Lncrna Prognostic Model
Lasso regression was carried out to construct a prognostic model. All samples were divided into a high-risk group and a low-risk group according to the median value of the risk score of prognostic N 6-methyladenosine-related lncRNAs. Training (50%) and test (50%) groups were used for the lasso regression, and related plots were obtained. The survival curves of different groups were depicted, in which high-risk and low-risk groups were compared. To evaluate the accuracy of our model for predicting the survival of patients with the disease, a corresponding ROC curve was obtained via the time ROC package. A risk curve was generated in line with the risk score, survival status and risk associated with N 6-methyladenosine-related lncRNAs. Independent prognostic analysis was conducted to evaluate whether our model was independent of other clinical prognostic factors that affect patient outcomes. Multivariate and univariate analyses were carried out, and the hazard ratio was calculated. Model validation for clinical groups was utilized to test and verify whether our model could be applied to different clinical groups. Risk and clinical correlation analysis was carried out to clarify related high-risk and low-risk N 6-methyladenosine-related lncRNAs and expound the correlation of clinical characteristics and our prognostic risk model. A heatmap was generated via the pheatmap package and limma package. Boxplots of risk and clinical relevance were obtained to evaluate the relationship between risk and the clinical data. Genetic differential analysis was performed to assess the expression difference of target genes in different risk groups in our model in colon cancer. Correlation analysis of risk and immune cells was conducted to evaluate the relationship between immune cells and the risk score. A scatter plot was generated to visualize this nexus and assess whether immune cells were beneficial or detrimental.