Microarray data information
High-throughput gene expression data of patients with NSCLC were obtained from microarray dataset (GSE50081 and GDS3837) in Gene Expression Omnibus (GEO, https://www.ncbi.nlm.nih.gov/geo/) [16] . The GEO database is an international public repository that archives and freely distributes high-throughput gene expression and other functional genomics data sets [17].
In GSE50081 datasets, the expression profiling was performed on RNA from frozen, resected tumor tissues corresponding to 181 samples of patients in NSCLC. All the gene expression data derived from the Affymetrix Human Genome U133 Plus 2.0 Array platforms. All designs and quality control of the microarray experiment and data normalization were in line with the standard Affymetrix protocols. Clinical classification of these NSCLC patients was staged according to the American Joint Committee on Cancer (AJCC) clinical grade or Union for International Cancer Control (UICC) TNM system (8th edition) [18].
In GDS3837 datasets, there were 60 paired primary NSCLC tumor and adjacent normal lung tissue specimens obtained from nonsmoking female NSCLC patients.
Analysis the expression of SCN4B in NSCLC patients
The main differentially expressed genes (DEGs) of 60 paired NSCLC patients were analyzed by comparing the primary NSCLC tumor with adjacent normal lung tissue specimens extracted from dataset GDS3837. The Cut-off values was p < 0.05 and the absolute value of the log2 fold change ≥ 1.5 [19].
Immunohistochemistry staining validation of the SCN4B expression in lung cancer using The Human Protein Atlas database
Immunohistochemistry staining of the expression of SCN4B in normal lung tissue and lung cancer tissue was explored using The Human Protein Atlas (HPA, http://www.proteinatlas.org) and the key word for search strategy was “SCN4B” [20]. The HPA was a valuable tool constituting for researchers studying protein localization and expression in human tissues and cells [21].
Analysis the correlations of SCN4B expression between DNA promoter methylation
The analysis to confirm the relationship between SCN4B expression and DNA promoter methylation was performed using of MEXPRESS [20]. MEXPRESS was an online database for the integration and visualization of gene expression, DNA methylation and clinical data [22]. By default, the SCN4B expression value was selected in order of samples which contain lung tumor and normal tissue samples. The Pearson correlation analysis was then used to calculate the difference of SCN4B expression value between DNA promoter methylation data [20].
Analysis the expression of SCN4B at different clinical classification and characteristics
In GSE50081 dataset, 181 NSCLC patients were divided into different groups according their AJCC clinical grade, TNM stage, gender and age, respectively. We then compared the different expression level of SCN4B within these groups respectively using the Fisher exact or Wilcoxon rank-sum test [23]. Statistical significance was set at 0.05.
In addition, we validated the expression of SCN4B at different stage in lung cancer via Gene Expression Profiling Interactive Analysis (GEPIA, http://gepia.cancer-pku.cn/), which was a a web-based tool provides key interactive and customizable functions including correlation analysis and patient survival analysis [24].
The Kaplan–Meier survival analysis of SCN4Bhigh and SCN4Blow group in NSCLC
The Kaplan–Meier survival analysis was used to estimate the effects of SCN4B expression on the OS of NSCLC. Patients with SCN4B expression values above the median for all NSCLC patients were classified as SCN4Bhigh group, and the others were considered to be SCN4Blow group. The difference of OS and survival status between low or high SCN4B expression group was assessed by log-rank test with R package “survival” [25]. A P value less than 0.05 was identified as significant.
Multivariate analysis and univariate analysis of the prognostic value of SCN4B expression in NSCLC
The Cox regression model was used to conduct multivariable and univariate survival analyses. Multivariate Cox analysis was used to compare the influence of SCN4B expression, along with clinical characteristics including clinical grade, TNM stage, OS, survival status, age and gender. All these clinical characteristics were entered in the multivariate Cox regression analysis as categorical variables, which set p value ≤ 0.05. The cut-off value of SCN4B expression was set based on the best separation [25]. Statistical significance for a two-tailed test was set at 0.05. Univariate Cox analysis was conducted to compare the influence of SCN4B expression and above clinical characteristics on OS and survival status, respectively.