Patients’ general characteristics
In training set, we collected 58 pairs of blood samples. Samples of patients diagnosed with stage III lung cancer were excluded according to postoperative pathology. Rejecting contaminated genome and unqualified samples, a total of 42 pair samples were sequenced after being checked qualified. The patients’ general characteristics is shown in the table (Table 1).
There were 27 lung cancer and 15 benign disease patients in training set, in which there were 18 male and 9 female patients in lung cancer group, and 10 male and 5 female patients in benign disease group. The average age of the two groups were 63.1±8.9 and 52.3±10.7, respectively(P<0.05). The number of stage I and stage II lung cancer patients were 19 (45.2%) and 8 (19.0%), respectively. Grouping by tumor size, there were 18 lung cancer patients (42.9%) and 11 benign desease patients (26.2%) in the group with a diameter less than 3 cm, while those with a diameter of 3 to 5 cmwere 9 (21.4%) and 4 (9.5%). There was no significantly statistical difference (P>0.05) in the general character between the two groups including gender, smoking history and tumor size. The data of two groups are comparable.
The pathology of lung cancer includes adenocarcinoma (18, 42.9%), squamous carcinoma (7, 16.7%) and other types (2, 4.8%) that were big cell carcinoma and sarcomatoid carcinoma. The histology of most lung cancer pateints was adenocarcinoma. In benign lung disease, the postopreative pathology was tuberculosis (3, 7.1%), inflammatory pseudotumor (9, 21.4%), hamartoma (2, 4.8%) and sclerosing hemangioma (1, 2.4%). The number of stage IA, IB, IIA and IIB lung cancer was 5 (11.9%), 14 (33.3%), 6 (14.3%) and 2 (4.8%), respectively. There was no significant difference in all general data (P<0.05) except in age .
Biomarkers detection
Blood biomarkers including CEA, NSE and CYFRA-211 were detected before surgery. In training set, the average vaules of these biomarkers were 3.04±1.64, 22.96±17.04, and 4.54±8.18 ng/ml in lung cancer, and 1.85±0.92, 20.20±7.02, and 1.66±0.86 ng/ml in benign disease, respectively, and there were no significant differences (P>0.05 for each comparison) in the two groups. In validation set, the average vaules of these biomarkers were 2.47±1.30, 14.23±4.90, 2.23±1.00 ng/ml in lung cancer and 1.72±1.02, 15.46±4.62, 2.18±0.82 ng/ml in benign disease groups, and there were no significant differences (P>0.05) in the two groups (Figure 1).
Positive staining of the biomarkers was judged according to the range of clinical reference values. In training set, there were 3/17 and 0/12 CEA positive, 8/16 and 6/12 NSE positive, and 2/13 and 0/12 CYFRA-211 positive patients respectively in lung cancer and benign disease group (P>0.05, respectively in three biomarkers). In validation set, there were 1/18 and 0/8 CEA positive, 4/18 and 1/8 NSE positive, and 3/18 and 1/8 CYFRA-211 positive patients respectively in lung cancer and benign disease group (P>0.05, respectively in three biomarkers) (Table 2).
Cell free DNA detected
Cell free DNA (cfDNA) has been detected in all of the 42 samples in training set. The concentration of cfDNA in lung cancer group and benign disease group respectively was 0.2-3.04 ng/μl and 0.21-1.25 ng/μl, average concentration respectively was 0.53±0.66 ng/μl and 0.54±0.29 ng/μl. The statistical differences was not significant in the two group (P>0.05).
Somatic mutation analysis
The number of mutated genes by targeted sequenced was total 246 in lung cancer and benign disease groups in training set. There were total 522 somatic mutations in the two groups including 374 somatic mutations dectected in lung cancer group and 148 mutations in benign disease group.(Figure 2)
Most of the mutations were single nucleotide variations (SNVs). In lung cancer group, there were 348 nonsynonymous mutations including 347 missense mutations and only one deletion mutation (InDel) in all 374 somatic mutations. In benign disease group, there were 136 nonsynonymous mutations (missense mutations) in all 148 somatic mutations. The average number of somatic mutation respectively was 13.85±7.25 and 9.87±4.27 in two group, and the missense mutation respectively was 12.85±7.08 and 9.07±3.84. The statistical differences was not significant between the two group (P>0.05). (Figure 2)
Analyzing the number of missense mutations in general characteristics of the two groups including the patients’ gender, age, smoking history, tumor size, pathology, stage and N1 station lymphatic metastasis. There were no statistically significant differences in the comparison, respectively (P>0.05).(Figure 3)
CtDNA detected by targeted NGS
Sequenced 42 samples were divided into lung cancer and benign disease group according to postoperative pathology. There were 27 samples in lung cancer group and 15 samples in benign disease control group. Somatic mutations were detected in both two groups. In total 246 genes mutations were detected in the training set (Figure 4). The number of missense mutations of LRP1B, KMT2D, RNF213 and CSMD3 gene was 8 (8/27, 29.6%), 7 (7/27, 25.9%), 7 (7/27, 25.9%) and 6 (6/27, 22.2%) in lung cancer group, and none of these genes were detected in benign disease control group. The mutation sites of the four genes were shown in the Table 3. RNF213 gene mutations located at exon 17, 24, 26, 29 and 59 of chromosome 17. KMT2D gene mutations located at exon 4, 11, 16, 34, 38 and 41 of chromosome 12. CSMD3 gene mutations located at exon 10, 14, 20, 54 and 59 of chromosome 8. LRP1B gene mutations located at exon 2, 8, 16, 51, 83 and 89 of chromosome 2 (Table 3).
LRP1B, KMT2D, RNF213 and CSMD3 gene mutation in lung cancer were more than that in benign disease, and there were significant statistical differences between the two groups (P<0.05). The WHSC1 gene mutated in 10 samples of lung cancer and 8 samples of benign disease, respectively. The GDNF gene mutated in 7 samples in lung cancer and 1 samples in benign disease. There were less than 5 mutations of other genes detected in either one of the two groups. There were no significant statistical differences of these genes in two groups (P>0.05).
Immunohistochemical results
In training set, RNF213, LRP1B, KMT2D and CSMD3 genes were considered to be statistically significant difference in the analyzing of sequenced data in lung cancer compare to benign disease. Immunohistochemistry (IHC) was performed on specimens of 27 lung cancer and 14 benign disease FFPE tissues to detect the expression of RNF213, LRP1B, KMT2D and CSMD3. IHC was not carried out in one sample (B23) because the tissue was too little. After staining, taking photographs and evaluating, we show the representative illustrations of expressions of RNF213, KMT2D, CSMD3 and LRP1B in Figure 5. High and low expressions of the four genes were summarized (Figure 5). High expressions of RNF213, KMT2D and CSMD3 were observed in lung cancer tissues, and low expressions of these genes were observed in benign disease tissues. There were significant statistical differences of these genes between the two groups (P<0.05), especially RNF213 (P<0.005). Low expression of LRP1B was observed in 26 lung cancer tissues and in 14 benign disease tissues. One of the lung cancer samples was high expression. This result was not significantly different (P>0.05).
Validation set results
There were 28 lung nodule patients enrolled the validation set without a definite diagnosis through the chest CT. The largest diameter of the lesion was all less than 3 cm in CT imaging. Malignant or benign nodules could not be confirmed in CT imaging and biomarkers. Blood samples were targeted sequenced in the same method. The number of the test genes RNF213, KMT2D, CSMD3 and LRP1B detected in lung cancer samples was 5, 5, 3 and 2 respectively. Twenty samples were confirmed to be lung cancer and 8 samples were benign nodules with postoperative pathology. RNF213 gene mutated in 25% lung cancer patients. KMT2D, CSMD3 and LRP1B genes mutated in 15%, 10% and 10% lung cancer patients, but KMT2D and CSMD3 genes were detected in 25% and 12.5% benign diseases (Figure 6).
An analysis of GGO and GGN in all samples
We put all sequenced data of the 70 samples in an analysis to determine the veracity of the result. There were 55 patients diagnosed GGO or GGN in chest CT including 36 early stage lung cancer and 19 benign disease. We detected RNF213 gene mutation in 10 (10/36, 27.8%) lung cancer samples and no samples in benign disease (P<0.05). All of these somatic mutations were missense mutation. The specificity of RNF213 gene mutation was 100% in diagnosis of GGO and GGN, and its sensibility was 27.8%.