Background: There were scarcely germline variants of familial lung cancer (LC) identified. We conducted an study with whole-exome sequencing of pedigrees with familial lung cancer to analyze the potential genetic susceptibility.
Methods: Probands with the highest hereditary background were identified by our large-scale epidemiological study and five ones were enrolled as a learning set. The germline SNPs (single-nucleotide polymorphisms) of other five similar probands, four healthy individuals in the formerly pedigrees and three patients with sporadic LC were used as a validation set, controlled by three healthy individuals without family history of any cancer. The network of mutated genes was generated using STRING-DB and visualized using Cytoscape.
Results: Specific and shared somatic mutations and germline SNPs were not the shared cause of familial lung cancer. However, individual germline SNPs showed distinct protein-protein interaction network patterns in probands versus healthy individuals and patients with sporadic lung cancer. SNP-containing genes were enriched in the PI3K/AKT pathway. These results were validated in the validation set. Furthermore, patients with familial lung cancer were distinguished by many germline variations in the PI3K/AKT pathway by a simple SVM classification method. It is worth emphasizing that one person with many germline variations in the PI3K/AKT pathway developed lung cancer during follow-up.
Conclusions: The phenomenon that the enrichments of germline SNPs in the PI3K/AKT pathway might be a major predictor of familial susceptibility to lung cancer.
Figure 1
Figure 2

Figure 3
This is a list of supplementary files associated with this preprint. Click to download.
Supplementary Methods:Support vector machine (SVM) classification
Supplementary Figure S1: Protein-protein interaction (PPI) network constructed using the shared germline mutated genes of the five LC probands. Disconnected genes were removed from the graph.
Supplementary Figure S2: Bioinformatic workflow of next-generation sequencing data processing. FET = Fisher exact test.
Supplementary Table S1: The questionnaire of the living environment
Supplementary Table S2: The demographic and histologic of probands
Supplementary Table S3: Summary of sequencing reads and mapping of the reads
Supplementary Table S4: Somatic mutations of probands
Supplementary Table S5: Shared germline mutations of probands
Supplementary Table S6: Top 10 KEGG pathways of the non-synonymous mutated genes of the individuals
Loading...
Posted 22 Oct, 2020
On 14 Oct, 2020
On 13 Oct, 2020
On 06 Oct, 2020
Received 01 Oct, 2020
Received 19 Sep, 2020
On 16 Sep, 2020
Invitations sent on 09 Sep, 2020
On 09 Sep, 2020
On 12 Aug, 2020
On 11 Aug, 2020
On 11 Aug, 2020
On 13 Jul, 2020
Received 12 Jul, 2020
On 24 Jun, 2020
On 22 Jun, 2020
On 22 Jun, 2020
Received 26 Apr, 2020
On 22 Apr, 2020
On 27 Mar, 2020
Invitations sent on 27 Mar, 2020
On 24 Mar, 2020
On 24 Mar, 2020
Posted 22 Oct, 2020
On 14 Oct, 2020
On 13 Oct, 2020
On 06 Oct, 2020
Received 01 Oct, 2020
Received 19 Sep, 2020
On 16 Sep, 2020
Invitations sent on 09 Sep, 2020
On 09 Sep, 2020
On 12 Aug, 2020
On 11 Aug, 2020
On 11 Aug, 2020
On 13 Jul, 2020
Received 12 Jul, 2020
On 24 Jun, 2020
On 22 Jun, 2020
On 22 Jun, 2020
Received 26 Apr, 2020
On 22 Apr, 2020
On 27 Mar, 2020
Invitations sent on 27 Mar, 2020
On 24 Mar, 2020
On 24 Mar, 2020
Background: There were scarcely germline variants of familial lung cancer (LC) identified. We conducted an study with whole-exome sequencing of pedigrees with familial lung cancer to analyze the potential genetic susceptibility.
Methods: Probands with the highest hereditary background were identified by our large-scale epidemiological study and five ones were enrolled as a learning set. The germline SNPs (single-nucleotide polymorphisms) of other five similar probands, four healthy individuals in the formerly pedigrees and three patients with sporadic LC were used as a validation set, controlled by three healthy individuals without family history of any cancer. The network of mutated genes was generated using STRING-DB and visualized using Cytoscape.
Results: Specific and shared somatic mutations and germline SNPs were not the shared cause of familial lung cancer. However, individual germline SNPs showed distinct protein-protein interaction network patterns in probands versus healthy individuals and patients with sporadic lung cancer. SNP-containing genes were enriched in the PI3K/AKT pathway. These results were validated in the validation set. Furthermore, patients with familial lung cancer were distinguished by many germline variations in the PI3K/AKT pathway by a simple SVM classification method. It is worth emphasizing that one person with many germline variations in the PI3K/AKT pathway developed lung cancer during follow-up.
Conclusions: The phenomenon that the enrichments of germline SNPs in the PI3K/AKT pathway might be a major predictor of familial susceptibility to lung cancer.
Figure 1
Figure 2

Figure 3
This is a list of supplementary files associated with this preprint. Click to download.
Supplementary Methods:Support vector machine (SVM) classification
Supplementary Figure S1: Protein-protein interaction (PPI) network constructed using the shared germline mutated genes of the five LC probands. Disconnected genes were removed from the graph.
Supplementary Figure S2: Bioinformatic workflow of next-generation sequencing data processing. FET = Fisher exact test.
Supplementary Table S1: The questionnaire of the living environment
Supplementary Table S2: The demographic and histologic of probands
Supplementary Table S3: Summary of sequencing reads and mapping of the reads
Supplementary Table S4: Somatic mutations of probands
Supplementary Table S5: Shared germline mutations of probands
Supplementary Table S6: Top 10 KEGG pathways of the non-synonymous mutated genes of the individuals
Loading...