Whole Exome Sequencing of Simultaneous Primary Multifocal Lung Cancer (SMPLC): Case Report and Bioinformatics Analysis

DOI: https://doi.org/10.21203/rs.3.rs-96042/v1

Abstract

Background: To understand the molecular mechanism of synchronous multifocal lung cancer (SMLC) is of great significance for the differential diagnosis of intrapulmonary metastasis (IM) and synchronous multiple primary lung cancer (SMPLC). Recently, next-generation sequencing (NGS) has become a useful tool for understanding SMLC.

Case presentation: In this study, two lesions of a 61-year-old man with lung cancer were detected by whole exome sequencing (WES) and the correlation between different lesions was analyzed at the molecular level. Lesion 1 was adenocarcinoma and lesion 2 was squamous cell carcinoma. Gene mutation and copy number variation (CNV) are different in the two lesions. The genome of lesion 2 is more unstable. The clonal evolution analysis showed that there was no obvious evolutionary relationship between the two lesions, and both lesions were independent double primary lesions. Bioinformatics analysis revealed that the alternate genes of the two lesions were inconsistent in function and pathway. PCA analysis was performed using the Cancer Genome Atlas (TCGA) database and the GTEx database, and it was found that the changed genes in these two lesions were significantly separated from the control group, and the changes of TP53 and EGFR genes in the TCGA database were further described.

Conclusions: These results indicate that NGS may provide new ideas for SMLC classification.

Background

Lung cancer is the leading cause of cancer-related deaths worldwide.1 A more prominent phenomenon is that the first symptom of more and more lung cancer patients is multinodular lesions, which is called SMLC. As we all know, patients with metastatic tumors have a late clinical stage, lose opportunities for surgery, and have a poor prognosis. Most of the SMPLCs are early stage lung cancers. Reports have shown that the 5-year survival rate of SMPLC in stage I can be as high as 75.8%.2 Therefore, the correct differential diagnosis of these multiple lesions is SMPLC or IM, determines the judgment of its clinical stage, the formulation of a reasonable treatment plan, and the evaluation of the patient’s prognosis, and the in-depth understanding of it. However, due to the high heterogeneity of lung cancers and the limitations of detection methods, the classification of SMLC has always been difficult, and its prognosis and pathogenesis are rarely studied.

Recently, the development of molecular pathology has provided many useful methods and tools for the research of SMLC. Researchers try to extract DNA from tumor cells for various detailed and precise analysis, in order to trace the root cause and determine the genetic origin of the tumor. With the use of NGS technology, we can do more than simply detect the mutation sites of known specific cancer genes.

In this study, we reported a case of SMPLC with different clinical features, histopathological morphology, and mutation sites of key genes. Through the whole exome sequencing of 2 lesions in this case, the molecular level differences between different lesions were compared, combined with the analysis of the public database. These results provide an idea for the identification of SMPLC.

Case Presentation

The patient is a 61-year-old man. Chest CT showed that there were 2 lesions in the lung, located in different lobes, with different pathological morphology, and no lymph node metastasis. The pathology identified it as SMPLC. Lesion 1 located in the right middle lung was histologically classified as adenocarcinoma (Fig. 1A). Lesion 2 located in the right lower lung was histologically classified as squamous cell carcinoma (Fig. 1B). Whole exome sequencing was applied for 2 lesions and 1 control (Shihe, Nanjing, China). The library was prepared with Hyper Prep Kit (Kapa) and sequenced on Hiseq 4000 NGS platform (Illumina). After the off-machine data is qualified, the somatic mutation and copy number variation between different lesions are analyzed.

The somatic mutation and CNV were analyzed basing on NGS data. Figure 2A showed the somatic mutations in lesion 1. There were 24 missense mutations, 11 frame shift deletions, 13 silent mutations and other mutations. Chrome 17 had the most mutation sites of six. In lesion 2, 56 of 97 mutations were missense mutations (Fig. 2B). Chrome 6 had the most mutation sites of ten. The mutation agreement rate between the two lesions was only 0.67%. In lesion 1, only TERT had a CNV of gain (Fig. 2C). In lesion 2, 8 genes had CNV of gain while 36 genes had CNV of loss (Fig. 2D). The CNV analysis of copy number variation showed that the genome of lesion 2 was more unstable.

GO and KEGG analysis was used for bioinformatic analysis of the alternated genes, including those with mutation or SNV. Figures 3 and 4 showed the GO and KEGG results of alternated genes from lesion 1 and 2. For cell component enrichment, the alternated genes most enriched in transcription regulator complex and chromosomal region (Fig. 3A and 4A). For biological process enrichment, the alternated genes most enriched in regulation of cellular response to growth factor stimulus and oocyte development (Fig. 3B and 4B). For molecular function enrichment, the alternated genes most enriched in protein serine/ threonine kinase activity and cell adhesion molecule binding (Fig. 3C and 4C). For KEGG pathway enrichment, the alternated genes enriched in hepatocellular carcinoma, gastric cancer and hippo signaling pathways in lesion 1 (Fig. 3D). And no pathway enriched with p value less than 0.05 in lesion 2 (Fig. 4D).

TCGA was a cancer genomics database including over 20,000 primary cancer and matched normal spanning 33 cancer types. We used PCA analysis to compare the alternated genes of lesion 1 and lesion 2 in LUAD/LUSC and control groups from TCGA database, GTEx database was used as a normal control. The 55 alternated genes in lesion 1 had a significantly different expression between LUAD cancer and control group (Fig. 5A). Meanwhile, the 140 alternated genes in lesion 2 had a significantly different expression between LUSC cancer and control group (Fig. 5B).

Figure 6 showed the interactions of gene alteration from lesion 1 and lesion 2. There was no interaction of CNV and mutation from lesion 1. And TP53 was the only interaction gene of CNV and mutation from lesion 2. In lesion 2, TP53 had both missense mutation and loss of CNV. There was no interaction of CNV from lesion 1 and lesion 2. And EGFR was the only interaction gene of mutation from lesion 1 and lesion 2.

We further analyzed the alterations of TP53 and EGFR in TCGA database. In LUAD, 74% and 60% samples had TP53 and EGFR gene alterations (Fig. 7A). Heterozygously and gained alterations were key alterations of TP53 and EGFR in LUAD. In LUSC, 92% and 68% samples had TP53 and EGFR gene alterations (Fig. 7B). Mutation and gained alterations were key alterations of TP53 and EGFR in LUSC. For TP53, wild type accounted for the most both in LUAD and LUSC (Fig. 7C). For TP53, missense accounted for the most mutations in LUAD (Fig. 7C). For EGFR, wild types accounts more than half in LUAD and LUSC (Fig. 7D). And for all detected mutations, missense accounts the most.

Discussion

At present, the most widely used differential diagnosis criterion is the clinicopathological criterion for defining multiple primary lung tumors first published by pathologists Martini and Melamed in 1975.3 Due to limited conditions, the standard can only depend on histological type. However, with the improvement of awareness, more and more defects are exposed by this standard. Lung cancer is highly heterogeneous. Sometimes there are two or three different tissue morphologies in the same lesion. On the other hand, even if the lesions show similar histological types, different molecular changes may occur in the lesions, and these changes are sufficient to indicate their independent origin. Another standard commonly used in clinical practice is the TNM classification system. Paradoxically, a series of studies analyzing multifocal lung tumors showed that SMPLC may have a good prognosis.4 Therefore, relying solely on the diagnostic criteria of histopathology may overestimate the staging of some patients, leading to incorrect staging and treatment. Although the 2007 version of the American College of Chest Physicians (ACCP) revised and optimized the Martini and Melamed standards with the addition of genetic and molecular information, the lack of high-level medical evidence standards makes the diagnosis, treatment and the prognosis of multifocal lung cancer generating very big controversy and conflicting opinions persist.5,6

EGFR is currently the most common lung cancer treatment target. The frequency of EGFR mutation was nearly 50% in LUAD patients and 9.9% in LUSC from Asia-Pacific.7 It’s found that in LUAD with known EGFR gene mutations, the surrounding normal lung epithelial cells also have consistent gene mutations.8 It is suggested that EGFR exists in the early occurrence of lung cancer, which can be used as an important reference for analyzing the source of lung cancer. Yatabe et al performed multi-layer slices and multiple samples of the same tumor on a large sample of LUAD, and detected EGFR gene mutations.9 They found that no case of lung cancer had multiple different EGFR mutations. At the same time, a paired study was conducted on the confirmed primary and metastatic lesions, and the results showed that there is a high degree of consistency between the primary and metastatic tumor tissue EGFR mutations. Regarding multiple lung adenocarcinomas, a study conducted EGFR molecular detection on the tumor tissues of four patients, and found that two of them had exactly the same mutation sites, while the other two were not consistent, reflecting the complexity of the origin of multiple lung tumors.10 Some scholars used comparative genome chip method (aCGH) and detection of somatic cell mutation (EGFR) to analyze the genomic change spectrum of lung cancer patients. The results showed that the consistent rate of copy number changes in the metastatic group was higher than that in the primary group (55.5% vs. 19.6) %, P = 0.04).11

TP53 is a famous tumor suppressor, and the mutation of TP53 is usually related to tumor occurrence, development and poor prognosis.12 Dual TP53 and EGFR mutations were found in 41% of NSCLC patients. And TP53 missense mutation would lead to significantly lower response rates and shorter PFS when receiving EGFR TKI therapy.13 However, the high frequency of TP53 in tumors limited the application of TP53 mutation in distinguishing SMLC. Recently, there have been studies using EGFR mutations to identify small samples of multifocal lung cancer. But the effect is not satisfactory due to common defects such as low mutation rate. Therefore, the detection of a single gene is far from meeting the needs of classification and traceability. The researchers identified 27 cases of SMPLC from 122 cases based on EGFR and K-Ras gene mutation analysis.14 However, there are also different opinions. Takamochi et al classified and analyzed the prognosis of 36 patients based on the mutations of EGFR and K-ras, and concluded that there was no difference in survival rates between two groups.15 Some researchers proposed to combine more genes to establish a differential expression mathematical model to differentiate and diagnose multiple primary and metastatic lung cancers, including p53, p16, p27 and c-erbB2.16 Another study conducted research on East Asian lung adenocarcinoma, and found that the probability of any one of the four genes of EGFR, k-ras, c-erbB2 and ALK mutations can add up to 90%.17,18

In this study, we used WES to reflect the various alternations in tumor tissue. We compared the two histologically different lesions and found they were different in mutations and CNVs. Among the alternated genes, we found EGFR and TP53 as two specific gene. We used TCGA database to further show the alternations of the two genes in the general population. Combine single patient sample and public data could provide more information. However, more samples are necessary to find the key point for identification of SMPLC.

Conclusion

Overall, this study summarizes a rare case of SMPLC involving adenocarcinoma and squamous cell carcinoma. We observed the histological features and sequenced the whole exome of the two lesions. Further analysis of gene mutation, CNV, clone evolution and KEGG pathway by bioinformatics method provides clues to better understand the intrinsic molecular characteristics of SMLC and to carry out accurate differential diagnosis.

Abbreviations

CNV: copy number variation; IM: intrapulmonary metastasis; NGS: next-generation sequencing; SMLC: synchronous multifocal lung cancer; SMPLC: synchronous multiple primary lung cancer; TCGA: the Cancer Genome Atlas; WES: total exome sequencing

Declarations

Ethics approval and consent to participate

This publication is approved by the Ethics Committee of the Affiliated Suzhou Hospital of Nanjing Medical University.

Consent for publication

Consent from the patient is obtained.

Availability of data and materials

Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.

Competing interests

The authors declare that they have no competing interests.

Funding

This work was supported by the funds of the Suzhou Science and Technology Bureau Project (sys2018084) and Talent Project of Jiangsu Province (WSN-256).

Authors’ contribution

Donglin Zhu carried the case analyses and wrote the manuscript. Ming Hong made histopathological sections and performed genetic analysis. Jinghuan Lv conceived of this study and drafted the manuscript. All authors read and approved the final manuscript.

Acknowledgment

Not applicable.

References

  1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2018;68(6):394-424.
  2. Yu YC, Hsu PK, Yeh YC, Huang CS, Hsieh CC, Chou TY, Hsu HS, Wu YC, Huang BS, Hsu WH. Surgical results of synchronous multiple primary lung cancers: similar to the stage-matched solitary primary lung cancers? ANN THORAC SURG 2013;96(6):1966-1974.
  3. Martini N, Melamed MR. Multiple primary lung cancers. J Thorac Cardiovasc Surg 1975;70(4):606-612.
  4. Guo H, Mao F, Zhang H, Qiu Y, Shen-Tu Y. [Analysis on the Prognostic and Survival Factors of Synchronous Multiple Primary Lung Cancer]. Zhongguo Fei Ai Za Zhi 2017;20(1):21-27.
  5. Kozower BD, Larner JM, Detterbeck FC, Jones DR. Special treatment issues in non-small cell lung cancer: Diagnosis and management of lung cancer, 3rd ed: American College of Chest Physicians evidence-based clinical practice guidelines. CHEST 2013;143(5 Suppl):e369S-e399S.
  6. Takamochi K, Oh S, Matsuoka J, Suzuki K. Clonality status of multifocal lung adenocarcinomas based on the mutation patterns of EGFR and K-ras. LUNG CANCER 2012;75(3):313-320.
  7. Han B, Tjulandin S, Hagiwara K, Normanno N, Wulandari L, Laktionov K, Hudoyo A, He Y, Zhang YP, Wang MZ, Liu CY, Ratcliffe M, McCormack R, Reck M. EGFR mutation prevalence in Asia-Pacific and Russian patients with advanced NSCLC of adenocarcinoma and non-adenocarcinoma histology: The IGNITE study. LUNG CANCER 2017;113:37-44.
  8. Tang X, Shigematsu H, Bekele BN, Roth JA, Minna JD, Hong WK, Gazdar AF, Wistuba II. EGFR tyrosine kinase domain mutations are detected in histologically normal respiratory epithelium in lung cancer patients. CANCER RES 2005;65(17):7568-7572.
  9. Yatabe Y, Matsuo K, Mitsudomi T. Heterogeneous distribution of EGFR mutations is extremely rare in lung adenocarcinoma. J CLIN ONCOL 2011;29(22):2972-2977.
  10. Chuang JC, Shrager JB, Wakelee HA, Neal JW. Concordant and Discordant EGFR Mutations in Patients With Multifocal Adenocarcinomas: Implications for EGFR-Targeted Therapy. CLIN THER 2016;38(7):1567-1576.
  11. Arai J, Tsuchiya T, Oikawa M, Mochinaga K, Hayashi T, Yoshiura K, Tsukamoto K, Yamasaki N, Matsumoto K, Miyazaki T, Nagayasu T. Clinical and molecular analysis of synchronous double lung cancers. LUNG CANCER 2012;77(2):281-287.
  12. Leroy B, Anderson M, Soussi T. TP53 mutations in human cancer: database reassessment and prospects for the next decade. HUM MUTAT 2014;35(6):672-688.
  13. Labbe C, Cabanero M, Korpanty GJ, Tomasini P, Doherty MK, Mascaux C, Jao K, Pitcher B, Wang R, Pintilie M, Leighl NB, Feld R, Liu G, Bradbury PA, Kamel-Reid S, Tsao MS, Shepherd FA. Prognostic and predictive effects of TP53 co-mutation in patients with EGFR-mutated non-small cell lung cancer (NSCLC). LUNG CANCER 2017;111:23-29.
  14. Liu M, He W, Yang J, Jiang G. Surgical treatment of synchronous multiple primary lung cancers: a retrospective analysis of 122 patients. J THORAC DIS 2016;8(6):1197-1204.
  15. Takamochi K, Oh S, Matsuoka J, Suzuki K. Clonality status of multifocal lung adenocarcinomas based on the mutation patterns of EGFR and K-ras. LUNG CANCER 2012;75(3):313-320.
  16. Chen D, Mei L, Zhou Y, Shen C, Xu H, Niu Z, Che G. A novel differential diagnostic model for multiple primary lung cancer: Differentially-expressed gene analysis of multiple primary lung cancer and intrapulmonary metastasis. ONCOL LETT 2015;9(3):1081-1088.
  17. Li C, Fang R, Sun Y, Han X, Li F, Gao B, Iafrate AJ, Liu XY, Pao W, Chen H, Ji H. Spectrum of oncogenic driver mutations in lung adenocarcinomas from East Asian never smokers. PLOS ONE 2011;6(11):e28204.
  18. Sun Y, Ren Y, Fang Z, Li C, Fang R, Gao B, Han X, Tian W, Pao W, Chen H, Ji H. Lung adenocarcinoma from East Asian never-smokers is a disease largely defined by targetable oncogenic mutant kinases. J CLIN ONCOL 2010;28(30):4616-4620.