Patients and tissue samples. Patients with SMGGNs who underwent surgical treatment in the Department of Thoracic Surgery of the Fourth Hospital of Hebei Medical University from January 2018 to January 2021 were selected as research subjects. Gene sequencing analysis of formalin-fixation and paraffin-embedding (FFPE) samples surgically resected was performed using NGS technology to analyse the mutation statuses of EGFR, KRAS, ERBB2, HER2, BRAF, ALK, ROS1, MET, RET, PIK3CA, KIT, ESR1, PDGFRA, DDR2, HRAS, and NRAS. On this basis, whole-genome sequencing was conducted on suspected tumour samples of the same origin infer the origin of multiple lesions through molecular biology methods.
Inclusion criteria were as follows: (1) patients with at least 2 SMGGNs resected, (2) those who did not receive neoadjuvant chemotherapy or any other anti-tumour treatment before surgery, (3) those with complete medical records and imaging data, and (4) those who signed the informed consent.
This study was approved by the Ethics Committee of the Fourth Hospital of Hebei Medical University (Ethics Approval No.: 2021ky103).
Molecular-biological analysis. All patients underwent postoperative tumour–node–metastasis (TNM) staging per the TNM Staging of Lung Cancer (8th Edition) released by the International Union against Cancer (UICC) [29]. All samples were examined and histologically classified by two pathologists according to the Histological Classification of Lung Tumors published by the World Health Organization (WHO) in 2015 [30]. Genomic analysis was carried out in the Laboratory of Pathology Department of the Fourth Hospital of Hebei Medical University.
Detection of lung cancer-related genes. At least 30 ng of DNA was extracted from each FFPE tumour sample using a DNA extraction kit (QIAamp DNA FFPE Tissue Kit; Qiagen, Hilden, Germany) following the manufacturer's instructions. Next, the DNA concentration was determined by Qubit dsDNA assay. DNA was cleaved by a Covaris M220, followed by end repair, phosphorylation, and linker ligation. Fragments of 200–400 bp in length were selected by magnetic beads (Agencourt AMPure XP Kit, Beckman Coulter, California, USA), then hybridized with a capture probe. Later, magnetic beads were applied for hybrid selection and amplification by polymerase chain reaction (PCR). The quality and size of DNA fragments were measured using a biological analyser with high sensitivity, and the samples were sequenced. Paired-end reads were completed using the MiSeq DX sequencer (Illumina, Inc., California, USA). The genetic maps of captured tissue samples were subjected to targeted deep sequencing using the lung cancer 16-gene panel, involving a 76-kb human genome region (kit: Burning Rock Dx, Lung cure 16-gene panel). The carcinogenic driver mutations of EGFR, KRAS, ERBB2, HER2, BRAF, ALK, ROS1, MET, RET, PIK3CA, KIT, ESR1, PDGFRA, DDR2, HRAS, and NRAS were detected by the 16-gene panel. The sequencing depth of the FFPE samples was 1000×.
Whole-exome sequencing. For patients with suspected SMGGNs of the same origin, all nodules were further analysed by whole-exome sequencing.
1. DNA sample detection
DNA concentrations were accurately quantified by Qubit 3.0, and DNA samples with a concentration ≥ 20 ng/µL and a total amount > 0.4 µg were used for the library construction.
2. Library construction
Genomic DNA was randomly broken up into fragments of 180–280 bp in length using the Covaris crusher. Library construction and capture experiments were conducted using the Agilent SureSelect Human All Exon V5/V6 kit. Following end repair, phosphorylation, and addition of A tails, the two ends of the fragments were ligated with a linker to construct the DNA library. The library with a specific index was hybridized with up to 543,872 biotin-labelled probes in the liquid phase, and 334,378 exons of 20,965 genes were captured using streptavidin magnetic beads. After linear amplification by PCR, the quality of the library was tested, and the qualified DNAs were sequenced.
3. Quality control of the library
After library construction, DNAs were preliminarily quantified using Qubit 3.0. Then, the insert size of the library was detected using Agilent 2100. After the insert size complied with expectations, the effective concentration (3 nmol/L) of DNA in the library was accurately quantified by quantitative PCR to ensure the library quality.
4. On-board sequencing
After the library’s quality was confirmed, Illumina HiSeq PE150 sequencing was carried out in light of the effective concentration and data output requirements of the library.
Statistical analysis. GraphPad Prism 9.4.0 software was adopted for statistical analysis and visualization. Enumeration data are expressed as [n (%)] and were compared between groups by the χ2 test. Measurement data conforming to a normal distribution are expressed as mean ± standard deviation (‾x ± s) and were compared between groups by Student’s t-test. Non-normally distributed data are described as M (P25, P75), and the rank sum test was run to compare them between groups. The significance level was p = 0.05 (two-sided).