Subjects and sample collection
The study was approved by the Xiangya Hospital of Central South University Institute Research Ethics Committee and informed consent was obtained from all participants. Patients with sporadic IAs confirmed by CTA or DSA from June 2020 to December 2020 were enrolled in our study. Exclusion criteria were as follows: 1) IAs associated with arteriovenous malformation; 2) IAs resulting from syndromic disorders such as Autosomal polycystic kidney disease, Ehlers-Danlos syndrome type IV, Loeys–Dietz syndrome, and Marfan syndrome[17]. Besides, health controls without IA from the general population and head trauma patients undergoing craniotomy were also enrolled during the same period. We recruited a total of 105 patients, in which 40 patients with sporadic IAs were assigned to the WES experiment group, 50 without IAs to the WES control group, 10 with IAs to the RNA-seq experiment group, 5 without IAs to the RNA-seq control group. General clinical information was collected, including sex, age, hypertension, hyperlipidemia, diabetes, smoking, and drinking. The rupture, maximum size, and number of IAs were also recorded.
Peripheral blood samples were collected from IA patients and healthy controls for WES, and were stored in ethylenediaminetetraacetic acid (EDTA) anticoagulant tubes with a temperature of -80 ℃. STAs were harvested from IA patients and head trauma patients (controls) during craniotomy surgery for RNA-seq, and were also stored at a temperature of -80 ℃.
WES and bioinformatic analysis
Genomic DNA was extracted from the blood samples of 40 IA patients and 50 healthy controls using a DNA RNA kit (E.Z.N.A). Library construction and WES were performed on the BGISEQ – 500 platform by the HuaDa Gene company (Shenzhen, China). The raw sequencing data were filtered to gain clean data. The sequence reads were aligned to the human reference genome GRCh37/hg19 using BWA. Duplication removing, realignment, variant calling, and annotation of the sequence reads were conducted with Picard, GATK, and SnpEff softwares.
The filtering criteria for single nucleotide variant (SNV) were as follows: 1) removing variants with frequency > 1% in the 1000G all genomes project[18]; 2) removing synonymous variants; 3) selecting deleterious variants by 2 or more prediction tools from SIFT, PolyPhen2, MutationAssessor, and MutationTaster; 4) selecting variants with statistically significant difference between 2 groups. To visualize the filtered SNV landscape, a waterfall plot and Manhattan plot were drawn using the R package “maftools” and “CMplot”, separately.
The filtered SNVs from WES and differentially expressed genes (DEGs) from RNA-seq were integrated to gain rare variants in DEGs. Hardy-Weinberg equilibrium testing was performed for the genotype of these variants in the SPSS software (version 26). We further analyzed the statistical difference of allele- and genotype- frequency and distribution for these variants between IA patients and the controls.
RNA-seq and bioinformatic analysis
Total RNA was extracted from the STA tissues of 10 IA patients and 5 head trauma patients using TRIzol (Invitrogen, Carlsbad, CA, USA). Library construction and RNA-seq were performed on the BGISEQ – 500 platform by the HuaDa Gene company (Shenzhen, China). Single-end 50 base reads were generated. The read quality was inspected using FastQC and MultiQC, and trimmed with Trimmomatic to yield clean reads. Then, for each gene, read counts were calculated, and expression values were further normalized using FPKM.
To visualize the disparity between two groups, principal components analysis (PCA) was performed using the R packages “factoextra” and “FactoMineR”. The gene expression difference analysis was conducted using the “limma” R package (P < 0.05 and log2-fold change > 0.6 or < -0.6). The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed to detect the function of DEGs.
The vascular cell types were analyzed using the R package “xCell”, which could convert gene expression profiles to enrichment scores of 64 immune and stromal cell types. The association analysis was performed between the vascular cells and target genes defined in both WES and RNA-seq. Endothelial cells (EC) had the highest correlation and were identified as target cells.
To identify endothelial-cell-related genes, the association between individual genes and endothelial-cell densities was quantified by weighted gene co-expression network analysis (WGCNA). A soft threshold β was set at 15 to attain a scale-free topology network. The genes in the module with the strongest correlation were selected (gene significance > 0.5 and module member > 0.8). We used these genes, together with the identified SNV genes to construct SNV-gene-regulated protein-protein interaction (PPI) network in ECs.
Validation with reverse-transcription quantitative polymerase chain reaction (RT-qPCR)
Two representative genes SPSN2 and KCNJ12 were selected and validated by RT-qPCR. The RNA left after RNA-sequencing was reverse transcribed to cDNA using RevertAid First Strand cDNA Synthesis Kit (ThermoFisher). ChamQ Universal SYBR qPCR Master Mix (Vazyme, China) and StepOne Real-time PCR system (Applied Biosystems) were used for RT-qPCR reaction. GAPDH served as the internal control. Reactions were repeated in triplicate for each sample. Expression levels were calculated using the 2-ΔΔCt method. Primer sequences were listed in Supplemental Table 1.
Statistical analysis
For continuous variables, Student’s t-test and Wilcoxon test were used to compare the two groups. For categorical variables, Chi-squared test and Fisher exact test were conducted to compare the two groups. Spearman correlation was performed to calculate correlation coefficients. Data were visualized using the R package “ggplot2”. Heatmaps were drawn based on the R package “pheatmap”. All statistical analyses were conducted using R software (version 4.0.2) or SPSS software (version 26.0). P < 0.05 was considered statistically significant.