Transcriptional characterization of vascular smooth muscle cells
To identify regional-specific transcriptional identities in vSMCs populations, we performed single cell RNA sequencing (scRNA-seq) in 3 distinct sites: carotid arteries, aortic arch (ascending thoracic) and descending (thoraco-abdominal) aorta (Figure 1a and 1b). The rationale for selecting these sites was the combination of embryonic origin, hemodynamics, and disease emergence. The aortic root was excluded from the evaluation. The collected aortic arch region extended until the left subclavian branch, in concordance with the limits of neural crest origin5. In total, we sequenced between 3,732 to 4,632 cells per site with 72,685 to 91,597 average reads per cell (Supplementary Figure 1,2; Supplemental Table 2). Quality assessment of each of the three libraries was performed and those cells with low RNA counts and high levels of mitochondrial RNA were removed from the dataset prior to downstream analysis (Supplementary Figure 1,2).
To identify common and regional signatures, the samples were merged and analyzed using Seurat (Figure 1c). This analysis identified 11 distinct cell clusters of varying vessel composition (Figure 1d). To determine the cellular constituency present in the 11 cell clusters, expression of classical cell-type markers was applied identifying six distinct cell populations (Figure 1e, 1f, Supplemental Figure 1, Supplemental Table 3). Vascular smooth muscle cells (vSMCs) represented the predominant cell type in all samples with the carotid library showed the greatest degree of diversity as 32% of the carotid-identified cells were non-vSMC (Supplemental Table 3).
Populations of vSMCs were computationally extracted from the other cell types for in-depth analysis and comprised 1,740 to 3,336 cells per anatomic location (Supplementary Figure 2, Supplemental Table 4). Visualization of the data using t-Distributed Stochastic Neighbor Embedding (tSNE) highlighted the global uniqueness of each population of vSMC per anatomical region, as the data segregated into three distinct mostly contiguous but not overlapping groups that matched the referred anatomical location (Figure 2a, b, Supplementary Excel File 1).
From this cell population 18,940 transcripts were quantified and assessed for site-enriched expression. In this analysis, 1045 transcripts (5.5%) showed unique anatomic location-enriched expression (Figure 2c, Supplementary Excel File 1). Importantly, while vSMCs populations contained these regional-enriched expression patterns, their core identity as per expression of classical vSMC markers– Myh11, Acta2, and Tagln, were unchanged (Supplemental Figure 3).
Molecular drivers of the transcript regional specification were further evaluated by plotting their relative log fold change against the frequency of transcript-producing cells (Figure 2d and Supplementary Excel File 2). The predominant pattern was defined by relatively equal contributions of increased transcript production per cell as well as increased number of transcripts producing cells in a specific region (Figure 2d). For a smaller portion of transcripts, site-enrichment was driven by increased transcript counts per positive vSMC (fold change). This is exemplified by carotid enriched Btg2, which shows 1.5 - log fold increase in transcript quantity per cell in carotid vSMCs, relative to aortic arch and descending aorta (Figure 2d and Supplemental Figure 3). Some site-specific transcripts were characterized by differential increases in the frequency of cells expressing the transcript (cell enrichment). This expression pattern is exemplified by the HOX genes Hoxb3os (carotid) and Hoxa7 (descending aorta) which show between 1.7 to 2.5-fold increases in transcript expressing cells, respectively (Figure 2d and Supplemental Figure 3). Importantly, 38 transcripts (3.6%); including carotid-enriched Cebpb, aortic arch-enriched 2210407C18Rik, descending aorta-enriched Rgs5 show substantial contributions of both features to their site-enrichment (Figure 2d and Supplemental Excel File 2).
Further analysis of the smooth muscle cellular population identified seven cell subclusters recognized by differences in their transcriptional profiles (Figure 2 e,f Supplemental Table 4 and Supplementary Excel File 3). Detailed analysis showed that while the vSMC subcluster derived transcriptional signatures were relatively unique, certain clusters showed high overlap with one of the three the anatomic location defined signatures (Figure 2g). Visualization of the contribution of vSMCs from each of the three anatomic locations to the vSMC subcluster, revealed that subclusters with high gene signature overlap to a given anatomic signature contained substantial cell contributions from that anatomic location (Figure 2h). For example, subcluster 0, predominantly composed of cells isolated from the descending aorta, displayed a gene signature that is 80% identical with the descending aorta anatomic signature (Figure 2g and 2h). In contrast, Subcluster 6, predominantly composed of aortic arch cells (93.5% arch-derived vSMCs) shows minimal gene signature overlap (15%) with the aortic arch anatomic signature.
Expression of the HoxA cluster in combination with hierarchical clustering of the gene signature was utilized to further locate the topological distribution of cells (Figure 2i-k). The data uncovered striking links with developmental origin and topographical location despite maintaining strong core smooth muscle cell identity as highlighted by the high expression of common genes (Supplementary Figure 3). We next proceeded to evaluate unique transcriptional signatures associated with three specific anatomic sites: carotids, ascending aorta and descending aorta and explored the possible association of unique signature with emergence of site-specific vascular pathology.
Unique Carotid vSMCs Signature (caSMCs)
The carotids are a major branch point from the aortic arch responsible for the supply of blood to the brain (Figure 3a) and their tunica media is populated by vSMCs derived from the neural crest1. A total of 632 transcripts (3.3% from total transcripts) constituted the molecular signature of the carotid artery (caSMCs) distinguishing this population from their counterpart vSMCs in the aortic arch and descending aorta (Figure 2c).
caSMCs were the predominant population associated with two subclusters: subcluster 2, (87.81% carotid-derived) and subcluster 5 (100% carotid-derived) (Figure 3b,c and Supplementary Table 4). Differential gene expression analysis of these two subclusters identified 459 genes (subcluster 2), and 291 genes (subcluster 5) whose expression pattern separated these two clusters from other SMC (Supplemental Excel File 3). The overlap between carotid artery anatomic location signature and the gene signatures of the two caSMC dominated subclusters ranged from 22.3% (subcluster 5, 65 of the 291) to 94.1% (subcluster 2, 432 of the 459 genes) (Figure 2g, Supplementary Excel File 1 and 3).
Gene ontology analysis of the 632 anatomic location signature genes showed a significant enrichment for proteins involved with growth factor signaling, vascular development and AP-1 response (Figure 3d, Supplementary Excel File 4). From these pathways candidate gene validation using immunohistochemistry was performed for the genes Klf4, Gucy1a3 and Gelsolin (Gsn).
Klf4, a member of the Kruppel-like factor transcription factor family, was represented in 6 of the 10 top ontological categories (response to growth factor, regulation of phosphate metabolic process, vascular development, AP-1 pathway, cellular response to organic cyclic compound and response to wounding) (Figure 3d and 3e). Immunofluorescence staining of KLF4 showed significantly increased expression in all layers of carotid vascular smooth muscle relative to aortic arch and descending aorta (Figure 3f and 3g). Gelsolin (Gsn), a cytoplasmic, actin-regulating protein was identified as a significantly enriched carotid SMC gene through its membership in two top ontology categories - AP-1 response and response to wounding (Figure 3d and 3h). Immunofluorescence staining of GSN showed significantly increased cytosolic expression in the carotid artery with substantial increases in the layer of smooth muscle closest to the vessel lumen when compared to aortic arch and descending aorta (Figure 3i and 3j). Gucy1a3, the major nitric oxide receptor which was recently associated as causative for the carotid-restricted vasculopathy, Moyamoya21, was also identified as a significantly enriched carotid SMC gene (Figure 3k). Immunofluorescence staining of GUCY1A3 showed high expression in carotid vascular smooth muscle with impressive levels in the smooth muscle layer in closest proximity to the endothelium (Figure 3l and 3m).
Unique Aortic Arch vSMCs Signature (aaSMCs)
The aortic arch is characterized by unique physical forces and flow patterns (Figure 4a) that comprise this region emerge from at least two distinct developmental origins: neural crest and secondary heart field5. Site-enriched expression of 226 genes (1.19 % of all transcripts) defined the anatomic location signature of aortic arch derived SMCs (aaSMCs) from their counterpart in the carotid artery and descending aorta (Figure 2c).
aaSMCs predominantly contribute to the formation of subclusters 1 and 6 (Figure 4b, 4c and Supplementary Table 4). Subcluster 1, the largest aaSMC-dominant subcluster, was composed of 91.8 % aaSMCs and was defined by the expression of 105 genes while subcluster 6 was composed of 93.5% aaSMCs and was defined by the expression of 473 genes (Figure 4c, Supplemental Excel File 3, Supplementary Table 4). The overlap between aortic arch location signature and the gene signatures of the two aaSMC dominated subclusters ranged from 15% (subcluster 6, 72 out 473 genes) to 70.4% (subcluster 1, 74 out of 105 genes) due to multiple factors including gene signature, subcluster size and cellular composition of subclusters (Figure 2g, Supplementary Excel File 1 and 3).
Gene ontology analysis of the aortic arch anatomic location signature showed significant enrichment for transcripts involved extracellular matrix organization, metabolism and autophagy (Figure 4d, Supplementary Excel File 4). From these identified signature genes, validation using immunohistochemistry was performed for the selected candidates: Aggrecan (Acan), extracellular superoxide dismutase (Sod3) and Thrombomodulin (Thbd). Aggrecan, a large proteoglycan, was identified through its membership in 3 ontological categories: extracellular matrix organization, collagen metabolic processes and pyruvate metabolic processes (Figure 4d and 4e). Immunofluorescence staining of ACAN displayed significantly increased expression in all layers of aortic arch vascular smooth muscle, characterized by intense staining surrounding each vSMC cell in the arch relative to the carotid arteries and descending aorta (Figure 4f and 4g). Sod3, was identified as part of the antigen presentation ontology category, (Figure 4d and 4h). Immunofluorescence staining of SOD3 showed significantly increased expression of the protein in all layers of aortic arch vascular smooth muscle relative to the carotid arteries and descending aorta (Figure 4i and 4j). Thrombomodulin (Thbd) a critical component the coagulation cascade was identified as significantly enriched in the aortic arch, through membership in the platelet degranulation ontology category (Figure 4d and 4k). Immunofluorescence staining of THBD showed significantly expression in all layers of aortic arch vascular smooth muscle relative to carotid artery and descending aorta (Figure 4l and 4m).
Characterization of Descending Aorta vSMCs (daSMCs)
SMCs of the descending aorta derive from the somites and the lateral mesoderm 1(Figure 5a). The scRNA-seq data uncovered 187 (1% of total) genes that provided the molecular signature for daSMCs (Figure 2c). daSMCs represented the predominant SMC population of subclusters 0, 3 and 4 (Figure 5b, 5c). The majority of daSMCs resided in subcluster 0 (92.44% daSMC), the largest subcluster characterized by the expression profile of 140 genes (Figure 5c, Supplementary Table 4 and Supplementary Excel File 3). Subcluster 3 and 4 are mixed aortic vSMCs clusters, with subcluster 3 (59.5% descending-derived, 37.78% arch-derived) defined by the expression of 188 genes; while subcluster 4 (54.52% descending-derived, 42.46% arch-derived) was characterized by expression of 24 genes (Figure 5c, Supplementary Table 4 and Excel File 3). The overlap between the descending aorta signature and the gene signatures of the three daSMC dominated subclusters ranged from no overlap (subcluster 4, 0 out 24 genes) to 80% (subcluster 0, 113 out of 140 genes) due to multiple factors including gene signature, subcluster size and cellular composition of subclusters (Figure 2g, Supplementary Excel File 1 and 3)
Gene ontology on the descending aorta anatomic location signature identified significant enrichment for genes involved oxidative phosphorylation, extracellular matrix proteoglycans, and muscle contraction (Figure 5d, Supplementary Excel File 4). Three genes were selected to validate the signature using immunohistochemistry: Aquaporin 1 (Aqp1), Ccdc3, and Perlecan (Hsgp2). Aqp1, a well characterized water channel, was identified in the gene ontology categories: oxidative phosphorylation, muscle contraction, blood vessel development and actin filament-based processes (Figure 5e). Immunofluorescence localization of AQP1 identified the protein preferentially in smooth muscle cells of descending aorta (Figure 5f and 5g). Surprisingly, AQP1 distribution in the endothelium was also site-enriched with limited expression in the descending aorta relative to the aortic arch (Supplemental Figure 4).
Ccdc3, a putative/predicted secretory factor, known for its anti-inflammatory role in the vasculature and effects on lipid accumulation was the top transcriptionally enriched descending aorta gene (Figure 5h). Immunofluorescent staining for CCDC3 showed predominant expression in the descending aorta location relative to aortic arch and carotid artery (Figure 5i and 5j). Perlecan (Hsgp2) a secreted heparan sulfate proteoglycan was identified as descending aorta signature gene candidate through its membership in two ontology categories: ECM proteoglycans and blood vessel development (Figure 5d and 5k). Immunofluorescence identification of Perlecan (Hsgp2) revealed preferential expression in vSMC of the descencing aorta (Figure 5l and 5m).
Identification of CD146/MCAM as a novel site-restricted aneurysm-associated gene
The cohort of genes selectively expressed at specific vascular locations (5.5% of total) offered the opportunity to inquire about their potential contribution to regional-specific pathologies. Thus, we performed an in-silico forward genetic screen of archived loci previously associated with mendelian vascular disease through integration of entries from the Human Phenotype Ontology (HPO) database (https://hpo.jax.org/app/)22 with data from Online Mendelian Inheritance of Man (omim.org)23. Following a detailed curation, 16 reported loci representing 6 vascular associated traits were identified (Figure 6a, Supplemental Figure 5 and Supplementary Table 5). From each locus, a list of regional candidate genes was obtained using the UCSC table browser and subsequently overlaid with the site-enriched single cell data to identity novel candidate genes for each genomic region. A total of 13 transcripts predominantly expressed in the specific vascular regions mapped to vascular disease loci (Supplementary Table 5). Given the discovery-driven nature of this approach, we sought to confirm the feasibility of validating putative disease candidate by further validating the regionally restricted expression of a disease-associated, site-enriched candidate, Tfap2b (Figure 6b, Supplementary Table 6)24. Immunohistochemistry at low magnification revealed a clear predominance of TFAP2B in the ascending portion of the arch leading towards the brachiocephalic artery, after which it becomes significantly reduced (Supplemental Figure 6). In the lower curvature however, expression persists into the region beyond the remnant of the ductus arteriosus (Supplemental Figure 6). At higher magnification, substantial increases in both level and frequency of TFAP2B positive nuclei were predominant to the aortic arch when compared to the descending aorta and carotid arteries (Figure 6c, 6d). These finding strongly supported the notion that site-specific expression might uncover disease associated gene candidates.
Amongst the identified candidate transcripts was the daSMC-enriched gene Mcam, which resided in the critical region of FAA1, a locus described for familial aneurysm identified two decades ago using linkage analysis (Figure 6e)20. Importantly, as reported in the original paper, this locus, unlike others, affect multiple aortic segments with dilation of the thoracic and abdominal aorta. Transcriptionally, Mcam (also known as CD146) was significantly enriched in daSMCs relative to other vascular sites (Figure 6f). Within the tunica media, CD146/MCAM protein was found to decorate the extracellular space of individual vSMCs, particularly concentrated in the ventral side of the descending aorta. The aortic arch and carotid artery showed more limited distribution of the protein with higher concentration in the innermost layers of the vessel (Figure 6g and 6h and Supplementary Figure 7). Upon further analysis of CD146/MCAM expression, we also noted uneven distribution across the circumference of the aortic segment. Descending aortic segments in the ventral aspect displayed substantial increased in CD146/MCAM protein expression across all layers when compared when compared to vSMC layers on the opposite dorsal orientation of the vessel (Figure 6i).
Given that echocardiographic examination of the FAA1 families indicated the involvement of both the thoracic and abdominal aorta in aneurysms, we performed scRNA-seq profiling of vascular segments from dissecting aortic abdominal aneurysms (AAA) modeled by angiotensin II infusion (Figure 7a, b) to gain further insights into the expression profile of MCAM in aneurysmal SMCs. Using this approach 813 to 2440 cells per group were sequenced with 198,600 mean reads per cell (Supplemental Figure 8). We observed six groups of cells representing distinct cell type populations (Figure 7c, d, e). Expression analysis of CD146/Mcam across all identified cell types revealed that a substantial population of SMCs that expressed high levels of CD146/Mcam transcripts (Figure 7f).
SMCs positive for CD146/Mcam were identified in both control and AAA vessels where these cells expressed similar levels of classical SMC markers, Acta2 and Myh11 (Figure 7g). Direct comparison of the expression patterns of these CD146/Mcam+ SMCs from AAA and control showed a substantial decrease in the number of CD146/Mcam transcripts expressed per cell in the AAA derived SMCs (Figure 7h). Given the observed changes at the transcriptional level, expression of CD146/MCAM was assessed in control and AAA sections using immunofluorescence (Figure 7i). SMCs from the AAA samples showed substantially decreased CD146/MCAM at the protein level when compared to those from control aorta despite both SMC populations expressing substantial amounts of alpha smooth muscle actin (Figure 7i). Gene ontology enrichment for transcripts associated CD146/Mcam+ SMCs identified that significant fraction of genes that were upregulated in CD146/Mcam+ SMCs contribute to interactions with extracellular matrix, a crucial process that is altered in aneurysms (Figure 7j).
To confirm a causative role for CD146/MCAM in the pathogenesis of AAA, CD146/Mcam knockout mice were crossed to hypercholesterolemic (ApoE -/-) animals and the resulting progeny were treated with angiotensin II infusion for 28 days (Figure 8a). Mice lacking CD146/MCAM showed substantially decreased survival during the treatment period, with only 30% of CD146/MCAM-null animals reaching the end of the 28-day period, relative to 60% of wild-type littermates (Figure 8b). Measurement of the external diameter of the thoracic aorta at sacrifice showed no significant differences between ApoE−/−Mcam+/+ and ApoE−/−Mcam−/−groups (Figure 8c and 8d). In contrast, abdominal aorta measurements showed significant increases in width between the two groups, with ApoE−/−Mcam−/− animals showing substantially and statistically significant increased dilation of the abdominal aorta (Figure 8e and 8f). Taken together, these findings support the conclusion that expression of CD146/MCAM in the descending-abdominal aorta is protective against the development of AAA.