scRNA-seq clustering by Seurat for cardiac hypertrophy
An overview of flowchart in this study is shown in Figure 1A. Single-cardiomyocyte RNA-seq data were obtained from GSE95140 dataset. GSEA was used to analyze cardiac hypertrophy-related KEGG pathways. As shown in Figure 1B, citrate cycle TCA cycle (ES=0.84, NES=1.82, p-value<0.01, Size=30), cysteine and methionine metabolism(ES=0.60, NES=1.78, p-value<0.01, Size=32), fatty acid metabolism(ES=0.85, NES=2.21, p-value<0.01, Size=39), lysine degradation(ES=0.73, NES=2.23, p-value<0.01, Size=41), propanoate metabolism(ES=0.81, NES=2.08, p-value<0.01, Size=31), apoptosis (ES=-0.33, NES=-1.20, p-value=0.04, Size=80), arrhythmogenic right ventricular cardiomyopathy (ARVC; ES=-0.37, NES=-1.34, p-value=0.03, Size=74), dilated cardiomyopathy (ES=-0.35, NES=-1.30, p-value=0.05, Size=85), ECM receptor interaction (ES=-0.35, NES=-1.30, p-value=0.04, Size=82) and focal adhesion (ES=-0.40, NES=-1.50, p-value<0.01, Size=192) were significantly enriched by differentially expressed genes between TAC and sham groups. Furthermore, cardiac muscle contraction (ES=0.65, p-value<0.01, Size=69), citrate cycle TCA cycle (ES=0.79, p-value<0.01, Size=30), fatty acid metabolism (ES=0.70, p-value<0.01, Size=32), glutathione metabolism (ES=0.68, p-value<0.01, Size=22) and glycolysis gluconeogenesis (ES=0.76, p-value<0.01, Size=46) could be significantly related to the development of cardiac hypertrophy according to TAC time series.
Cardiomyocytes were screened for further analysis (Figure 2A; Table S1). Then, PCA was used to select PC with large standard deviations. Elbow diagram showed that PC was set as 12 (Figure 2B). UMAP was used to classify these cardiomyocytes into six clusters (Figure 2C). After normalization, 3408 highly variable genes were screened and the top 20 genes were visualized, as shown in Figure 2D. Heat maps depicted the top ten marker genes for each cluster (Figure 2E).
Identification of hub pathways for cardiac hypertrophy
Consequently, a total of 288 markers were detected based on Wilcoxon signed rank test among the different cell clusters (Table S2). To probe out potential biological functions of differentially expressed genes, we presented functional enrichment analyses. GO enrichment analysis results showed that these genes were primarily enriched in cardiac hypertrophy-related biological processes including muscle cell development (Count=22, p-value=1.57E-14), myofibril assembly (Count =15, p-value=4.58E-14), ribonucleotide metabolic process(Count =28, p-value=5.57E-14), ribose phosphate metabolic process (Count =28, p-value=9.14E-14) and cardiac muscle tissue development (Count =22, p-value=2.32E-13). Furthermore, these genes participate in myocardial cell components such as contractile fiber (Count =25, p-value=7.23E-18), myelin sheath (Count =23, p-value=2.14E-14), mitochondrial inner membrane (Count =27, p-value=8.43E-14) and actin cytoskeleton (Count =20, p-value=3.95E-07). Also, they had several crucial molecular functions like ubiquitin protein ligase binding(Count =15, p-value=4.20E-05), actin binding(Count =16, p-value=1.86E-04), ion channel binding(Count =9, p-value=3.02E-04), iron ion binding (Count =5, p-value=3.02E-04) and antioxidant activity(Count =7, p-value=6.84E-04) (Figure 3A; Table S3). As visualized in the KEGG pathway map, these genes were mainly involved in cardiac hypertrophy, ion transport, myocardial remodeling, apoptosis, HIF pathway and Metabolize (Figure 3B; Table S4). As shown in Figure 3C, there were distinct differences in the GSVA calculated scores of these KEGG pathways at different stages of cardiac hypertrophy compared to sham operation. Concretely, compared to sham group, cardiac hypertrophy scores were significantly higher in TAC D3, W1, W2 and W4 groups (all p-value<0.05). However, no statistical difference was detected between sham and TAC W8 groups. Compared to sham group, there were significantly higher apoptosis scores in TAC D3, W1and W4 groups (all p-value<0.05) not TAC W2 and W8 groups. Lower metabolise scores were found in TAC W1 and W8 groups (both p-value<0.05) not TAC D3, W2 and W4 groups than sham group. Also, our results showed that HIF pathway scores were distinctly lower in TAC W1 and W2 groups (both p-value<0.05) not TAC D3, W4 and W8 groups than sham group. Furthermore, compared to sham group, there were distinctly lower ion transport scores in TAC W1 group (p-value<0.05), not TAC D3, W2, W4 and W8 groups. Myocardial remodeling scores were remarkably higher in TAC D3, W1, W2, W4 and W8 groups than sham group (all p-value<0.05). Figure 3D depicted that the difference of the GSVA calculated scores of these KEGG pathways across different cell clusters.
Identification of hub genes and their expression in each cell subtype for cardiac hypertrophy
Eight hub genes with the highest degree in the PPI network and the strongest correlation with GSVA calculated score of hub pathways were identified for cardiac hypertrophy, including Ugp2 in the metabolise pathway (Correlation coefficient=0.47, Degree=35), Atp1b1 in the ion transport pathway(Correlation coefficient=-0.43, Degree=12), Flnc and Tgm2 in the cardiac hypertrophy pathway (Flnc: Correlation coefficient=0.48, Degree=22; Tgm2: Correlation coefficient=0.43, Degree=17), Ctsd and Clu in the apoptosis pathway (Ctsd: Correlation coefficient=0.45, Degree=20; Clu: Correlation coefficient=0.42, Degree=18), Pygm and Vldlr in the HIF pathway (Pygm: Correlation coefficient=0.47, Degree=16 Vldlr: Correlation coefficient=0.50, Degree=14) and Flnc in the myocardial remodeling (Correlation coefficient=0.46, Degree=17), as shown in Figure 4A and Table S5. Then, the six cell clusters were defined as fibroblast, CM-A, CM-V, trabecular CM and endothelial cell by singleR package (Figure 4B).
Additionally, Figure 4C showed that there was a high heterogeneity in expression of the eight hub genes across different cell clusters. For example, Vldlr was up-regulated in CM-A (average expression>1.5 and percent expressed=100) and down-regulated in trabecular CM and CM-V (average expression<-1.0 and percent expressed=100). Compared to trabecular CM and CM-A, there was a low percentage of Tgm2 expression in fibroblast, endothelial cell and CM-V (average expression>1 and percent expressed<99.0%).
Furthermore, we compared the expression patterns of these hub genes at different stages of cardiac hypertrophy in mice. Compared to sham operation, the expression levels of Atp1b1 and Vldlr were significantly elevated in mice treated by TAC at the D3, W1, W2 and W4 (all p-value<0.05; Figure 4D). For Clu, there were distinctly higher expression levels in mice treated by TAC at the D3, W1, W4 and W8 than sham operation group (all p-value<0.05). Ctsd expression had an increase expression levels in mice treated by TAC at the W2, W4 and W8 (all p-value<0.05). Compared to sham operation, both Flnc and Tgm2 expression was notably elevated in mice with TAC at the D3, W1, W2 and W8 (all p-value<0.05). Moreover, the expression levels of Pygm and Ugp2 were prominently higher in mice with TAC at the D3, W1, W2, W4 and W8 than in mice with sham operation (all p-value<0.05). These results indicated that there was a heterogeneity of expression patterns hub genes at different stages of cardiac hypertrophy.
Validation of hub gene expression and hub pathways using external datasets
Hub genes and hub pathways were validated using two external datasets (GSE76 and GSE36074). In the GSE76 dataset, our results showed that the expression levels of Atp1b1, Clu, Ctsd, Flnc, Pygm, Tgm2 and Ugp2 were distinctly higher in mice at 1w and 8w following pressure-overload induced cardiac hypertrophy than those in mice with sham operation for 1w (all p-value<0.05; Figure 5A). In converse, Vldlr had evidently lower expression levels in mice at 1w and 8w following pressure-overload induced cardiac hypertrophy than those in mice with sham operation for 1w (both p-value<0.05). As expected, there was no significant difference in their expression between mice with sham operation for 1w and for 8w. Similar to the validation results using the GSE76 dataset, the expression of Atp1b1, Ctsd, Flnc, Pygm, Tgm2 and Ugp2 were significantly increased and the expression of Vldlr was decreased both in mice with heart failure and only hypertrophy induced by TAC from the GSE36074 dataset (all p-value<0.05). For Clu, there was higher expression level in mice with heart failure (p-value<0.05) not only hypertrophy than mice with sham operation (Figure 5B).
The GSVA calculated scores of hub pathways were calculated based on the GSE76 and GSE36074 datasets. As shown in Figure 5C, the GSVA calculated scores of apoptosis, ion transport, metabolise and myocardial remodeling were obviously higher in mice at the first and eighth week after induced by pressure overload compared to sham operation group (all p-value<0.05). Cardiac hypertrophy had a higher GSVA calculated score in mice at the first week after induced by pressure overload than in mice with sham operation for 1w (p-value<0.05). For HIF pathway, its lower GSVA calculated score was found in mice at the first week after induced by pressure overload compared to mice with sham operation for 1w (p-value<0.05). In the GSE36074 dataset, the GSVA calculated scores of apoptosis, cardiac hypertrophy, metabolise and myocardial remodeling were prominently higher both in mice with heart failure and only hypertrophy induced by TAC (all p-value<0.05; Figure 5D). Conversely, the GSVA calculated score of HIF pathway was significantly decreased both in mice with heart failure and only hypertrophy induced by TAC (both p-value<0.05). Furthermore, ion transport had a distinctly lower GSVA calculated score in mice with heart failure compared to sham operation (p-value<0.05).