Our study embarked on a comprehensive bioinformatics and ML analysis to describe the transcriptomic landscapes and identify novel miRNA biomarkers distinguishing HCM samples from healthy controls. We implemented a rigorous preprocessing step, including log2 transformation and quantile normalization, and generated all GEO datasets using Illumina chips, with GSE36946 generated on Illumina Human v2 MicroRNA expression beadchip, GSE36961 on Illumina HumanHT-12 V3.0 expression beadchip, and GSE141910 on Illumina HiSeq 2500 expression beadchip. Thus, we effectively minimized technical biases, enhancing the reliability and comparability of gene expression data across samples.
We identified 155 DEGs between HCM and healthy samples, with 51 upregulated and 104 downregulated. Our analysis also facilitated the identification of common target genes between DEGs and DE-miRNAs, revealing key upregulated genes COL21A1, PROM1, and downregulated genes FOS, BTG2, ELL2, PDK4, SERPINE1, SRGN, and TIPARP. This intersection not only validates the regulatory impact of identified miRNAs on gene expression but also accentuates their potential as biomarkers for HCM.
The GO and KEGG pathway enrichment analyses revealed significant biological insights. The GO analysis showed that DEGs were significantly enriched in biological processes such as regulation of inflammatory response and endopeptidase activity, myeloid cell differentiation, and intracellular zinc ion homeostasis, suggesting a complex interplay of immune response and metabolic regulation in HCM pathology. Furthermore, the KEGG pathway analysis elucidated involvement of DEGs in critical pathways including coronavirus disease, phagosome, complement and coagulation cascades, pertussis, and apoptosis, indicating potential mechanisms through which HCM influences cardiovascular health and disease progression. Studies, such as those on atherosclerosis, have shown that DEGs are notably enriched in pathways related to immune system interactions, metabolic processes, and specific diseases like dilated cardiomyopathy, highlighting the complex interplay between genetic factors and cardiovascular health [41]. These findings underscore the multifaceted nature of cardiovascular pathologies, including HCM, and suggest that similar pathways might be implicated in HCM pathology, warranting further investigation into the roles of infectious agents, immune responses, and thrombotic processes
Our analysis of the GSE141910 validation dataset, which includes transcriptional expression data from 194 samples—28 from patients with HCM and 166 from controls—revealed intriguing discrepancies and validations that highlight the complexity of gene expression regulation in HCM. Notably, our analysis aimed to validate expressions of two genes targeted by upregulated microRNAs, namely COL21A1 and PROM1. Despite expectations based on their upstream miRNA regulation, neither gene exhibited a statistically significant increase in expression within the HCM patient samples as illustrated in Fig. 5A. This outcome could suggest a possible post-transcriptional modulation or involvement of additional regulatory mechanisms that override the miRNA effect, a concept supported by various studies demonstrating the complexity of miRNA-target interactions in cardiac diseases. Research has shown that miRNAs can engage in intricate regulatory networks, impacting gene expression through multiple pathways, including inhibiting translation or promoting mRNA degradation. These interactions are influenced by the degree of complementarity between the miRNA and mRNA and can be modulated by alternative biogenesis pathways and the presence of other regulatory molecules [42]. Furthermore, studies have highlighted the role of miRNAs in various aspects of cardiac physiology and pathology, including cardiac remodeling and heart failure, where miRNAs regulate key genes and pathways involved in these processes. These interactions can be influenced by the cellular context, the presence of other regulatory proteins, and even changes in the physical structure of the myocardium, which can all alter the functional impact of miRNAs [5, 43]. Moreover, studies underscore the importance of miRNAs in regulating mitochondrial function in cardiomyocytes, revealing yet another layer of complexity in how miRNAs influence cardiac pathology and health [44]. These insights collectively enhance our understanding of the regulatory landscape in which miRNAs operate, particularly in cardiac function and disease, and underscore the potential for diverse regulatory mechanisms to modulate or even override miRNA effects.
The significant decrease in expression of the genes ELL2 and PDK4 in HCM samples, as highlighted by our findings, aligns well with the established roles of these genes in cardiac muscle function and energy metabolism. Notably, studies have shown that PDK4 plays a crucial role in the regulation of glucose and fatty acid metabolism within the heart, indicating its potential involvement in the metabolic adaptations seen in cardiac diseases including hypertrophic cardiomyopathy [45, 46]. Moreover, PDK4 is known to be sensitive to changes in nutritional and hormonal conditions, which can influence its expression and activity, affecting the metabolic state of cardiac tissues [47]. This sensitivity to metabolic conditions supports the observed downregulation in HCM and provides a potential link to the altered metabolic demands in hypertrophic tissues. On the other hand, ELL2, while not as extensively studied in the context of cardiac function, is generally involved in transcriptional processes [35]. Its decreased expression might reflect broader disruptions in gene regulation that could contribute to the pathological landscape of HCM.
Interestingly, the bioinformatics prediction of downregulation for BTG2 and SRGN in HCM samples was contradicted by the validation dataset, where both genes were actually found to be upregulated. This discrepancy could indicate a compensatory mechanism within the cardiac tissue in response to hypertrophic stress or might reflect the involvement of these genes in other aspects of cardiac remodeling that are not solely dependent on miRNA regulation. This observation is particularly significant as it suggests that cardiac pathology may involve multifaceted gene expression responses that could be potential targets for therapeutic intervention. The expression trends of BTG2 and SRGN, therefore, may represent a physiological adaptation or maladaptation in the progression of HCM, warranting further investigation into their specific roles and regulation in the heart.
These results collectively contribute to the intricate mosaic of gene-miRNA interactions within the HCM landscape, illustrating that while some findings confirm theoretical predictions, others reveal the limitations of current models and the need for a more nuanced understanding of the molecular underpinnings of this complex disease.
The investigation into the interplay between DE-miRNAs and their gene targets provides crucial insights into the underlying mechanisms of HCM. The functional interaction analysis performed using the GeneMANIA online webtool has elucidated complex networks that underline the pathophysiology of HCM. By incorporating genes like BTG2, ELL2, PDK4, and SRGN, whose expression levels significantly differ between HCM and control samples as per the GSE141910 validation dataset, a detailed interaction network was revealed. This network, comprising 24 genes and 230 links with seven types of interactions, primarily highlighted physical interactions, which accounted for over three-quarters of all connections. This predominance suggests a significant role of protein-protein interactions in the molecular dynamics of HCM. The network also emphasized key metabolic and biosynthetic processes such as the regulation of sulfur metabolic [48] and various acetyl-CoA related processes [49], pointing to metabolic alterations as crucial elements in HCM pathology. These findings suggest potential therapeutic targets and warrant further investigation into how these metabolic pathways might be manipulated to mitigate HCM progression.
The crucial role of miRNAs in HCM pathology is highlighted by their ability to regulate gene expression post-transcriptionally [50]. Our analysis points to miRNAs like hsa-miR-373 and hsa-miR-514 as vital players in the molecular mechanisms of HCM. Recent research has expanded on identifying miRNA biomarkers for HCM, applying ML algorithms such as RF, SVM, and LR for HCM classification based on miRNA profiles [8]. Such biomarkers are also under investigation for other cardiovascular conditions like myocardial infarction [51–53], myocardial fibrosis [54], coronary artery disease [9], heart failure [55], ischemia [56], stroke [57] and dilated cardiomyopathy [58]. Our use of ROC analysis assessed five DE-miRNAs for HCM detection, with AUC values ranging from 0.821 (hsa-mir-10a*) to 0.884 (hsa-mir-373). Additionally, we enhanced DE-miRNA classification with traditional ML algorithms, SVM and RF. Our dual strategy first utilized five DE-miRNAs as ML model features; then we included additional miRNAs with AUCs over 0.80, totaling 26, for model training.
The first method improved the AUC to 0.953 (95% CI [0.882–1.000]) for SVM, marginally surpassing RF with an AUC of 0.922 (95% CI [0.835–1.000]). Our extended approach further increased discriminatory power to an AUC of 0.995 (95% CI [0.980–1.000]) for both SVM and RF. Although both algorithms showed identical AUC values, SVM slightly outperformed RF in terms of accuracy (97.4% vs. 94.7%) and precision (97.0% vs. 94.1%).
In this study, we identified two key genes and five miRNAs linked to HCM, confirming their diagnostic value through an external dataset and machine learning techniques. While our results offer a hopeful method for diagnosing HCM, certain limitations must be considered. The foundation of our investigation is bioinformatic analysis; hence, additional experimental and clinical studies are necessary for confirmation. Moreover, future research should explore the underlying roles of the identified genes and miRNAs in HCM.