Network models provide a powerful framework for integrating heterogeneous datasets and are at the forefront of multi-omics data analysis and interpretation, enabling the discovery of hidden patterns, relationships, and insights that would otherwise be difficult to detect using traditional approaches. By leveraging the interconnectedness and interdependencies within a co-expression network, such as nodes representing genes and edges representing relationships between them, network models offer a holistic view of complex systems, such as human diseases.
In this study, we provide a network-based modelling approach integrating the biggest number of cutaneous transcriptomic datasets of AD, to date. We computed gene co-expression networks of lesional and non-lesional skin, both consisting of 17,903 genes. Notably, the lesional network contained substantially fewer edges (1,649,569) than the non-lesional network (1,924,001) highlighting disrupted patterns of co-expression in AD skin lesions and indicating dysfunctional gene regulatory mechanisms.
In recent years, the advancement of analysis strategies of disease-related networks, led to the identification of so-called “disease modules”. Disease modules are subsets of genes highly interconnected within a disease network from which we can gain insights into the pathophysiology of the disease under study. Genes belonging to a disease module often work together in specific pathways or functional units, shedding light on how impaired molecular mechanisms contribute to the disease development or progression [45, 46]. By combining data-driven evidence with prior knowledge, we here identified a robust disease module of the AD skin lesion encompassing 2,452 genes.
As this community of genes related to the atopic lesion was derived by several features including gene deregulation, gene-gene co-expression relationships, and genetic variation associated with the disease, the computation of the disease module overperforms the computation of differentially expressed genes as usually identified and used for downstream analysis in single-dataset transcriptomic studies [35].
While gene sets derived from differential expression analysis alone usually reveal a pleiotropy of pathways related to inflammation and innate immune responses, the disease module identified in this study showed the strongest enrichment for pathways targeted by new state-of-the-art treatment solutions for AD such as biologics targeting IL4-IL13 signalling (dupilumab and tralokinumab), and drugs targeting janus kinases 1,2 and 3 (barticitinib, upadacitinib and abrocitinib). However, the lack of efficacy in some patients highlights the need for other effective molecules, and underlines the importance of developing targeted medicine tailored to patients characteristics. Further enriched pathways reveal the relevance of TNF, NF-kB, and chemokine signalling as well as a strong signature of epidermal barrier dysfunction. Here we also identified a group of genes that is overrepresented in a high number of known dysregulated pathways in the AD lesion. Such genes, including MMP9, MMP3 and COL1A2 among others, might represent key players in the molecular impairment underlying the aberrant immune response and the dysfunctional barrier as characteristic for AD. In fact, keratinocytes express MMP9 as an important factor in the maintenance of the epithelial barrier function [47].
The analysis of the disease module revealed transcriptomic signatures from abundant cell types in the skin such as keratinocytes, eccrine gland cells, fibroblasts, and endothelial cells as well as immune cells such as lymphocytes and innate immune cells indicating that a complex dysregulated multi-cellular system sustains the phenotype under study. Understanding such complex multi-cellular interactions is crucial for advancing disease modelling and therapeutic interventions. In this perspective, it emerges the need for more sophisticated disease modelling procedures that incorporate the interactions and crosstalk among different cell types, which can lead to more precise and effective treatments.
Importantly, the characterisation of the gene connectivity within the disease module leveraged fundamental insights in the gene deregulation underlying the skin lesion. In fact, while relationships between known AD-associated genes are overrepresented in pathways that are typically disrupted in immune-mediated dermatological diseases, such as interleukin signalling, extracellular matrix organisation, collagen formation and degradation, connection between AD-associated and non-associated genes underlined mRNA processing indicating that the gene regulation machinery is tightly intertwined with known AD biomarkers such as interleukins and chemokines to sustain the AD pathological phenotype.
In this study, we investigated the interactional properties of genes belonging to the disease module, so as to give further insights into its functional characterisation. We demonstrated that, ranking the edges based on data-driven and prior knowledge-driven evidence of association with AD, is a valid approach to fully exploit network medicine principles to prioritise putative biomarkers of disease.
In fact, we hypothesise that exploiting properties of connectivity patterns within the disease network model can be an alternative and efficient tool for biomarker identification. The paradigm is that, while traditionally a biomarker is intended as a disrupted expression profile of a single gene specifically occurring in the skin lesion, network medicine suggests that certain patterns of connectivity involving two or more interacting genes could be exploited as biomarkers of lesions.
While the use of precompiled drug libraries is the state of the art in drug discovery, the principles by which the drugs are included are usually not evaluated. However, a rational inclusion process of drugs in libraries for drug discovery would enable a more effective process with a sensible improvement of the predictions, as we recently demonstrated in a case study on COVID-19 drug repositioning [48].
Since disease modules provide a formal perspective over the complexity of disease mechanisms, network pharmacology approaches can take advantage of this knowledge by considering the dependencies among genes belonging to the disease module. Given the high reliability of disease modules identified by integrating data-driven and a priori knowledge about the disease under study, compound libraries can be, in turn, designed in order to target multiple components belonging to the disease module, aiming to optimize the identification of compounds putatively effective for the disease. The definition of tailored compound libraries based on disease modules can leverage the identification of compounds that can have a broader impact on the disease phenotype and offer potential synergistic effects. Although the employment of network pharmacology approaches to identify candidate drugs and targets is well established, the possibility of using such principles to suggest desired structural properties to be considered in the discovery of drug candidates is less explored. Indeed, while current network pharmacology approaches are centered on molecular biology aspects, here we establish the paradigm that intimately integrates molecular biology with computational chemistry in order to construct tailored compound libraries.
We here demonstrate how the disease module can assist in building a custom compound library for drug discovery in AD. To do so, we prioritised common pharmacophores of drugs employed in the treatment of AD or other similar phenotypes whose target genes belong to the disease module. We identified a substructure shared among the investigated drugs. Afterwards, we performed a virtual screening on multiple high-dimensional repositories and defined a custom drug library to tailor drug discovery efforts towards the treatment of atopic dermatitis. This study is subject to certain limitations due to the limited amount of clinical data available along with the transcriptional profiles obtained from public repositories. The absence of detailed clinical information prevents us from establishing meaningful associations between the disease module and relevant clinical parameters, such as disease severity and elapsed time since initial diagnosis. Furthermore, the lack of information on whether patients underwent active pharmacological therapy precludes us from discerning any potential influence of ongoing or terminated pharmacological treatments on the druggability profile of the disease module, which resulted, eventually, in the tailored compound library.