Immune Cells Landscape of the Training Dataset
GSE36169 expression data were processed and normalized. Boxplots showed the normalized gene expression profiles. Hierarchical cluster analysis was performed using the "hclust" function in R. In GSE36169 dataset, the outlier data, GSM882149 and GSM882150, were excluded from further analysis. Principal component analysis (PCA) scatter plots showed significant differences between the haired group and the bald group (Supplementary Fig. 1a–h). We used the ssGSEA algorithm to investigate the differences in immune cell infiltration between the haired group and the bald group (Fig. 1a). The increased infiltration of γδT cell, central memory CD8+ T cell, mast cell and immature B cell were significantly higher in the bald group than in the haired group (Fig. 1b). At the same time, the decreased infiltration of activated CD8+ T cell, effector memory CD4+ T cell, eosinophils and neutrophils were significantly higher in the bald group than in the haired group (Fig. 1c). The results indicating that these immune cells may are essential to the changes in the immune microenvironment.
GSEA Revealed the Involvement of the Related Immune System
We used the "limma" R package to identify 239 statistically changed genes (adjusted p-value < 0.05) from the GSE36169 dataset and then subjected them to GSEA. The statistically changed genes were enriched in a total of 13 pathways, of which 6 were immunity-related pathways, including innate immune system, adaptive immune system, cytokine signaling, interferon-γ (IFN-γ) signaling, interferon signaling and interleukins signaling (Fig. 2a, c). At the same time, these pathways were activated (Fig. 2b), suggesting that immune responses may be involved in the progression of hair loss.
Identification of DEGs and Functional Enrichment Analysis
A total of 112 DEGs, including 22 up-regulated and 90 down-regulated genes, were obtained based on adjusted p-value < 0.05 and |log2FC| > 1 between the bald group and the haired group in the GSE36169 dataset (Fig. 3a-b). Furthermore, KEGG and GO functional enrichment analyses were performed to determine the biological features of these 112 robust DEGs. KEGG pathway analysis revealed that DEGs were markedly enriched in hair growth related pathways (e.g., Wnt signaling pathway, TGF-β signaling pathway, Hippo signaling pathway) and IL − 17 signaling pathway (Fig. 3c). GO functional enrichment analysis revealed hair cycle, keratinocyte differentiation and keratinocyte differentiation were downregulating (Fig. 3d). These results indicated that hair growth and pro-inflammatory factors were involved in the progression of hair loss.
Identification of Hub Genes and hub IRGs
An interaction network between proteins encoded by the 112 DEGs was constructed using the STRING database. The interaction network comprised 84 nodes and 158 edges, visualized using the Cytoscape software (Fig. 4a). We then used the cytoHubba-MCC plugin to identify 10 hub genes, including COL1A1, MMP2, MMP9, TIMP3, THBS1, LOX, BMP2, COL11A1, CTSK and PTPRC (Fig. 4b). To investigate the IRGs expression characteristics in the bald group, DEGs were screened to generate IRGs based on the ImmPort database. The number of overlapping IRGs between the ImmPort database and DEGs were 21 (Fig. 4e) (Table 1). The PPI network was constructed to show the relationship between IRGs (Fig. 4f). The 4 hub IRGs, namely MMP9, PTPRC, BMP2 and THBS1, were obtained by the intersection with hub genes and IRGs (Fig. 4g). Boxplot showed MMP9 and PTPRC were significantly upregulated in the bald group compared to those in the haired group. In contrast, BMP2 and THBS1 were significantly downregulated in the bald group compared to those in the haired group (Fig. 4h).
Correlation analysis was performed to explore the expression patterns of hub genes and IRGs. First of all, various correlation was observed between the hub genes. In terms of the hub IRGs, MMP9 was positively correlated with PTPRC, and BMP2 was positively correlated with THBS1 (Fig. 4c). Secondly, we further investigated the correlation between the hub gene and the differentially infiltrated immune cells. COL11A1, CTSK, LOX, MMP2, MMP9 and PTPRC were positively correlated with the differentially infiltrated immune cells. In contrast, a negative correlation was observed between the differentially infiltrated immune cells and BMP2, COL11A1, THBS1 and TIMP3. (Fig. 4d).
Functional enrichment analysis of the hub IRGs
Functional enrichment analysis of the hub IRGs were explored using GSEA, and indicated that the pathways of allograft rejection, coagulation and IFN-γ response are highly enriched (Fig. 5a-d). These analyses verified that the hub IRGs are closely involved in the adaptive immune response and chronic inflammatory.
Validation of the Differentially Infiltrated Immune Cells and Hub IRGs in the Verification Dataset
We validate the differentially infiltrated immune cells and the expression levels of the hub IRGs in the GSE90594 dataset. GSE90594 expression data were processed and normalized. Boxplots showed the normalized gene expression profiles. Hierarchical cluster analysis was performed using the "hclust" function in R. In GSE90594 dataset, the outlier data, GSM2407377, GSM2407382, GSM2407383, GSM2407384, GSM2407388, GSM2407390, GSM2407391, GSM2407392, GSM2407397, GSM2407398 and GSM2407399, were excluded from further analysis. Principal component analysis (PCA) scatter plots showed significant differences between the bald scalp of male AGA patients (AGA group) and the haired scalp of healthy men (control group) (Supplementary Fig. 2a–h). First, the immune cells landscape of the verification dataset was largely consistent with these of the training dataset (Fig. 6a). Second, the increased infiltration of γδT cell, central memory CD8+ T cell, mast cell and immature B cell were significantly higher in the AGA group than in the control group (Fig. 6b). Third, the decreased infiltration of activated CD8+ T cell, effector memory CD4+ T cell, and eosinophils was significantly higher in the AGA group than in the control group, whereas, there was no significant difference in the decreased infiltration of neutrophils between the two groups (Fig. 6c). At last, the expression difference of the hub IRGs genes between the AGA group and the control group in the verification dataset was consistent with these of the training dataset (Fig. 6d). These validated analyses demonstrated that the significant difference of the immune cell infiltration and the hub IRGs expression was not only between the bald scalp and the haired scalp of the male AGA patients, but also between the bald scalp of male AGA patients and the haired scalp of healthy men.