Single-cell and spatial transcriptome characterization of male breast cancer
To delve into the cellular ecosystem and molecular characteristics of MBC, we conducted scRNA-seq and ST-seq on 34 fresh tissue samples from 27 patients. This included 20 scRNA-seq samples from 13 patients (13 in local areas, 4 near areas, and 3 lymph areas), as well as ST-seq samples from 14 patients (Supplementary Data 1). Following multiple quality control steps, we obtained single-cell transcriptomes of 204,419 cells from all scRNA-seq data for subsequent analysis. Utilizing Seurat V523 for integrated all scRNA-seq data, we identified 27 cell clusters (Extended Data Fig. 1a), covering 9 main cell types: epithelial cells (50.11%), NK/T cells (20.57%), fibroblasts (10.13%), B cells (6.59%), macrophages (5.21%), endothelial cells (4.77%), plasma cells (1.55%), mast cells (0.59%), and neutrophils (0.48%) (Fig. 1a). The majority of samples retained diverse cell types (Supplementary Data 2). Epithelial cells were detected in all samples (Extended Data Fig. 1b and Supplementary Fig. 1), and all types of cells showing distinct boundaries between with different cell populations (Extended Data Fig. 1c). Epithelial cells were mainly enriched in local areas and near areas (59.4% and 49.6%), while significantly fewer were found in lymph nodes (19.5%). In contrast, lymph node immune cells were enriched at 80.0%, 30.6% in local areas, and least in near areas (13.7%). Due to the small volume of MBC tissue samples used in this study, the sampling sites of local areas and near areas were very close, and the main cell types and their distributions in these two tissues were extremely similar (Extended Data Fig. 1d). All these cells were manually annotated based on their specific marker genes, which showed significant specificity in different cell populations (Fig. 1b and d, and Supplementary Data 3).
For ST-seq data, an average of 2,908 spots per slice were obtained (range: 2,097 to 4,268 spots), as detailed in Extended Data Fig. 1e and Supplementary Data 4. Using the same marker gene list as used in scRNA-seq, we calculated specific cell meta-modules for each spot and manually annotated them (Fig. 1d, e, and Supplementary Data 5). Unlike in scRNA-seq, in ST-seq data, the expression of marker genes in immune-related spots showed more overlap. Therefore, we calculated the Shannon entropy for different types of spots in each slice. The analysis results showed that the Shannon entropy of epithelial cell spots was generally lower in all slices (Extended Data Fig. 1f), indicating higher consistency in gene expression patterns among epithelial cell spots and relatively higher purity of epithelial cells. In most ST data, the median percentage of epithelial cell spots was 45.42% (range: 16.59%-62.67%), followed by fibroblast spots with a median percentage of 16.76% (range: 2.637%-36.785%). In contrast, fewer immune cell spots were obtained in ST data, especially T cell spots (5.77%-14.57%) (Fig. 1f, g).
Subtype specific energy metabolism and neuronal signaling pathways in MBC cells
Given the limited research and data on MBC cells, in this study, we focused on these critical malignant epithelial cells to deeply explore the molecular biological characteristics of MBC at the cellular level. We used the numbat24 tool, a somatic haplotype-aware analysis method based on somatic copy number variation (CNV), to infer genomic aberrations from the scRNA-seq data of each patient. The analysis results showed that MBC patients exhibited extremely high genomic heterogeneity. For example, we observed minimal genomic alterations in patients P1, P2, P7, P6, and P9, while patients P8 and P12 showed mainly chromosome amplifications, and patients P4, P3, P5, and P11 exhibited mainly chromosome deletions. Patients P10 and P13 showed a diversity variety of genomic alteration types (Supplementary Fig. 2). Through numbat analysis, we identified a total of 87,218 malignant tumor cells (85.14% of total epithelial cells) in all scRNA-seq data, with the highest proportion in lymph areas samples (99.50%), followed by local areas tissues (90.70%), and the lowest in near areas tissues (65.50%) (Expanded Fig. 2a, b).
To further explore the heterogeneity of tumor cells in MBC, we classified these cancer cells into six different subtypes based on functional genes associated with cancer development, including neuroendocrine-like cancer cells (GFRA1, NF1, GRIA2)25–27, proliferation-like cancer cells (CD24, TUBB3, TCIM), metastasis-like cancer cells (RNF181, MDK, BST2), invasion-like cancer cells (MKI67, TOP2A, NDC80), immune-like cancer cells (CD74, PTPRC, PARP14), and EMT-like cancer cells (VIM, FN1, MMP2) (Fig. 2a, b, and Supplementary Data 6). Overall, proliferation-like cells exhibited higher Shannon diversity than other subtypes in MBC (Expanded Fig. 2c), and the Shannon diversity of different subtypes varied among different patients (Expanded Fig. 2d).
In current clinical practice, the treatment strategies for MBC often reference those for FMBC, primarily because the majority of MBC patients are hormone receptor-positive, similar to postmenopausal ER-positive and/or PR-positive FMBC28,29. Our research data shows, at the single-cell level, genes associated with FMBC, such as ESR1, PGR, and PRLR, are highly enriched in neuroendocrine-like cancer cells (Fig. 2b). Additionally, we found significant differences in the number of specific marker genes among different subtypes of cancer cells, with neuroendocrine-like cancer cells having the most abundant unique marker genes (expressed in over 75% of the population), while proliferation-like cancer cells did not exhibit population-specific marker genes (Fig. 2c and Supplementary Data 5). This phenomenon may be due to the high Shannon diversity within the proliferation-like cell population, leading to less specific expression of marker genes.
Furthermore, we calculated cell cycle scoring to assess the proliferative capacity of different subtype cancer cells, and the results showed that invasion-like cancer cells exhibited the strongest proliferative ability (Fig. 2d). To explore the representative biological functions of different subtype cells, we conducted Gene Ontology (GO) enrichment analysis. The analysis results indicate that neuroendocrine-like cancer cells are closely associated with regulating circadian rhythm events, which may suggest their involvement in inducing and/or promoting malignant events such as distant metastasis of tumor cells30–32 (Fig. 2e).
We defined four tumor-associated meta-module signal scores, including neurotrophic, angiogenesis, glycolysis, and fatty acid metabolism. To validate the biological and clinical significance of these four signal scores, we analyzed their correlation with male cancers in the TCGA pan-cancer dataset. The results showed that these four signal scores were associated with poor prognosis in male malignant tumors (Supplementary Fig. 3).
Next, we analyzed the expression levels of these signal scores in six different subtypes of cancer cells in MBC. We found that invasion-like and metastasis-like cancer cells mainly rely on glycolysis for energy metabolism, while neuroendocrine-like cancer cells primarily depend on fatty acid metabolism. Furthermore, EMT-like, proliferation-like, and neuroendocrine-like cancer cells, which are associated with tumor proliferation and metastasis, exhibited highly active neurotrophic and angiogenesis signals (Fig. 2f, Supplementary Data 8).
We further observed the spatial distribution of these signals within solid tumors and found that the neurotrophic, glycolysis, and fatty acid metabolism signals were more intense in areas with dense cancer cells, while the angiogenesis signal was widely distributed throughout the tumor tissue (Fig. 2g-m, Supplementary Fig. 4, Supplementary Fig. 5). At the single-cell level, neurotrophic and angiogenesis signals showed positive correlations among different subtypes of tumor cells (Fig. 2o, Extended Data Fig. 2e), and at the sample level, the intensity of neurotrophic signals was positively correlated with the number of cancer cells (Extended Data Fig. 2f). Previous studies33,34 and current evidence suggest that neurotrophic signals may be involved in regulating tumor angiogenesis and the proliferation of cancer cells in MBC.
To further validate these observations, we analyzed the correlation between neurotrophic and angiogenesis signals in 29 types of male cancers in the TCGA database, particularly finding significant correlations in breast cancer (BRCA), prostate cancer (PRAD), and pancreatic adenocarcinoma (PAAD) (Fig. 2p, q and Supplementary Fig. 6). High-intensity neurotrophic and angiogenesis signals were closely associated with poor prognosis in 14 types of male malignant tumors (Supplementary Fig. 7).
Pseudo-evolution tree of cancer cell Subtypes
To explore the evolutionary trajectories of cancer cells in MBC, we conducted pseudo-time analysis on the six cancer cell subtypes and constructed a pseudo-evolution tree. In the pseudo-evolution tree, we identified one main branch and four minor branches. The pseudo-evolution tree depicted that invasion-like, EMT-like, and metastasis-like cancer cells predominantly located in the main trunk of the pseudo-evolution tree, while neuroendocrine-like cancer cells highly enriched at the terminal branches (Fig. 3a), and proliferation-like cancer cells did not enrich at the main nodes of the pseudo-evolution tree. This finding may suggest that in the early stage of MBC, tumor development mainly focuses on local tissue invasion and begins to undergo EMT and metastasis. In the mid-term, it shifts towards to suppress immune cells for evading immune attacks, while in the late stage, it mainly transforms into Neuroendocrine-like cancer cells to regulate the cells in the TME. Additionally, we used the SComatic35 tool, a somatic mutations detection method from sequencing reads, to calculate the somatic mutation burden in different subtype cancer cells (Supplementary Data 9). The results showed that at the population level, the mutation burden of neuroendocrine-like cells was significantly higher than that of other subtypes (Fig. 3b), but there were differences in different tissue spaces (Extended Data Fig. 3a).
Further analysis revealed that among the six subtypes of cells, the top ten genes with the highest mutation frequencies include ACACA, which plays a crucial role in fatty acid biosynthesis, ESR1 closely related to FMBC, and UTY gene located on the Y chromosome (Fig. 3c and Extended Data Fig. 3b). Previous studies have confirmed the importance of cancer gender dimorphism and the stability of the Y chromosome in the prognosis of male cancer patients36,37. To explore the relationship between gene mutation burden on different chromosomes and different subtypes of cells in MBC, we analyzed the distribution of gene mutations on all chromosomes in different subgroups (Fig. 3d), and the gene mutation frequency in different chromosomes (Extended Data Fig. 3c). The results showed that neuroendocrine-like cells had the poorest chromosomal stability, while immune-like cells had relatively stable chromosomes. We particularly noticed that among the top ten genes with the highest mutation frequencies on the Y chromosome in MBC, UTY had the highest mutation burden, and the NLGN4Y gene related to neurotrophic factors also showed a high mutation burden in MBC (Fig. 3e). Further analysis on male-related malignant tumors (such as BRCA and PRAD) in TCGA revealed that the UTY gene may affect the occurrence and development of MBC by influencing the intensity of tumor angiogenesis signaling (Fig. 3f). This result suggests that the UTY gene may be a key risk factor in MBC patients, which is an important factor independent of previously discovered in FMBC.
Regulation of different subtypes of cancer cells by neuroendocrine-like cancer cells in MBC
We further analyzed the area distribution differences of different subtypes of tumor cells in MBC. In local areas, neuroendocrine-like, metastasis-like, and proliferation-like cells dominate, while in near areas, proliferation-like cells and EMT-like cells predominate. In lymph areas, immune-like and invasion-like cells are predominant (Fig. 4a). Additionally, the heterogeneity of cancer cells exhibits significant differences in different sample spaces. Invasion-like and immune-like cells show higher Shannon diversity in near areas, while neuroendocrine-like, metastasis-like, and proliferation-like cells exhibit higher diversity in lymph areas (Fig. 4b). EMT-like cells show the highest diversity in local areas.
To explore how the concentration of neurotrophic factors in tumor tissue alters the metabolic capabilities of tumor cells and promotes angiogenesis, we utilized the Compass38 tool, a single-cell metabolism profiling algorithm, to conduct metabolic functional analysis of cancer cell subpopulations in different tissue compartments. The results indicate that, in MBC, cancer cells from different subgroups exhibit higher activities in amino acid metabolism, steroid metabolism, glycolysis, and fatty acid metabolism in local areas (Fig. 4C, Extended Data Fig. 4a, Supplementary Data 10), consistent with previous research findings suggesting a high enrichment of fatty acid metabolism signals in tumor-dense regions. We speculate that this phenomenon may be associated with the highly enriched neurotrophic signals and angiogenesis signals in the tumor regions of MBC. Therefore, we analyzed the cell-cell communications among these six subtypes of cancer cells to identify the pathways involved in their interactions. The result suggested that in MBC samples, neuroendocrine-like cancer cells extensively produce and receives neurotrophic signals (such as NRG and NPY signals) and growth factor-related signals (such as FGF signals), promoting the development of the population. Additionally, neuroendocrine-like cancer cells also synergistically promote the process of EMT in cancer cells, through the SEMA3 neurotrophic pathway and the angiogenesis-related pathway ANGPTL (Fig. 4d, Extended Data Fig. 4b-f).
We found that these signaling pathways of cell-cell communications exhibit spatial distribution differences, with the SEMA3 signaling pathway highly enriched in the core region with high cancer cell density, while the ANGPTL signaling pathway is highly enriched in the peripheral region with low cancer cell density (Fig. 4i, Extended Data Fig. 4g, Supplementary Fig. 8). Regardless of the spatial distribution of these signals, their intensity consistently increases as they approach tumor cells (Figure j, k, Extended Data Fig. 4h-j). Further analysis revealed that the average intensities of SEMA3 and ANGPTL signaling pathways among MBC samples are highly positively correlated with the proportion of EMT-like cells (Fig. 4l), further confirming the potential critical role of SEMA3 and ANGPTL signaling pathways in promoting the process of EMT in MBC. Validation results from male pan-cancer tissue samples from TCGA indicate that the intensities of SEMA3 signaling and receptor signaling are higher in metastatic male cancer patients than in non-metastatic patients (Fig. 4m). In the male pan-cancer metastatic population, the intensity of the SEMA3 signaling pathway is significantly associated with poor prognosis, especially in pancreatic adenocarcinoma (PAAD) and esophageal carcinoma (ESCA) (Fig. 4n-p).
Characterization of T cell subtypes in MBC and their cell-cell communications with cancer cells
Tumor-infiltrating T cells are core players in the TIME and are closely associated with clinical outcomes. Immune checkpoint blockade (ICB) has achieved tremendous success clinically, but its efficacy varies significantly across different types of cancers39. In order to gain a deeper understanding of the interactions between different subtypes of cancer cells and T cell in MBC, we conducted clustering analysis of T cells. We identified a total of 29 distinct cell clusters, categorized into 3 major T cell subtypes and 9 minor T cell subtypes, defined by their specific expression of marker genes (Fig. 5a, b, Supplementary Data 11). Among the minor subtypes are naïve T cells, cytotoxic T cells, Treg cells, NK/NKT cells, inhibitory T cells, exhausted T cells, and proliferating T cells. Notably, proliferating T cells account for the smallest proportion (0.43%), while naïve T cells are the most abundant (43.13%). We found significant differences in gene expression and location distribution of different T cell subtypes in MBC, for instance, lymph areas are predominantly enriched with naïve T cells; in local areas, minor T cell subtypes are more abundant, with proliferating T cells mainly concentrated (Fig. 5b, c). Within the major T cell subtypes, CD4, CD8, and NK/NKT cells exhibit distinct biological functions in different sampling areas, primarily determined by differential expression of their marker genes. In local areas, the three major T cell subtypes show similar expression patterns of NEAT1 and COX6C genes (Fig. 5d). Additionally, we conducted metabolic analysis of T cells in local areas and others (Supplementary Data 12), assessing differences in metabolic capabilities of different T cell subtypes in different areas. The results show that the metabolic capacity of the three major T cell subtypes in local areas is generally lower than in other areas, especially in terms of fatty acid metabolism and anaerobic glycolysis (Fig. 5e, Extended Data Fig. 5a). To explore the interactions between different cancers and T cells in MBC, we conducted cell-cell communications analysis of all T cells and six subtypes of cancer cells in different areas (Fig. 5f-q, Extended Data Fig. 5b-j). We found that in local and near areas, all six major cancer cell subtypes and all minor T cell subtypes exhibit strong intercellular interactions, while in lymph areas, cancer cells primarily interact with immune cells through the MIF signaling pathway, which shows strong signal strength in all three areas (Fig. 5f, j, n, Extended Data Fig. 5b, e, h). Furthermore, we observed that in local areas, immune-like cells play the role of "Sender" in cell-cell communications signaling pathways (Fig. 5i), while in near and lymph areas, they act as “Mediator” regulating the communication between other subtypes of cancer cells and T cells (Fig. 5m, q).
Neuroendocrine-like cells work with TAMs to drive EMT and angiogenesis in MBC
Tumor-associated macrophages (TAMs) play crucial roles in TIME40,41, participating in tumor progression, promoting angiogenesis, and immune suppression, closely associated with adverse prognosis and/or clinical events in solid tumor patients. To gain insight into the role of TAMs in MBC, we clustered TAMs into 21 distinct cell clusters and defined these clusters into six major subtypes of TAMs based on specific marker genes (Supplementary Data 13), including Mph SPP1+, Mph CCL4+, Mph MERTK+, Mph S100A9+, Mph STNM1+, and Mph CXCL9+. The results showed that in MBC, the proportion of Mph SPP1 + cells, associated with promoting tumor progression42, was the highest in TIME (total proportion 45.45%, 51.4% in local areas), while the proportion of Mph CXCL9 + cells, associated with inhibiting tumor progression43, was the lowest (total proportion 2.02%, 1.48% in local areas) (Fig. 6a, b). We further investigated the functional differences of subtype of TMAs and found that TMAs enriched different biological functions in different areas (Fig. 6c). Similarly, we found that the metabolic capacity of TAMs in local areas was generally lower than that in others (Supplementary Data 14); due to the small number of Mph CXCL9 + and Mph STNM1 + cells, they were not included in the inferred metabolic functions (Fig. 6d, Extended Data Fig. 6a). The interaction between TAMs and cancer cells greatly affects the occurrence of tumor-related malignant events. Therefore, we conducted cell-cell communications analysis of TAMs and cancer cells in local areas of MBC.
The results showed that six different subtypes of cancer cell promote the growth, proliferation, and differentiation of different subtypes of TAMs through CSF and GDF signaling pathways, especially Mph MERTK + cells (Fig. 6e, f, Extended Data Fig. 6b, c). In addition, tumor cells suppress TAMs through the MIF intercellular signaling pathway to achieve immune escape (Fig. 6g, Extended Data Fig. 6d). Interestingly, although TAMs send high-intensity TNF-related signals, these signals are not received by cancer cells in MBC (Fig. 6h, Extended Data Fig. 6e). In MBC, cancer cells promote the development of the TAMs community by secreting cytokines. Similarly, Mph MERTK + cells promote the proliferation and development of neuroendocrine-like cells by secreting the signal of IGF, and promote EMT of cancer cells through the HGF signaling pathway (Fig. 6i, j, Extended Data Fig. 6f, g). To validate this conclusion, we calculated the combined signal strength of HGF and IGF in the TCGA male malignant tumor patient dataset and found a strong correlation between the combined signal strength of HGF and IGF and the signal strength of neurotrophic signal and angiogenesis signal in metastatic and non-metastatic male malignant tumor patients, and the combined signal strength of HGF and IGF was closely related to the adverse prognosis of male malignant tumors (Fig. 6k-m, n).
Neuroendocrine-like cells act as signal distributors in the regulation of CAFs to promote cancer cell EMT and invasion
Cancer stroma plays a significant role in promoting tumor proliferation, recurrence, and treatment resistance44. Among all the stromal cells, in TIME, cancer-associated fibroblasts (CAFs) are abundant and closely associated with cancer progression45. To understand how CAFs regulate tumor cells in MBC, we clustered fibroblasts into 23 clusters and annotated four major subtypes of CAFs: inflammatory CAFs, vascular CAFs, EMT-like CAFs, and antigen CAFs (Fig. 7a, b, Supplementary Data 15). By analyzing the marker genes of these major CAFs subtypes, we calculated meta-module scores and found that, except for vascular CAFs, which displayed a more diverse meta-module, the meta-module expression of the other three CAFs subtypes was highly specific (Fig. 7c). Using ssGSEA, we compared the enrichment differences of marker genes of CAFs subtypes in biological signaling pathways in different areas and found that the impact of different environment on the functional expression of CAFs was more significant than the differences between subtypes (Fig. 7d). Additionally, we found that CAFs in near areas had stronger comprehensive metabolic capacity compared to those in local areas. This suggests that the density of cancer cells may affect the biological functions and metabolic capacity of CAFs (Fig. 7e, Extended Data Fig. 7a, Supplementary Data 14).
Further analysis of the cell-cell communications between CAFs and six different subtypes of cancer cells in MBC revealed that proliferation-like cancer cells affected the development of vascular CAFs through the EDN signaling pathway (Fig. 7f, Extended Data Fig. 7b), while vascular CAFs promoted the formation of cancer cell EMT process through the VEGF signaling pathway (Fig. 7g, Extended Data Fig. 7c). Simultaneously, immune-like cancer cells and EMT-like cells affected the development of CAFs community through the SPP1 signaling pathway (Fig. 7h, Extended Data Fig. 7d). Furthermore, CAFs community affected the proliferation and development of proliferation-like cancer cells, neuroendocrine-like cells, and other cancer cells through the TWEAK signaling pathway and NPY signaling pathway (Fig. 7i, j, Extended Data Fig. 7e). In the cancer stroma microenvironment of MBC, the NPY signal produced by CAFs and other tumor cells is mainly received by neuroendocrine-like cells and transmitted to other subtypes of cancers cells (Fig. 7k).