Tissue Selection Criteria and Ethical Approval
To capture a range of tissue microenvironment-associated characteristics, four patients with diverse clinical and pathological profiles were chosen. The selection included two cases of luminal type breast cancer (LBC), one HER2 2+/HR- breast cancer, and one case of triple-negative breast cancer (TNBC), classifications based on immunohistochemistry (IHC). The samples were collected post-treatment, subsequent to neoadjuvant chemotherapy or PD-1 immunotherapy. Among these patients, two presented with lymph node metastatic carcinoma characterized by extensive tumor infiltration and lymphatic invasion - one TNBC and one HER2 2+/HR-. The remaining two samples were collected from a recurrent LBC that had metastasized to the chest wall and from an LBC with lung metastasis, respectively. All specimens were formalin-fixed and paraffin-embedded (FFPE) and archived before the experiments. The utilization of all clinical materials received approval from the Ethical Review Board of the Cancer Hospital Chinese Academy of Medical Sciences (National Cancer Center), under clinical registration number 22/074-3275.
Clinical Charting and Histopathological Characterization of Breast Cancer Tissues
The clinical details for the samples included in this study are presented in Supp. Table 1. We assessed the universal histological markers typically evaluated in breast cancer tissues using immunohistochemistry (IHC), including the Estrogen receptor (ER), HER2, progesterone receptor (PR), and Ki-67. In addition, individual samples underwent further IHC analysis to evaluate androgen receptor (AR), GATA3, cytokeratin 5/6 (CK5/6), E-cadherin (E-cad), EGFR, P53, NapsinA, TTF-1, and PD-L1. The results of these histological assessments are shown in Supp. Table 1.
Pathological Evaluation Using Immunohistochemistry (IHC) Staining
Immunohistochemistry (IHC) was conducted on a Ventana Benchmark Autostainer (Roche Diagnostics) or a Bond automation system (Leica, Bond Max) using protocols established under clinical pathological standards. Tissue sections, 5µm thick, were initially heated to 75°C, deparaffinized in EZ Prep (Roche), and then dehydrated in a ethanol series (80%, 90%, 95%, 95%, 100%) (Beijing Yili Fine Chemicals). Antigen retrieval was performed using Cell Conditioning 1 (CC1, Roche) before the application of primary antibodies. The specific primary antibodies, detailed with clone numbers, were incubated for 16–28 minutes at optimized concentrations (ER: SP1, PR: 1E2, HER2: NEU from Roche, and Ki-67: MIB-1 from Gene Technologies). Additional primary antibodies are listed in Supp. Table 2. A horseradish peroxidase (HRP)-conjugated secondary antibody (Roche) was applied subsequently, followed by colorimetric amplification with DAB chromogen and hematoxylin counterstaining. Experienced pathologists conducted all histological assessments, and representative images were captured at 100x and 200x magnification using a KFBio imaging station.
Spatial Profiling Using Xenium In-Situ Technology and Subsequent Hematoxylin and Eosin (H&E) Staining
FFPE tissues of 5µm thickness were sectioned and arranged on Xenium slides (10X Genomics, PN-1000460) within specified sample areas (10.45mm x 22.45mm). The sections were deparaffinized in xylene, rehydrated through a graded ethanol series, and then treated with permeabilization enzyme to allow efficient RNA exposure (Perm Enzyme B, 10X Genomics 3000553). This was followed by overnight hybridization with Xenium DNA probes (Xenium Human Breast Gene Expression Probes 2000826, detailed in Supp. Table 3) at 50°C using a thermocycler (Bio-Rad C1000). After hybridization, the padlock DNA probes bund to target mRNA were ligated at 37°C (Xenium Ligation Enzymes A/B: 2000397/2000398) for 2 hours. Circularized DNA probes for each gene were PCR-amplified in situ by rolling circle amplification (RCA), generating multiple copies of target-specific DNA barcodes. Gene decoding was performed on the Xenium instrument (10X Genomics). Essentially, the workflow takes a cyclic approach and, in each cycle, fluorescence-labeled probes with complementary nucleotide sequences for individual targets react with RCA products, and signals are x, y, z registered. DAPI nucleus staining is used to allocate individual cells and for cell segmentation purposes using a built-in machine learning algorithm. As the fluorescence-guided decoding process takes place, recorded four-colored signal combinations are generated, counted and used to construct a gene-to-count matrix used for downstream processing.
Samples were immediately proceeded with H&E staining after the Xenium decoding process. In general, samples were washed in Mill-Q water for 2 mins and stained in hematoxylin (MHS16, Sigma Aldrich) for 20 mins, blued in bluing reagent (CS702, Dako) for 1 min, and then eosin (3801615, Leica) for 30 secs. Samples were finalized, dehydrated and covered with coverslips for digital imaging (Pannoramic MIDI, 3D Histech) at 40x magnification.
Raw Data Handling, Quality Control, Preprocessing, and Dimension Reduction
Raw data from the Xenium instrument's output, including feature-cell matrices and cell boundary files, were integrated into the Seurat V5 pipeline. During quality control (QC), we aimed to retain a broad dataset, setting the inclusion criteria at a minimum of 5 gene features per cell and genes expressed in at least 5 cells across the tissue. The gene count upper limit per cell was determined based on the distribution of the number of features per cell. We assessed gene expression complexity by calculating the log10 transformed gene feature count divided by the log10 UMI count (log10GenesPerUMI), setting the acceptable range from 0.6 to 0.985. The expression matrix underwent log normalization following the Seurat procedure. FindVariableFeatures from Seurat was utilized for the automated selection of highly variable genes, identifying 252 such genes in our dataset. Data were then z-scored using the ScaleData function. For inter-sample integration, we applied the robust principal component analysis (RPCA) method via the IntegrateData function. Post-merging, the data underwent another z-score transformation to prepare for downstream processing. Dimension reduction was performed through t-Distributed Stochastic Neighbor Embedding (t-SNE) or Uniform Manifold Approximation and Projection (U-MAP), using the default Louvain clustering algorithm. To delineate clusters, the top 20 principal components were extracted, and 34 clusters were defined at a resolution of one, employing the FindNeighbors and FindClusters functions.
Spatial Single-Cell Referencing, Phenotyping, and In-Situ Projection
Cell type annotation was performed according to a pre-defined marker gene list (detailed in Supp. Table 4). We initially annotated the main cell types: breast cancer epithelium using markers such as EPCAM, KRT5, KRT6B, ERBB2, GATA3, and EGFR; breast glandular cells using SERPINA3 and DSC2; immune cells using markers PTPRC, TRAC, and CD3E; fibroblasts using SPARC, PDGFRA, and CCDC80; smooth muscle cells using PDGFRB, MYH11, and ACTA2; and endothelial cells using PECAM1, VWF, and AQP1.
Subsequent immune cell re-clustering, based on the top 15 principal components at a resolution of 0.8, resulted in 23 identifiable clusters. These clusters were annotated as T cells (CD3E, CD3G), B cells (CD19, CD79A, MS4A1), macrophages (CD68, CD163), myeloid cells (ITGAM), plasma cells (TNFRSF17), monocytes (CD14, FCGR3A), natural killer cells (GNLY, NKG7, NCAM1), dendritic cells (CD83, FCER1A), neutrophils (CEACAM8), and mast cells (CPA3, CTSG, TPSAB1). To delineate T cell subsets further, we annotated them as CD4 + T cells (CD4), CD8 + T cells (CD8A, GZMB), regulatory T cells (FOXP3, IL2RA), naïve T cells (SELL, CCR7), effector T cells (PRF1, CCL5), effector-memory T cells (IL7R, EOMES), exhausted T cells (LAG3, TIGIT), and proliferative T cells (MKI67). For in-situ gene annotation, the feature-cell matrix and cell boundary files were utilized, and following the above described cross-sample data merging, genes of interest were plotted using ggplot2. The same approach was applied for cell-type annotation.
Gene Regulatory Network Analysis Using Cytoscape
We analyzed genes co-regulated with ESR1, PGR, and ERBB2 using data imported from the STRING database, excluding any unconnected entities. The Cytoscape plugin CytoHubba was employed to identify key regulatory networks, with the degree of connectivity score serving as a metric to determine key nodes within these networks. The color intensity of the nodes corresponds to their connectivity scores, while the intensities of edges reflect the confidence scores between linked nodes.
Cellular Neighborhood Analysis of Breast Cancer Samples
For the cellular neighborhood (CN) analysis, we defined a cell neighbor window as the 10 cells nearest to each anchor cell, which is a cell designated as central within its respective neighborhood (15, 37). Each CN was quantified as a collection of cell types, with frequencies summed to 100% based on our in-situ cell annotations. We performed hierarchical clustering of CNs across all cells in a tissue using K-means clustering within R (version 4.2.0), setting the number of clusters (K) to 10. The prevalence of each CN type within individual samples was normalized so that the sum across the tissue equaled 100%, thereby providing an estimation of the proportional representation of each CN type across the entire tissue sample.
Region-of-Interest Selection Criteria Based on H&E and Xenium Data
Regions exhibiting neoplastic characteristics and adjacent stromal areas, serving as controls, were delineated in all samples using hematoxylin and eosin (H&E) staining by a qualified pathologist. For spatial analysis, 20 ROIs were chosen from tumor-enriched areas and 10 from immune-stromal areas in each sample, resulting in a total of 120 ROIs with sizes ranging from 14,751 to 313,038 µm^2. We also referred to paired IHC data for Ki-67 and HER2, in addition to H&E staining, to accurately identify the tumor-immune interface for subsequent ROI selection. Spatial x-y indices of individual ROIs were exported from the cell boundaries files, generating an expression matrix for each ROI. Those data were compiled and processed in the same way as described above.
Spatial Cell-Cell Interaction, Distance-Based Cellular Distribution, and Spatial Enrichment Analysis
To assess cell-cell associations at the pan-tissue level, we calculated the average expression levels of individual genes across different cell types as annotated by Xenium, comparing gene-wise correlations among all pairs. For interactions within specific ROIs, we employed the CellTrek R package, utilizing its SColoc module to estimate spatial compositional differences between samples using Kullback-Leibler divergence (KL). SColoc creates a minimum spanning tree (MST) to approximate spatial cellular adjacencies, with graphical representations generated through its Graphical User Interface (GUI). In these graphs, edges represent the strength of inter-cell type associations (with a threshold set at 0.3), while colors and sizes denote cell type and frequency within each ROI, respectively.
For profiling based on spatial distance, we defined each anchor cell type according to Xenium annotations and mapped the distribution of other cell subsets within a 100µm radius (38, 39). To assess spatial enrichment of single-cell subsets, we first determined the proportions of individual cell types within specific ROIs (denoted as Props-in-ROI). We then calculated the mean proportion of these cell types across all ROIs (MeanProps-in-ROI) for defined phenotype groups, such as responders and non-responders.
We further calculated the proportion of the same cell subsets across the entire tissue sample, referred to as Prop-in-slide. We then calculated the enrichment score (ESscore) as the ratio of MeanProps-in-ROI to Prop-in-slide (40). Additionally, we computed the standard deviation (SD) of cell proportions across the ROIs (SDProp-in-ROI). To assess the consistency of cell type distribution across different ROIs, we calculated the ratio of MeanProps-in-ROI to SDProp-in-ROI, yielding an estimate of inter-ROI variation (ESV). For visualization plotting, we also calculated the log-transformed proportional ratio (Log(ESscore)) as Log10(MeanProps-in-ROI/Prop-in-slide).
Spatial Ligand-Receptor Analysis Using CellPhoneDB
The CellPhoneDB R package was employed to interrogate Xenium expression profiles within predefined tumor-enriched ROIs (41). Available ligand-receptor pairs with normalized expression data were extracted, including PTPRC-MRC1, CXCL12-CXCR4, CD86-CTLA4, CD80-CTLA4, and CD274-CD80. A permutation test was conducted to assess the co-expression levels of selected ligand-receptor pairs and their co-occurrence probabilities relative to a null distribution generated from random pairs. The results were then plotted for individual cell types. Significance was established at a p-value threshold of 0.05, with results displayed as the negative logarithm to the base 10 of the p-value (-log10(p-value)).
In-Situ Gene Imputation Test Using Xenium Data
To assess the efficacy of current in-situ expression inference methods, we analyzed various input data formats. Single-cell RNA-seq data, serving as the reference, were obtained from the 10X Genomics preview dataset for human breast, which can be accessed at 10X Genomics Xenium in-situ Preview Dataset for Human Breast. Additionally, in-house generated Xenium in-situ breast cancer data, derived from this study, were utilized. Both normalized and non-normalized (raw) data formats were tested; for the latter, outputs were further processed with or without normalization transformation. Two principal spatial gene expression prediction algorithms were employed: SpaGE, which relies on k-nearest-neighbor regression, and Tangram, which is predicated on a deep-learning approach (42, 43). The predictive capacity of these algorithms was tested by split the Xenium in-situ expression data into three random subsets, using two-thirds for model training and the remaining third to validate the predictions against actual Xenium expression data. The accuracy of these in-situ gene prediction methods was evaluated by comparing predicted and true expression values across selected ROIs, with the Pearson correlation coefficient serving as the measure of expression correlation.
Statistical Analysis Methods
All statistical analyses were conducted in R, version 4.2.0, employing specific packages as needed. The primary basis for comparing sample groups was treatment efficacy, categorized as PD-1 responders versus non-responders. Analyses incorporated cell proportional ratios, cell distance metrics, and frequencies within cellular neighborhoods (CN). A non-parametric Wilcoxon rank-sum test was utilized to determine statistical significance, with the following thresholds: not significant (n.s.) for p > 0.05, * for p < 0.05, ** for p < 0.01, *** for p < 0.001, and **** for p < 0.0001. Bar graphs illustrate the mean expression with standard deviation (SD), while box-and-whisker plots depict the 25th to 75th percentiles, with whiskers extending from the minimum to the maximum values, including all data points.