Patient cohort
This study was performed at the Department of Gastroenterology and Hepatology of the University Medical Centre Groningen, The Netherlands. Patients with ulcerative colitis (UC) underwent ileocolonoscopy to determine the extent of active disease before the start of treatment with vedolizumab (VDZ). We included patients with histologically confirmed active left-sided disease or pancolitis (warranting the initiation of VDZ treatment), and excluded patients with exclusively right-sided disease in order to make evaluation of treatment response by sigmoidoscopy possible. No minimal Mayo score was applied for inclusion, resulting in the inclusion of patients with proctitis and mild pancolitis, both underrepresented in previous clinical trials. Furthermore, we included UC patients with prior exposure to anti-tumor necrosis factor alpha (anti-TNFα) medication as well as anti-TNFα naive patients. We collected clinical information on sex, age and disease duration (at the time of endoscopy), disease location, anti-TNFα exposure, concurrent IBD medication and prior IBD-related surgeries (Table 1). For numeric variables, a standard 2-sample t-test was performed and for categorical variables a chi-squared test of independence was applied. All included patients underwent initial endoscopy at least 2 weeks before the start of VDZ treatment (pre-VDZ) as well as follow-up endoscopy at week 14 (post-VDZ). VDZ was administered at week 0, 2, 6 and 14. Patient clinical care was conformed to standard Dutch treatment protocols (https://www.mdl.nl/richtlijnen). All patients provided written informed consent through GE-ID IRB (NL58808.042.16) or Parelsnoer IRB (NL24572.018.08).
Definition of response to VDZ
For each patient, response to VDZ was evaluated at week 14 (Supplementary Table 2). The main response outcome was considered Physician Global Assessment (PGA) since it incorporates clinical response, endoscopic response and biochemical measurements, such as C-Reactive Protein, hemoglobin, and fecal calprotectin. Response by PGA was scored by three independent gastroenterologists (one treating, and two independent); in cases of dispute, the majority vote was considered. Furthermore, individual scores for clinical response, endoscopic response and mucosal healing were recorded for all patients. Clinical response was defined as a decrease in Simple Clinical Colitis Activity Index (SCCAI) by at least 3 points post-VDZ with no reported rectal bleeding.36 Endoscopic response was defined as a decrease in total Mayo score by at least 3 points post-VDZ in the last three segments of the intestinal tract. Mucosal healing was recognized when a maximum of one segment of the intestinal tract received a maximum Mayo score of 1 during post-VDZ endoscopy. Since SCCAI scores showed high heterogeneity, and week 14 is a very early time point to confidently call endoscopic response or mucosal healing37, patients were stratified into responders and non-responders to VDZ based on PGA for all downstream analyses.
Sample collection and cryopreservation
At both time points (pre- and post-VDZ), mucosal biopsies (3 biopsy bites per sample) and peripheral blood were collected. If applicable, an additional peripheral blood sample was collected at the start of VDZ treatment (week 0). When possible, paired biopsies from inflamed and non-inflamed regions of the colon were collected from the same patient (Supplementary Table 1). Biopsies were collected in RPMI 1640+GlutaMax (Thermofisher Scientific) supplemented with 10% fetal calf serum (FCS) (Thermo Fisher Scientific) at 4 °C, and were preserved in cold freezing medium containing 90% FCS and 10% dimethyl sulfoxide (DMSO) (Sigma-Aldrich) after collection. The procedure of cryopreservation was performed as previously described.38,39 Biopsies were stored at -80°C overnight and then were transferred to liquid nitrogen for long-term storage. The peripheral mononuclear blood cells (PBMCs) were isolated from whole blood collected in CPT Vacutainer tubes (BD) according to standard protocol.40 PBMCs were cryopreserved in freezing medium (RPMI 1640 supplemented with 40% FCS and 10% DMSO), kept overnight at -80°C and then also stored in liquid nitrogen. We aimed to limit the handling time of each biopsy and PBMC sample from collection moment till cryopreservation to 1 hour.
Preparation of single cell suspensions
Gut mucosal biopsies
Gut mucosal biopsies were dissociated into single cells using the 'one-step collagenase' protocol, as described previously.39 In short, cryopreserved biopsies were quickly thawed at 37°C and washed twice with 10 mL of ice-cold thawing solution (phosphate-buffered saline (PBS-/-) with 5% FCS). For tissue digestion, 3 biopsy bites for each sample were incubated for 30 minutes at 37°C in 2.5 mL enzymatic digestion solution – RPMI 1640+GlutaMax supplemented with 2% FCS, 200 iU/mL collagenase IV (Sigma-Aldrich), 10 iU/mL DNAse II (Sigma-Aldrich), 100 U/mL SUPERase-In RNase inhibitor (Thermo Fisher Scientific) while shaking at 700 rpm. Enzymatic dissociation was quenched by adding 1 μL/mL 0.5M EDTA (Thermo Fisher Scientific) and placing the tubes on ice for 2 min. Biopsies were then washed in 10 mL PBS and shortly incubated in 100 μL TrypLE express enzyme (Thermo Fisher Scientific) at 37°C. After that, 2 mL cold RPMI/2% FCS was added, cells were spun down at 350 g for 5 min, resuspended in 2 mL cold wash buffer (PBS supplemented with 0.4% Bovine Serum Albumin Solution (BSA) (Sigma-Aldrich) and 10 iU/mL DNase II) and filtered through a 70 μm strainer (Corning) into a new tube. Single cells were washed twice in ice-cold PBS/0.4% BSA, spun down at 350 g for 5 min and resuspended in 1 mL ice-cold PBS/0.4% BSA.
PBMCs
The isolated cryopreserved PBMCs were quickly thawed at 37°C and slowly transferred to a 15 mL tube. Then pre-warmed RPMI 1640 was added drop-by-drop while doubling the volume in stepwise manner until the total volume of 14 mL. PBMCs were centrifuged at 300 g for 5 min, resuspended in 2 mL pre-warmed RPMI 1640 and placed in the incubator at 37°C for 1 hour to let the cells rest. After that, cells were washed twice in 2 mL RPMI/0.04% BSA, filtered through a 40 μm strainer if needed, spun down and resuspended in 1 mL RPMI/0.04% BSA.
Cell counting and cell viability
Single gut mucosal cells and PBMCs were manually counted using Trypan Blue reagent (Sigma). If cell suspension contained cell aggregates, it was once more filtered through a strainer and washed. If cell viability of a sample was < 80%, cells were washed two more times with a corresponding wash buffer. All cells were kept on ice on a rocking platform until further processing.
Multiplexing by cell hashing
A total of 400,000 cells from each sample (biopsy and PBMC) were transferred to a new low-bind tube and labeled with barcoded antibodies (also known as cell hashing) using TotalSeq oligo-conjugated hashtag antibodies (Biolegend) according to the manufacturers’ protocol.41 In short, cells were first incubated with 5 μL Human TruStain FcX (Biolegend) for 10 min at 4°C, then with 0.5 μg hashtag antibody for 30 min at 4°C on a rocking platform. Next, cells were washed with 1.4 mL of Cell Staining Buffer (Biolegend), then washed with ice-cold PBS/0.4% BSA to remove redundant hashtags, spun down at 350 g for 5 minutes and resuspended in 100 μL ice-cold PBS/0.4% BSA. A maximum of 8 samples were processed at a time and each was labeled with a unique hashtag antibody: TotalSeq™-A0251 anti-human Hashtag 1 (cat.no 394601), TotalSeq™-A0252 anti-human Hashtag 2 (cat.no 394603), TotalSeq™-A0253 anti-human Hashtag 3 (cat.no 394605), TotalSeq™-A0254 anti-human Hashtag 4 (cat.no 394607), TotalSeq™-A0255 anti-human Hashtag 5 (cat.no 394609), TotalSeq™-A0256 anti-human Hashtag 6 (cat.no 394611), TotalSeq™-A0257 anti-human Hashtag 7 (cat.no 394613) or TotalSeq™-A0258 anti-human Hashtag 8 (cat.no 394615). The leftover unhashed cells were kept on ice on a rocking platform for further preparation for spectral flow cytometry.
10x Genomics library preparation and sequencing
The single-cell RNA sequencing (scRNAseq) libraries were prepared using Chromium Single Cell 3ʹ Reagent Kits v3 (10x Genomics, PN-1000075). Maximum of four samples were pooled/multiplexed per chip channel (Chromium Single Cell B Chip Kit, 10x Genomics, PN-1000153) aiming for a total recovery of 12,000 cells per library. Each sample was loaded into two separate chip channels, so that 8,000 cells could be recovered per sample. For each four-sample pool, two types of libraries were prepared: (1) a cDNA library according to the 10x manual CG000185 Rev D, and (2) a hashtag-derived (HTO) library according to the manufacturer’s protocol (https://www.biolegend.com/en-us/protocols/totalseq-a-antibodies-and-cell-hashing-with-10x-single-cell-3-reagent-kit-v3-3-1-protocol). All libraries were indexed using Chromium i7 Multiplex Kit (10x Genomics, PN-120262) and sequenced on an MGISEQ-2000 (BGI, Hong Kong), featuring 28bp, 8bp, 100bp (read 1, index, read 2) paired-end reads. The cDNA libraries were sequenced at 70,000 reads/cell and HTO libraries at 5,000 reads/cell.
High-dimensional spectral flow cytometry
The leftover unhashed single gut mucosal cells and PBMCs were used for surface staining with a custom Cytek Immunomonitoring 38-antibody flow cytometry-based panel. The panel was designed on the basis of previously described 40-color panel42, which now included markers to detect integrins. Full panel description (including vendors and catalog numbers) is shown in Supplementary Table 3. All used antibodies were titrated to determine the optimal working concentrations for mucosal cells and PBMCs separately. Cells were stained according to the published protocol.42 In brief, cells were washed with cold PBS, incubated with Live Dead UV Blue (1:40 dilution, Thermo Fisher Scientific) for 15 min in the dark. Then cells were washed with PBS/1% BSA, resuspended to a total volume of 100 μL and incubated with 10 μL Brilliant Stain Buffer Plus (BD Biosciences), 5 μL Human BD Fc Block (BD Biosciences) and 5 μL True-Stain Monocyte Blocker (Biolegend) for 10 min. After, cells were incubated with anti-TCRγδ for 10 min in the dark, followed by incubation with anti-CCR7 and anti-CD183 for another 10 min. The rest of the antibodies in the panel were added to the cells in the following manner: first, non-brilliant blue (BB), non-brilliant violet (BV) and non-brilliant ultraviolet (BUV) antibodies were added as a master mix; next, BB, BV and BUV antibodies were added as a separate mix; lastly, anti-immunoglobulin antibodies (anti-IgG, anti-IgD and anti-IgM), anti-CD16, followed by anti-HLA-DR and anti-CD39 (multicolor (MC) panel variation) or anti-CD18 and anti-CD29 (MC-integrin panel variation) were added to cells one by one. Once all antibodies were added, cells were incubated for 30 min in the dark, washed with 3 mL of PBS/1% BSA, spun down at 400 g for 6 min and resuspended in PBS/1% BSA to the final volume of 200 μL. Appropriate single stain controls were prepared using cells or UltraComp eBeads™ Plus Compensation Beads (Thermo Fisher Scientific). Samples were immediately measured using the 5-laser CytekTM Aurora spectral cytometer. Compensation and unmixing were performed using the SpectroFlo software.
Transcriptomic data analysis
Data pre-processing and clustering
Preparation of single cell suspensions, scRNA library generation and sequencing were performed in two batches (8 patients in batch 1 and 17 patients in batch 2). For all mucosal cells and PBMCs, FastQ files were used as an input for Cellranger (v.3.1.0) to perform filtering, barcode counting, and alignment. The human GRCh38-3.0.0 reference genome was used for alignment. Souporcell (v.1) doublet detection was performed on the alignment data.43 Once the Seurat object (satijalab.org) was generated, HTO metadata were integrated and data were filtered for doublets based on the HTO assignment. In cases where a cell was HTO-negative, the Souporcell assignment was used instead. Since we did not observe a strong batch effect, batch 1 and 2 objects were subsequently merged without any further batch correction (Extended Data Fig.1). The generated data were normalized through the SCTransform method implemented in Seurat and filtered for cells with a fraction of mitochondrial genes (MT) higher than 70% for mucosal cells or higher than 15% for PBMCs. Next, principal component analysis (PCA) was performed considering the first 30 components through the RunPCA function in Seurat. Finally, FindNeighbors and FindClusters functions (default parameters) were used to identify cell-based clusters.
Cell type annotation
The Seurat object with mucosal cells was divided into three compartments: epithelial (EPCAM+), immune (PTPRC+), and stromal (THY1, SOX10, MADCAM1). A further filtering step (MT < 20%) was applied to the mucosal immune compartment. The Azimuth cell type annotation method in Seurat was used to automatically annotate cell types within each compartment. It relied on a reference single-cell dataset, where cells in the query dataset were labeled based on their closest projected reference cell (cell-to-cell approach). The mucosal cells were annotated based on the training dataset of Smillie et al.22, while the PBMC data used the human PBMCs Azimuth dataset.44 Additionally, biologically relevant cell types not present in the biopsy reference dataset were added by selecting cells with specific marker gene expressions. Specifically, paneth-like cells were added to the epithelial compartment (LYZ, REG3A, module score > 2). The plasma compartment was further subdivided into IgA, IgG, IgM, and Ig negative (IgA marker: IGHA1> 250, IgG marker: IGHG1> 80 and IgM marker: IGHM>900) and plasmacytoid dendritic cells (pDCs) cells were identified (ITM2C, PLD4, SERPINF1, LILRA4, IL3RA, TPM2, MZB1, SPIB, IRF4, SMPD3, module score > 0.6). Epithelial compartment contained a total of 71,547 cells and 16 cell types, while the mucosal immune compartment reached 68,809 cells and 25 cell types. Finally, a total of 18,832 cells and 13 cell types were identified within the stromal compartment. Using the same cell type annotation approach, a total of 95,057 PBMCs were profiled, and 30 cell types were identified.
Expression of integrins
To determine the differential expression of VDZ-targeted (ITGA4 and ITGB7) and untargeted (ITGAL, ITGAM, ITGAX, ITGB1 and ITGB2) integrins on peripheral and gut mucosal immune cells by scRNAseq, the MAST function as built into the Seurat R package (v.4.3.0) was used. All p values were Benjamini-Hochberg corrected for multiple testing. Additionally, we calculated the ratios of ITGB7+ immune cells in PBMCs relative to gut mucosa. To this end, we first derived the proportion of ITGB7+ cells in PBMCs and in the mucosa separately for each immune cell type, then proportion in PBMCs was divided by the proportion in the mucosa, and the negative log2 values were used to plot the resulting data as box plots. Statistical differences were tested using non-parametric two-tailed Wilcoxon Rank Sum Test (p < 0.05).
Cell abundances analysis
To quantify differential celltype abundances, Bayesian models were used to address the problem of the proportionality of the data and relatively low sample size. Specifically, we chose a Bayesian multinomial logistic-normal linear regression model called Pibble (fido package, v.1.0.2)45 considering results significant at credible intervals over or equal to 75%.46 To further ensure the robustness of our results, correction for covariates including prior anti-TNFα exposure, anatomical location of the gut of biopsy, and the presence of inflammation, was also performed. Additionally, we performed the non-parametric two-tailed Wilcoxon Rank Sum Test (p < 0.05) on the proportions of each cell type in different comparisons.
Differential gene expression analysis and pathway analysis
To determine the differential gene expression for selected cell types in peripheral and gut mucosal cells (scRNAseq data), the Seurat implementation of MAST was performed. All p values were Bonferroni corrected for multiple testing. Then. the identified statistically significant features were used as an input for Gene Set Enrichment Analysis (GSEA) with ClusterProfiler (v.4.4.4) which bases its gene ontology (GO) term indication on ‘org.Hs.eg.db’ database (Carlson M (2022). _org.Hs.eg.db: Genome wide annotation for Human_. R package v.3.15.0.).
Pseudobulk analysis
To enable the global assessment of gene expression patterns derived from our scRNAseq dataset of gut mucosal cells, we employed a pseudobulk analysis approach. This strategy involved aggregating the expression profiles of individual cells to generate a composite, or 'pseudobulk', transcriptomic profile. Two distinct pseudobulk datasets were created: one included all cells collectively, and the other exclusively included epithelial cells. The differential gene expression analysis between different “pseudobulk” replicates was performed using the MAST function as built into Seurat R package (v.4.3.0, Benjamini-Hochberg adjusted p < 0.05). By amalgamating the expression data in this manner, we aimed to investigate overarching gene expression patterns across all cells collectively, mitigating the noise and variability inherent to single-cell measurements and facilitating more traditional differential expression analyses.
Cell-cell interaction analysis
Cell-cell communication analysis was performed through the integration of two different computational methods. We first interrogated our data using the R package CellChat (v.1.4.0), a public database containing information on ligand-receptor interactions and able to provide a first overview of the single-cell interaction per specific condition.27 We specifically focused on the investigation of cell-cell communication in responders and non-responders at baseline.
Next, we performed cell-cell communication analysis using NicheNet through the R package nichenetr (v.1.1.0).28 NicheNet analysis provides a deeper understanding of intercellular interactions by using a background set of differentially expressed (DE) genes, which allows us to predict potential ligand-receptor interactions and determine if specific cell-cell interactions resulted in variations in downstream gene expression. The DE genes selection was performed based on the gene presence in at least 10% of the specific cell types, an absolute log fold change (LFC) of 0.25, and a Bonferroni p value adjusted of 0.05. During NicheNet analysis, we focused on specific target cells (inflammatory monocytes, inflammatory fibroblasts, and Tregs) that were treated separately as receiver and sender cells. Furthermore, we performed Nichenet pairwise analysis on a previously selected group of interesting cell-types, which comprises inflammatory monocytes, inflammatory fibroblasts, regulatory T cells (Tregs), , CD8+IL17+ (Tc17) cells, IgG+ plasma cells, natural killer cells (NKs), dendritic cells (DCs), microfold (M) cells and activated endothelial cells (ECs).
Analysis of high-dimensional spectral flow cytometry data
Data pre-processing
The acquired flow cytometry data in the form of FCS files were imported to the OMIQ analysis platform. No additional compensation or scaling of the data was applied. We based our pipeline on the provided workflow in the OMIQ tutorial (https://www.omiq.ai/guides/cytometry-analysis-tutorial). The data was overlaid with the relevant patient and sample metadata.
Cell type annotation
The data were manually gated to exclude doublets, dead cells, aggregates and CD45- cells. In biopsies, 29 samples were selected and all cells (n = 797,670) were used for downstream analysis. In PBMCs, 48 samples were selected with a target count of 10,000 selected cells per file, leading to 480,000 cells for analyses. Afterwards, a Uniform Manifold Approximation and Projection (UMAP) algorithm was run to produce two-dimensional embeddings of the data for visualization. Data were clustered using FlowSOM. Metaclusters were grouped into populations using a combination of surface marker expression profiles, heatmaps based on cell type defining markers (CD38, CD19, CD27, CD8, CD14, CD56 and CD4) and gating templates which were designed according to known gating strategies.42,47 Gut mucosal biopsies were annotated in three large immune compartments: myeloid cells, B cells, and T cells combined with NKs. Similarly, PBMCs were annotated at low resolution into four compartments: myeloid cells, NKs and ILCs, B cells, and T cells. This pipeline was repeated for each subset to obtain the final cell type annotation using a combination of computational and manual gating strategies (Extended Data Fig.2a, Supplementary Table 3). Integrins (α4, αE, αL, αX, β1, β2 or β7) were gated on their single positive expression on all cell types (Fig. 2a). To avoid individual user bias, cell type annotation was confirmed by three authors independently using Kaluza and OMIQ.
Cell abundances analysis
Cell types were aggregated into populations that corresponded with the cell type annotations derived from the scRNAseq data. The quantities of each cell type population, both pre-VDZ and post-VDZ, were subsequently tabulated. These tallies across diverse subsets were then amalgamated into a singular dataframe, in preparation for subsequent analysis. Additionally, metadata corresponding to the time point and PGA response were cataloged. Given the compositional nature of the data, a combined method involving Pibble with credible intervals over or equal to 75% (fido package, v.1.0.2) and the non-parametric two-tailed Wilcoxon Rank Sum Test was employed. For the two-tailed Wilcoxon Rank Sum Test, statistically significance was determined at a p value threshold of < 0.05. There was no threshold imposed on the cell count number for the analysis.
Integrin surface presence on immune cells
To examine the alteration in integrin expression on peripheral blood lymphocytes and colonic immune cells under VDZ therapy by Cytek, the median fluorescence intensities of single integrin-expressing cells were analyzed. Specifically, positive populations of cell types expressing individual integrins were selected from both the peripheral blood and colonic immune cells, guided by blanco staining controls. Subsequently, median fluorescence intensities of these integrin-expressing cells were compared using the non-parametric two-tailed Wilcoxon Rank Sum Test between responders and non-responders pre-VDZ treatment and 14 weeks after. Only results with p value < 0.05 were considered statistically significant. The ratios of β7+ cells in PBMCs relative to the gut mucosal cells were calculated by dividing the proportion of β7+ PBMCs by β7+ mucosal cells for each shared immune cell type. The negative log2 values were obtained, and non-parametric two-tailed Wilcoxon Rank Sum Test (p < 0.05) was applied.
Immunohistochemistry
To confirm the macrophage-like identity of inflammatory monocytes, we performed CD68 and Galectin-3 immunohistochemical stainings of UC colonic mucosa. Sequential 4µm paraffin slides from resected colonic tissue of a UC patient were deparaffinized, followed by antigen retrieval with a Citrate buffer pH6 and blocking of endogenous peroxidase with 0.3% hydrogen peroxidase (Sigma-Aldrich). Afterwards the slides were incubated with PBS + 1% BSA (Sigma-Aldrich) for 30 min, followed by incubation with a primary antibody – Galectin-3 (1:500 dilution, Santa Cruz) or CD68 (1:100 dilution, Dako) for 1 hour in the dark. Appropriate secondary and tertiary horseradish-peroxidase labeled antibodies were used for both stainings and diluted with a 1:50 ratio in staining solution (PBS + 1% BSA + 1% human serum). To complete the staining, a solution of DAB (Sigma-Aldrich) and 0.03% peroxidase was used, followed by a counterstaining of hematoxylin and mounting with Eukitt (Sigma-Aldrich). Slides were digitally scanned using a Nanozoomer (Hamamatsu).
DATA AVAILABILITY
Once the manuscript is published, the raw single-cell RNA-seq data (FASTQ files), processed spectral flow cytometry data (FCS files), anonymized Seurat objects and basic phenotypes, will be accessible upon request at the European Genome-Phenome Archive (EGA). The accession EGA code will be provided after acceptance for publication. In addition, the processed data will be made available upon request to the reviewers during the reviewing process.
CODE AVAILABILITY
Open source codes and scripts used for the analyses or figures are available at the GitHub repository (https://github.com/WeersmaLabIBD/Response_to_vedolizumab_in_UC)