Human participants recruitment and sampling ICU Cohort
PBMC and plasma samples were collected longitudinally from cohorts of moderate COVID-19 (non-Intensive care) and severe (Intensive care) COVID-19 and severe (Intensive care) Sepsis patients, and healthy age-matched healthy controls (Table 1). Hospitalized cases diagnosed as COVID-19 were enrolled by physicians using clinical manifestation and PCR test results. Sepsis patients were enrolled by physicians using clinical manifestation based on sepsis-3 criteria. Age-matched control subjects were collected at Osaka University Hospital and Osaka General Medical Center. Patients with COVID-19 were further categorised by the WHO ten-point clinical progression scale, (Table 1) [73].
Additionally, PBMC and plasma samples were collected from a cohort of healthy donors during a vaccination time-course with the Pfizer BioNTech BNT162b2 SARS-CoV-2 vaccine (BNT162b2) (Table 1). Samples were collected from August 2020 to May 2021 at Osaka University Hospital and Osaka General Medical Center. This study followed the principles of the Declaration of Helsinki and was approved by the institutional review board of Osaka University Hospital (permit nos. 907 and 885 [Osaka University Critical Care Consortium Novel Omix Project; Occonomix Project]). Informed consent was obtained from the patients or their relatives and the healthy volunteers for the collection of all blood samples.
Peripheral blood sample processing
Peripheral blood mononuclear cells (PBMCs) were isolated from fresh blood samples using Ficoll-Paque density gradient centrifugation, resuspended at 1-2E6 cells/ml Cellbanker 1 (Takara Bio) before being subject to controlled freezing at -80°C in a CoolCell device (Corning) and later stored in gas-phase liquid nitrogen.
Mass cytometry antibody production.
We obtained Indium-113 and -115 and gadolinium-157 (Trace Sciences) for conjugation to antibodies. X8 polymer MaxPar kits (Standard Biotools) were used to conjugate Indium and lanthanide isotopes, while MCP9 polymer kits were used to conjugate Cadmium isotopes according to the manufacturer’s instructions. Platinum-labeled antibodies were conjugated with cisplatin as previously described [74]. Conjugated antibodies were stored in a PBS-based antibody stabilizer or HRP-protector stabilizer for cadmium labeling (Candor Biosciences). All antibodies were titrated using control PBMCs to obtain optimal staining concentrations. Antibody staining panels were either prepared fresh, or prepared in bulk and stored as aliquots at -80°C.
CD45 barcoding and cell staining for mass cytometry.
Samples were measured across 21 experimental runs. The 21st run used an updated panel for B cells that included tetramers for SARS-CoV-2 spike protein.
Frozen PBMC samples were defrosted in a 37°C water bath for 2 minutes and poured into a 15-ml tube and 5 ml of prewarmed RPMI containing 10% FCS and 20 IU/ml Pierce Universal Nuclease for Cell Lysis was added. Samples were then washed with the same buffer without resuspending, then resuspended in 2 mL of CyFACS buffer, and live cells counted by Acridine Orange/Propidium Iodide staining on a LUNA-FL fluorescence cell counter (Logos Biosystems). In each experiment, up to 4E6 viable cells per sample were labelled with a seven choose-three pattern of anti-CD45 barcodes (89Y, 113In, 115In, 194Pt, 195Pt, 196Pt, and 198Pt) to give a combination of up to 35 barcoded samples per experiment, including a spike in control used to monitor batch effect (Figure S1B) [75]. Barcoding antibody aliquots in CyFACS were prepared in bulk and stored at -80°C. Samples were incubated with CD45 barcodes together with FC-block and anti-CXCR5 biotin (in run 21 a direct anti-CXCR5 antibody was used to avoid interference with streptavidin-biotin B cell tetramer conjugates) in CyFACS buffer (PBS with 0.1% BSA and 2 mM EDTA) for 30 min at room temperature (RT), and then washed once by adding 5 mL CyFACS buffer, centrifugation at 400g and the supernatant was poured off. The barcoded cells were resuspended in 700 μl CyFACS buffer, pooled and filtered through a 70 μm filter (Miltenyi) to remove dead cell debris. At this stage 5-10% cells were transferred to a 15 mL tube (Lineage stain tube, total PBMCs used for lineage proportions across CD45+ PBMCs). The remainder of the cells were separated into a CD3+ fraction (T stain tube) and CD3-depleted fraction (non-T stain tube) using a magnetic bead-based CD3 positive selection kit (StemCell). Cells in each tube were stained with separate surface and intracellular panels for T cells (CD4, Tfh, Treg, CD8 and γδT), non-T cells (Monocytes, DCs, B and NK cells), and total lineage proportions (panel information Table S2). Premixed and frozen master mixes were used to reduce batch effects. For the 21st experiment, only the non-T stain tube was stained using panel for B cells.
Cells were stained with the respective surface stain antibody cocktail for 45 min at RT. The cells were then washed twice in CyFACS buffer (400g, 5 min, RT), stained using 500 nM Zirconium based viability stain [76] in PBS for 5 min at RT, washed, and then fixed and permeabilized using the Foxp3 Transcription Factor Staining Buffer Set according to the manufacturer’s protocol (eBioscience). The cells were subsequently stained with the respective intracellular antibody cocktail for 45 min at 4°C and then washed twice (centrifugation at 800g, 5 min, 4°C) in CyFACS buffer and once in PBS. The cells were then fixed overnight in 2% formaldehyde (Thermo) in PBS containing 1 μM DNA Cell-ID Intercalator-103Rh (Standard Biotools).
Preparation of B cell tetramers for mass cytometry staining
B cell tetramers were prepared in a similar way to previous reports [77, 78]. Briefly, biotin-labelled SARS-CoV-2 wild-type full-length Spike (Miltenyi), wild-type RBD domain or B.1.1.529/OmicronRBD (Acro Biosystems) that had been reconstituted at 100-200 μg/mL in PBS10 were mixed with Streptavidin (SA)-FITC, SA-PE or SA-APC (Biolegend) respectively in three steps separated by 15 minutes to give a final 4:1 molar ratio of protein:SA and total incubation time of RT for 45 min. Tetramer staining mix was prepared by adding each tetramer to CyFACS with 2.5 μM Biotin to saturate unbound streptavidin and minimise probe cross-reactivity. Following barcoding and mixing of cells, they were stained with the three tetramers (wtSpike-FITC, wtRBD-PE and B.1.1.529/OmicronRBD-APC) at RT for 45 min in a total volume of 550 μl. Then cells were washed three times in CyFACS buffer (400g, 5 min, RT) following which the staining protocol was identical to that for experiments 1-20 but using the B-cell staining panel (Table S2).
Mass cytometry data acquisition
Cells were washed (800g, 5 min, 4°C) once in 5 mL CyFACS buffer and twice in 2 mL Cell Acquisition Buffer (CAS, Standard Biotools). The cells were diluted to 1.1E6 cells/mL in CAS with 15% EQ Four Element Calibration Beads (Standard Biotools) and passed through a 35-μm filter immediately before collection. Cells were run at 200 to 500 cells/s on a Helios mass cytometer (Standard Biotools), or CyTOF XT. Flow Cytometry Standard (FCS) files were normalized to EQ beads and concatenated using Standard Biotools software before export.
B cell in vitro plasmablast differentiation assay
The B cell in vitro plasmablast differentiation assay was performed using frozen PBMC from healthy Leukopak donors (StemCell). For each assay, ~1E8 cells in liquid nitrogen were thawed as above except 50 mL tubes were used for the initial thawing and washing step. 10% of PBMC was set aside for sorting of non-B cells while 1% of PBMC was set aside on ice for the whole PBMC control. The remainder was enriched for B cells using the EasySep™ Human B Cell Isolation Kit (StemCell). Enriched B cells were stained with a panel of fluorescently labeled antibodies (Table S1) and total PBMC were stained with anti-CD19 and anti-IgD to allow sorting of non-B cells (CD19-IgD-). Cells were stained for 30 min at RT and washed twice in 5 mL PBS + 10% FCS (PBS10), then resuspended in 1-2 mL of PBS10 and sorted at 4°C on a BD FACSymphony S6 cell sorter (BD). B cells were sorted into six different populations (Figure S7D) after which non-B cells were sorted. Sorted cells were transferred from 5 mL FACS tubes into 15 mL tubes and spun down. Up to 18 conditions were included in the B cell assay. The base media for the B cell assay (RPMI + 10% FBS + Pen/strep (cRPMI) with 1 μg/mL anti-CD3 antibody (Biolegend), 10 U/mL IL2 (Immunace35, Shinogi) and 10 ng/mL IL-1β (Peprotech). For further stimulation of B cells, some conditions were supplemented with a cocktail of 10 ng/mL IL-10, 10 ng/mL IL-4 and 50 ng/mL IL-21 (Peprotech) added on day 4 of the assay. To prepare the assay, in 96-well U bottom plates (Corning #163320), 100 μl of media (containing 2x concentration of stimulants) was added to wells followed by 100 μl of cells in cRPMI. For the whole PBMC condition, 1E5 of the whole PBMC (100 μl at 1E6 cells/mL) was added to respective wells. For B cell assay conditions, sorted non-B cells were added to each well (9E4 cells in 50 μl) followed by sorted B cells (1E4 cells in 50 μl) yielding an approximately physiological ratio of 10% B cells to 90% non-B cells in 200 μl per well. Plates were incubated at 37°C 5% CO2 for six days. On the fourth day, 50 μl of media was aspirated from each well and replaced with 50 μl cRPMI or cRPMI with 4x cocktail for the day 4 cocktail condition.
On day 6 samples were stained and measured by CyTOF with the following modifications. Cells were pipetted out of wells into 15 mL tubes, spun down and first labeled with a six choose-three pattern of anti-CD45 barcodes (89Y, 113In, 115In, 194Pt, 195Pt, and 196Pt) to give a combination of up to 18 barcoded samples per experiment with 198Pt to be used for dead cell staining. To allow incorporation of puromycin, IdU and BrU into cellular macromolecules [55], the staining protocol included a 30 min pre-stain at 37°C in cRPMI at which time cells were also stained for chemokine receptors (CCR7, CXCR3, CXCR5) (Table S2). Cells were then stained with surface and intracellular panels. B cell assay samples were run on a CyTOF XT using suspension mode in CAS+ buffer with 10% 6-element EQ beads (Standard Biotools). Flow Cytometry Standard (FCS) files were normalized to EQ beads and concatenated using Standard Biotools software before export.
Mass cytometry data analysis
FCS files were uploaded to Cytobank software (Beckman Coulter) and gated as positive for DNA, negative for EQ beads, below a threshold on the Zr live/dead stain (or 198Pt negative in the B-cell assay), with normal ion cloud Gaussian parameters (Figure S1A), before being manually debarcoded based on the CD45 barcodes (Figure S1A, B) and exported as FCS files.
Firstly, FCS files were imported into R (v.4.3.0) via flowCore (v.2.12.2) and batch corrected (SOM size 10 by 10, arcsinh 5 transformed) with cyCombine (v.0.2.15)[79]. Spike in samples in every experiment were used to confirm the accuracy of batch correction. In the T stain tube, CCR2, which was not in the panel for experiments 1 and 2, was imputed onto these two batches using the salvage_problematic() function of cyCombine. In the Lineage stain tube, CD33, CD95, CXCR4, CD27, IgD, CD56, Helios, Granzyme B, TCRγδ, CD38, HLA-DR were imputed from experiments 2-20 onto experiment 1 using the CyTOF 2 panels vignette of cyCombine.
Following batch correction samples were exported as FCS files and then re-imported to R and converted to single cell experiment format (SCE) and analysed with CATALYST (v.1.24.0) [37, 38] as previously described [34]. Briefly, data was then compensated using a compensation matrix derived from a compensation panel of single-metal-labeled compensation beads. Data was clustered using flowSOM (using a 10 by 10 SOM and subsequent meta-clustering) using a multi-step process. The first step employed a limited set of markers optimised to separate into the main immune lineages. Secondly, each lineage population was sub-clustered to obtain fine cell sub-types. Detailed cell type annotation was based on prior knowledge combined with analysis of heatmaps and UMAPs (downsampling to 1000 cells per sample and 40 nearest neighbours). In the case of CD4 cells, CXCR5+Foxp3- Tfh were also handled as a subcluster to allow their in-depth analysis.
Non-T stain tube data for experiments 1-20 was clustered to obtain B cells, NK cells and Myeloid & DCs. For B cells, the panel for experiments 1-20, which was designed for clustering all CD3- cells, was different from that of experiment 21, which was focussed specifically on B cells and tetramers. However, we sub-clustered the B cell data from experiments 1-21 together using only shared markers after batch correction with cyCombine using the CyTOF 2-panels vignette. For simplicity we merged the CD73+ or CD73- Naive and CD73+ or CD73- Classical B cell types previously identified by Glass et al [13]. To calculate isotype proportions, the B cell SCE was reclustered using only IgG, IgA, IgD and IgM and isotype proportion per phenotypic cluster was calculated.
Differential abundance testing among clusters was performed with diffcyt (v.1.20.0) in DA mode using edgeR (v.3.42.4) and default settings. Prior to plotting, clusters with 0% abundance were set to 0.01% for display on Log scaled plots. Expression levels of markers were tested for statistical significance using Kruskal-Wallis followed by Dunn’s with Holm adjustment with rstatix (v. 0.7.2).
ForceAtlas2 Visualization
For visualisation of B-cells by forceatlas2, 5000 cells per cluster were used to generate a KNN edge matrix in vortex software [80], the resulting exported XML file containing edge and node lists was imported into Gephi (v.0.10.1) network analysis software for layout by the Forcealtas2 algorithm using, Tolerance = 1, Approximate repulsion = 1.2, scaling =1 , gravity =4 and dissuade hubs ON [39].
CITE-seq
CITE-seq was performed on 12 samples: three COVID-19 day 1 samples, three Sepsis day 1 samples, and samples from three vaccine cohort donors after their second and third vaccine dose. PBMC for each sample were thawed as above and cells were barcoded with TotalSeq-C Hashtag antibodies. Cell pellets were resuspended in a total of 100 μl barcode mix (0.5 μl hashtag antibody in PBS10) and stained for 30 minutes at 4°C. Cells were then washed twice in 5 mL PBS10 (400g, 5 min, 4°C), resuspended in 1 mL PBS10 and then mixed into a single tube and centrifuged (400g, 10 min, 4°C).
The cell pellet was resuspended in a total of 200 μl staining mastermix containing fluorescently labelled sorting antibodies, TotalSeqC antibodies and two freshly prepared SARS-CoV-2 wild-type full-length Spike tetramers (Table S2) and stained for 30 minutes at 4°C. Cells were then washed twice in 5 mL PBS10 (400g, 5 min, 4°C) and sorted on a FACSaria III (BD) into 1.5 mL tubes. IgD- B cells (CD19+IgD-CD4-CD14-) (post sort viability >95%) were resuspended at 700 cells per μl and submitted to the Osaka University Biken NGS Core for library preparation and sequencing.
Single-cell RNA library construction and sequencing
Single-cell suspensions were processed with a 10x Genomics Chromium Controller using the Chromium Next GEM Single Cell 5' Kit v2, Chromium Next GEM Chip K Single Cell Kit and Dual Index Kit TT Set A according to the manufacturer’s instructions. Library preparation and sequencing was done by loading approximately 16,500 live cells per well on the Chromium controller to generate 10,000 single-cell gel-bead emulsions (5 wells total). Oil-encapsulated single cells and barcoded beads (GEMs) were reverse-transcribed on a Veriti Thermal Cycler (Thermo). The resulting cDNA tagged with a cell barcode and unique molecular index (UMI) was amplified to generate single-cell libraries, which were quantified with an Agilent Bioanalyzer High Sensitivity DNA assay (Agilent). The amplified cDNA was enzymatically fragmented, end-repaired, and polyA tagged followed by cleanup and size selection using SPRIselect magnetic beads (Beckman-Coulter). Next, Illumina sequencing adapters were ligated to the size-selected fragments followed by another round of cleanup. Finally, sample indices were selected and amplified, followed by a double-sided size selection using SPRIselect magnetic beads.
For VDJ repertoire profiling, full-length VDJ regions were enriched from amplified cDNA by PCR amplification with primers specific to the Ig constant regions using the Chromium Single Cell Human BCR Amplification Kit and enriched VDJ segments were used for VDJ-library construction. After final library quality assessment using an Agilent Bioanalyzer High Sensitivity DNA assay, samples were sequenced on an Illumina NovaSeq 6000 as paired-end mode (read1: 28bp; read2: 91bp).
Preprocessing of single-cell RNA sequencing data
Data was pre-processed using the CellRanger multi pipeline (version 7.0.0). Analysis was done in R (v.4.3.0) using the Seurat package (v.4.3.0). Since samples were split across multiple GEM wells, a well ID was first added to the barcodes followed by sample assignment based on Hashtags using the HTODemux() function. Data was then sent through the following QC pipeline. The data was filtered for hashtag singlets and the following QC metrics (500 > sample > 8000 mRNA features per cell, < 10% mitochondrial reads, > 5% ribosomal reads and < 3000 ADT counts per cell). Ribosomal and mitochondrial genes were then removed along with the MALAT1 gene. Data was then normalised according to recommended settings with RNA (method = LogNormalize, scale.factor = 10000) and ADT (method = CLR, margin = 2). Cell cycle was then assigned using CellCycleScoring and data was scaled regressing out the %mitochondrial and cell cycle difference (S score minus G2M score) from the RNA data. This resulted in a Seurat object with 28491 QC-passed cells (10341 for COVID-19, 2296 for Sepsis and 15854 for Vaccine samples).
Analysis of single-cell RNA sequencing data
FindVariableFeatures() was used to identify the list of top 3000 variable genes. IGHV, IGKV and IGLV immunoglobulin genes were removed from this list. Seurat (v.4.3.0) multimodal clustering of the GEX (mRNA) and ADT data was performed. Principal component analysis (PCA) was run on the GEX data using the variable features and another PCA was run on all ADT features. FindMultiModalNeigbors() was then run using PCs 1-6 of GEX and PCs 1-4 of ADT at k=50 nearest neighbours. The wnnUMAP was made and data was clustered using FindClusters with SLM algorithm and resolution = 2. The 24 resulting clusters were then manually merged and annotated to give clusters shown (Figure 4B). SARS-CoV-2 wtSpike tetramer positive cells (total: 1042) were annotated as ≥ 0.2 in both the PE and APC ADT channels (Figure 5B). Differential expression between populations was performed with FindMarkers() at default settings.
Anchor transfer
Anchor transfer of the datasets [52] was done with Seurat (v.4.3.0) in R (v.4.3.0) using a supervised PCA made from the wnn graph, and k.filter set to NA. IgD+ cells were removed from the query data, as the reference was sorted for IgD- cells (RNA cutoff: 0.1, protein cutoff: 0.5).
RNA velocity analysis
Count matrices of unspliced and spliced RNA was generated using velocyto (v.0.17.17) alignment to hg38 in zsh (v.5.9) and merged with loompy (v.3.0.6) in python (3.9.16). RNA velocity was calculated using the dynamic model in scVelo (v. 0.2.5) with variable features pre-defined in the seurat analysis, 50 principal components, 50 neighbors, and 10 shared counts. scVelo was run in Jupyter notebook (v.7.0.0) using the following dependencies: anndata (v.0.9.1), scanpy (v.1.9.3), numpy (v.1.21.1), scipy (v.1.10.1), pandas (v.1.5.3), scikit-learn (v.1.2.2), matplotlib (v.3.7.1), python-igraph (v.0.10.6), louvain (v.0.8.0), pybind11 (v.2.11.1), hnswlib (v.0.6.2), ipython (v.8.14.0), tqdm (v.4.65.0), and ipywidgets (v.8.0.7). For use in python, the seurat object was converted to h5ad using SeuratDisk (v.0.0.0.9020) in R (v.4.3.0).
Analysis of BCR data
The Immcantation [81] and scRepertoire (v. 1.11.0) [82] workflows were used for analysis of BCR data. In scRepertoire the ‘strict’ definition of clonotype was used for all analyses. The Immcantation workflow starts by running igblast (version 1.21.0) using Python scripts provided by Immcantation to give the “filtered_contig_igblast_db-pass.tsv files”. Given that the data spanned 5 GEM wells, a well ID was appended to the cell_id and sequence_id columns, after which the data was concatenated, therefore allowing cells found in both the GEX and BCR datasets to be matched by their cell barcode.
Briefly, data was filtered for productive BCRs, and cells with none or multiple heavy chains were removed (yielding 21300 cells with a single heavy chain). Then clonotyping was done with Scoper using an automatically-determined threshold. For calculating clonal overlap between Seurat clusters, clonotyping results were filtered for IgH and then metadata columns from the seurat object were attached giving 15103 cells found in both datasets (6692 for COVID-19, 1538 for Sepsis and 6873 for Vaccine), of which 460 were spike positive.
Somatic Hypermutation (SHM) on heavy chain data was then calculated using observedMutations(..., regionDefinition = IMGT_V) (Shazam) (IMGT database updated May 2023), yielding 20687 cells mapped to germline of which 14659 were also found in the GEX data (453 spike positive). Dowser (v.2.0.0) was then used to format and plot the germline-mapped cells as phylogenetic clone trees (build = “pml”) with metadata attached from the Seurat object.
Clonal overlap between Seurat clusters was calculated as the Jaccard index, which is the ratio of the number of intersecting clonotypes to the union count of clonotypes between two clusters. The raw count of intersecting clonotypes between pairs of clusters is also written on the heatmaps. IgH gene usage heatmaps by Seurat cluster were calculated using vizGenes() function in the scRepertoire pipeline.
Analysis of plasma antibody levels by Luminex and ELISA
During preparation of PBMC, plasma samples were also collected and stored at -80°C. Serum titers of SARS-CoV-2 antibodies were assessed using the Bio-Plex Pro Human IgG SARS-CoV-2 N/RBD/S1/S2 4-Plex Panel kit (BioRad) according to the manufacturer's directions on a BioPlex-200 machine (BioRad). Freshly thawed plasma samples were centrifuged (1000g, 10 min, 4°), diluted 1:1000 (4 μl into 196 μl then 5 μl in 95 μl), and run in duplicate. Background subtracted MFI duplicates were averaged and spearman correlations with cellular proportions from the corresponding PBMC samples were calculated with Corrplot v.0.92.
An Anti-SARS-Cov-2 SPIKE IgE ELISA kit (Matriks Biotech) was run according to the manufacturer’s directions using 1:100 and 1:500 dilution of each plasma sample. The same samples were run as for measuring SARS-CoV-2 IgG antibodies.