Participant enrollment. Adult patients (n = 121) diagnosed with rheumatoid arthritis (RA) according to the American College of Rheumatology/European League Against Rheumatism (ACR/EULAR) 2010 classification criteria 34 were recruited from Peking University People’s Hospital, Beijing, China. Healthy volunteers (n = 99) with no history of inflammatory arthritis and rheumatic diseases were enrolled. All the participants were Chinese and the vast majority of them were Han nationality. Subjects were excluded if they 1) had underwent tonsillectomy; 2) had any current infection (especially in the oropharynx); 3) were suffering from serious diseases (e.g., cancer, heart failure, renal failure); 4) had taken antibiotic treatment or probiotic supplements in the three months prior to sample collection; 5) were strict vegetarians, alcoholics or subject with other unusual dietary modes; 6) were pregnant or lactating women. Detailed information of the cohort was given in Supplemental Tables 1 and 3.
Sample collection and DNA extraction. The tonsillar microbiota samples were collected by following the procedure of Human Microbiome Project (http://hmpdacc.org/doc/HMP_MOP_Version12_0_072910). Briefly, sterile cotton swabs, pre-moistened in sterile saline, were inserted into the oral cavity and rubbed around the mucosal surface of palatine tonsil for 5 seconds without touching the uvula, tongue, or other oral structures by using of a tongue depressor. The sampling was performed twice in the left and right tonsils, respectively. The right and left sides were pooled together as a combined specimen. To detect possible contamination, several negative controls were prepared and subjected to the same procedures. Immediately after swabbing, each swab must be swirled in a garnet bead tube containing 750 µl buffers (MoBio Laboratories, Inc., Carlsberg, CA, USA). The swab sponge should be pressed against the tube wall multiple times for 20 seconds to ensure transfer of bacteria from swab to solution. Put the tubes in a Ziploc bag and place over ice. The samples were placed at -80°C until DNA extraction.
Genomic bacterial DNA was extracted using the MoBio PowerSoil® DNA Isolation Kit 12888-100 protocol (MoBio Laboratories) according to the manufacturer’s recommendations. DNA concentrations were determined using the Qubit® 3.0 fluorescent quantitation kit (Thermo Fisher Scientific, Waltham, MA, USA). Extracted DNA was stored at -80°C prior to sequencing.
16S rRNA gene sequencing and analyses. The V3-V4 hypervariable regions of the 16S rRNA gene were amplified (RA = 121, healthy controls = 99). PCR reactions were performed by using unique fusion primers designed based on the universal primer set, 338F (5’-GTACTCCTACGGGAGGCAGCA-3’) and 806R (5’-GTGGACTACHVGGGTWTCTAAT-3’), along with barcode sequences35. The primers were designed incorporating the Illumina adapters and a sample barcode sequence. Triplicate PCR reactions were performed for each sample and visualized on 2% agarose gels. PCR amplicons were purified using Agencourt AMPure XP kit (Beckman Coulter, Inc, Brea, CA, USA) and quantified by Qubit® 3.0 fluorescent quantitation kit (Thermo Fisher Scientific), then pooled at equimolar amounts using the Illumina TruSeq Sample Preparation procedure. The amplicon library was constructed following the manufacturer’s instructions (Illumina, San Diego, CA, USA) and quantified by KAPA Library Quantification Kit KK4824 (KAPA Biosystems, Woburn, MA, USA). The completed library was sequenced on an Illumina MiSeq (Illumina, San Diego) platform using dual-index sequencing strategy according to the Illumina recommended protocol 36.
16S rRNA amplicon sequences were processed based on the quantitative insights into microbial ecology (QIIME2, https://qiime2.org/) platform 37. Briefly, the forward and reverse reads were truncated at the 260th and 250th base, respectively, to avoid sequencing errors at the end of the reads. Noisy sequences, chimeric sequences and singletons in sequencing data were eliminated using DATA2 38. Denoised paired-end reads were joined, setting maximum mismatch parameter of 6 bases, which means a minimum similarity threshold of 90% on the overlap zone of the forward and reverse reads. The representative sequences (named “feature” in QIIME2 nomenclature) were defined at 100% similar merged sequences, after then low occurrence (n < 5 in pool samples) sequences were removed. We used the term “operational taxonomic unit (OTU)” instead of “feature” in the whole article for convenience. Then, taxonomy of OTUs was identified using classify-sklearn classification methods based on the Greengenes v13.8 database 39 via the “q2-feature-classifier” plugin. We aligned the OTU sequences into the SILVA rRNA gene databases (release 138)40, and the mapped OTUs with > 98.5% similarity and > 90% coverage was considered as species level annotation. The phylogenetic analysis was performed in QIIME2 with “qiime alignment mafft”, “qiime alignment mask”, and “qiime phylogeny fasttree” commands. Lastly, all samples were rarefied to an even sampling depth of 20,000 sequences, when calculating the OTU and taxa relative abundances. To capture major changes across the whole cohort, we focused on common taxa out of all mapped ones, excluding those that were present in less than 0.01% in relative abundance.
Alpha and beta diversities were calculated in QIIME2 platform with “qiime diversity core-metrics-phylogenetic” command. The Shannon’s diversity index, observed OTUs, Faith’s phylogenetic diversity (a qualitative measure of community richness that incorporates phylogenetic relationships between the OTUs) and Pielou’s evenness were used to reflect community richness and evenness. The Jaccard distance, Bray-Curtis distance, unweighted and weighted UniFrac distances, were implemented to assess the similarity or dissimilarity between individuals.
Whole-metagenome shotgun sequencing and bioinformatic analyses. The fresh genomics DNA samples (RA = 32, HC = 30, including 13 RA and 11 HC from the former cohort) were mechanically fragmented to ~ 400 bp with Bioruptor Pico (Diagenode, Belgium). A magnetic beads-based method was used for DNA fragments selection following a standard protocol (Agencourt AMPure XP). Libraries were prepared by using the NEBnext® Ultra™ II DNA Library Prep Kit for Illumina® (New England BioLabs). The Illumina HiSeq X platform was then used for 2 × 150 bp paired-end whole-metagenome sequencing (WMS). Initial base calling of WMS data was performed based on the system default parameters under the sequencing platform. The quality control used the following criteria: (1) reads were removed if they contained more than 3 ‘N’ bases or more than 50 bases with low quality (< Q20); (2) no more than 10 bases with low quality (< Q20) or assigned as N in the tails of reads were trimmed. The remaining reads were then mapped to the reference human genome (GRCh38) using the Centrifuge algorithm41, to remove host DNA contamination.
A de novo gene catalogue was constructed based on the metagenomic data from the tonsillar samples of all individuals. High quality reads were used for de novo assembly via MEGAHIT 42, which generated the initial assembly results based on different k-mer sizes and then merged and extended them using the parameter “-precorrection”. Ab initio gene identification was performed for all assembled scaffolds taking advantage of MetaGeneMark 43. The predicted genes were then clustered at the nucleotide level by CD-HIT (version 4.5.4) 44, and genes sharing greater than 90% overlap and greater than 95% identity were treated as redundancies. A non-redundant gene catalogue was generated after removing the redundancy genes.
The abundance of genes in the non-redundant gene catalogue was quantified as the relative abundance of reads. First, the high-quality reads from each sample were aligned against the gene catalogue based on SOAP v2.21 45 using a threshold that allowed at most two mismatches in the initial 32-bp seed sequence and 90% similarity over the whole read. Then, only two types of alignments were accepted: (1) those in which the entirety of a paired-end read could be mapped onto a gene with the correct insert size; (2) those in which one end of the paired-end read could be mapped onto the end of a gene only if the other end of the read mapped outside the genic region. The relative abundance of a given gene in a sample was finally estimated by dividing the number of reads that uniquely mapped to that gene by the length of the gene region and by the total number of reads from the sample that uniquely mapped to any gene in the catalogue. The resulting set of gene relative abundances for all samples was termed a gene profile. To capture major changes across the whole cohort, we focused on common genes out of all mapped ones, excluding those that were present in less than 0.01% in relative abundance.
The Kyoto Encyclopedia of Genes and Genomes (KEGG) database was used for functional annotation of genes using Blast KOALA (v2) 46. Each protein was assigned a KEGG orthologue (KO) on the basis of the best-hit gene (BLASTP e-value < 1e-10 and protein level similarity > 30%) in the database. For KO profiling, we used KO assignment of each gene from the original gene catalog and summed the relative abundance of genes from the same KO to yield the abundance of that KO. The relative abundance of a module was calculated from summation of the relative abundance of its corresponding KOs.
S. salivarius K12 culture. S. salivarius K12 (ATCC®BAA1024™) was grown in Todd-Hewitt broth (THB, Hopebio, Qingdao, China), then serial dilutions were inoculated onto M17 agar medium (Hopebio, Qingdao, China) and incubated at 37°C in a 5% CO2 atmosphere47. The bacterial plaque that plump, smooth and distributed independently on the plate was chosen to move into a centrifugal tube with sterile probe, and was shaken to dissolve sufficiently. Then repeat the above steps, and the first generation of the strain was obtained and identified by 16S rRNA gene sequencing.
Experimental arthritis induction. Male DBA/1 mice (6–8 weeks old) were purchased from Huafukang Co. Ltd. (Beijing, China) and fed under specific pathogen-free conditions. Experimental arthritis induction was established in the same way as collagen-induced arthritis (CIA) by following a previously published protocol48. Briefly, DBA/1 mice were immunized intradermally at the base of the tail with 200 µg of bovine type II collagen (CII, Chondrex, Redmond, WA, USA) or different microbial peptides (A-R-EVLDS-R-GNPTVE, IYA-R-EVLDS-R-GNPTIE, and LGSMGESRRALQDSQR) emulsified in complete Freund’s adjuvant (Sigma-Aldrich, St Louis, MO, USA) in equal volumes. Three weeks later, a booster was delivered using 100 µg CII r different microbial peptides emulsified in Freund’s incomplete adjuvant (Sigma-Aldrich, St Louis, MO, USA). Mice were monitored for signs of arthritis after the booster immunization. Clinical score was assessed by using the following system as detailed previously48: 0, normal; 1, erythema and swelling of one or several digits; 2, erythema and moderate swelling extending from the ankle to the mid-foot (tarsals); 3, erythema and severe swelling extending from the ankle to the metatarsal joints; and 4, complete erythema and swelling encompassing the ankle, foot and digits, resulting in deformity and/or ankyloses. The scores of all four limbs were summed, yielding total scores of 0–16 per mouse.
Experimental arthritis intervention. We randomized mice into treatment groups. S. salivarius K12 was inoculated intra-nasally (108 cfu/mice), intra-orally (108 cfu/mice), combined intra-nasally with intra-orally (108 cfu/mice) or intragastrically (109 cfu/mice) three times a week. Lactobacillus salivarius (ATCC 11741) was administrated both intra-nasally and intra-orally (108 cfu/mice). Mice treated with sterile phosphate buffer solution (PBS) served as controls. The preventive group started on day 1, and the therapeutic groups commenced after the onset of CIA (approximately day 26).
Radiography evaluations and histological analyses. The paws from each mouse were collected and fixed in 4% paraformaldehyde (PFA) for 48 hours, then scanned using a Micro-CT scanner (Quantum FX, Caliper, USA). After that, the paws were decalcified in 5% EDTA, paraffin-embedded, sectioned and stained with haematoxylin and eosin (H&E). A microscopic assessment of sagittal sections was performed and histopathological changes were scored based on the following previously reported parameters 49: 0, normal synovium; 1, synovial membrane hypertrophy and cell infiltrates; 2, pannus and cartilage erosion; 3, major erosion of cartilage and subchondral bone; and 4, loss of joint integrity and ankylosis. The scores of all four limbs were summed and divided by 4, yielding average scores of 0–4 per mouse.
Cytokine and auto-antibody detection. At the study endpoints, mice were euthanized and serum samples were collected for cytokine and auto-antibody detection. The concentrations of cytokines (IL-6 and IL-10) were measured using ELISA kits (Multisciences), and the titer of collagen-specific antibody was analyzed using a mouse anti-bovine type II collagen IgG antibody assay kit (Chondrex). Serum levels of Streptococcus enolase peptides were tested by indirect ELISA. In detail, chemosynthetic peptides (10 µg/ml in 0.05 M carbonate buffer) were coated in the 96 well plates (Corning, New York, USA) at 4˚C overnight. Then wash the 96 wells with PBS plus 0.05% Tween-20 (PBST) three times and block them with 5% BSA-PBST at 37˚C for 3 hours. All serum samples were diluted at 1:100 with 1% BSA-PBST, and 100 ul diluted samples were added to the 96-well plate and incubated 1 hour at 37˚C. 1% BSA-PBST was used as control nonspecific background. After incubation, the wells were washed four times with 0.05% PBST. Then, 100 ul of goat anti-mouse IgG conjugated to peroxidase (1:10000 in 1% BSA-PBST) was added to the wells and incubated at 37˚C for 40 minutes. The bound antibodies were tested by tetramethylbenzidine (TMB) as substrate after washing four times. This reaction was terminated by adding 50 ul of 2 M sulfuric acid to the wells. The absorbance density (OD) was read by a Bio-Rad plate reader at 450 nm and 630 nm. The OD values were converted into arbitrary units (AU) values, which was calculated as follows: AU = [(OD peptide – OD nonspecific background) test serum/(OD peptide - OD nonspecific background) positive control serum]×100. The AU value was considered positive if it was greater than the mean + 3 × standard deviation of the control group.
Flow cytometry. The spleens and joint draining lymph nodes (popliteal and axillary lymph nodes, DLN) were obtained from mice, sieved through a 70 µm cell strainer (Corning) in RPMI 1640 medium with 10% FBS and single-cell suspensions (106 cells/100µl) were prepared for flow cytometry. For intracellular cytokine analysis, cells were stimulated with 25 ng/ml PMA plus 1 µg/ml ionomycin for 4–5 hours in the presence of 10 µg /ml brefeldin A (PeproTech). Cell surface markers were first stained, and the cells were then fixed and permeabilized with an intracellular staining buffer set (Thermo Fisher Scientific) following the manufacturer's protocol and stained with intracellular or intranuclear markers. Antibodies were purchased from BD Biosciences (San Jose, CA, USA), eBiosciences (Thermo Fisher Scientific) or Biolegend (San Jose, CA, USA). Th1 cells were defined as CD4+IFN-γ+, Th17 cells as CD4+IL-17A+, Treg cells as CD4+CD25+FOXP3+, Tfh cells as CD4+Foxp3-CD44+CXCR5+PD-1+Bcl6+, GCB cells as B220+CD4−Fas95+GL-7+. Flow cytometry was performed using FACSAriaTMII (BD Biosciences) and the data was analyzed using FlowJo v10.0.7 software (Tree Star Inc., Ashland, OR, USA).
Determination of S. salivarius K12 abundance. Genomic DNA was extracted from the oropharyngeal mucosa using a Protease K Fasting Burst Technique. The DNA concentrations were evaluated spectrophotometrically. The quantity of S. salivarius K12 in the samples was estimated by Quantitative real-time PCR (qPCR) with SRBR Premix Ex TaqTM (2×) (Takara, Japan) and FTC-3000TM Real-Time Quantitative Thermal Cycler (Funglyn, Shanghai, China). All qPCR reactions were run with 3 replicates per DNA. Standard curves were set up by serially diluting plasmid of a pMD18-T vector (Takara, Japan) with the appropriate insert from 107 to 102 target K12 gene copies/µl. The standard curve was obtained using linear regression of threshold cycle numbers (CT) versus log copy numbers of targets. S. salivarius K12-specific primers were as follows: forward, 5’-AAGGGAGAATGATTGCCATGAA-3’; reverse, 5’-GAGTTTGGACAGTCATCAGTAATAGTTG-3’. As TaqMan® probe, K12Taq was designed as follows: 6-FAM-5’-AGAGGTACAGGTTGGTTTG-3’-MGB 50. Q-PCR reactions were performed in a 25 µL reaction volume containing 12.5 µL SRBR Premix Ex Taq™ (2×) (Takara), 1 µl (10µM) each of the forward and reverse primer, 5 µL DNA, 2 µL probe and 3.5 µl of RNAse/DNAse free PCR water. PCR reaction conditions included an initial denaturation at 94 ℃ for 10 min, followed by 45 cycles of 94 ℃ for 1min and 60 ℃ for 105 s. Melting Curve analyses were performed from 60 ℃ to 96 ℃ with increments of 0.1℃ per cycle. The gene copy number in each sample was determined by comparing to Ct values/gene copy number of the standard curve.
Statistical analyses. The statistical analyses for microbiome were performed at the R v3.3 platform (https://www.r-project.org/). Permutational multivariate analysis of variance (Adonis analysis) was performed with the R vegan package, and the Adonis P-value was generated based on 999 permutations. Distance-based redundancy analysis (dbRDA) was performed on these taxonomic composition profiles with the vegan package, based on the Bray-Curtis dissimilarly, and visualized via the R ade4 package. The RA-associated species and modules were identified based on the Wilcoxon rank-sum test. The q value was used to evaluate the false discovery rate for correction of multiple comparisons, and was calculated based on the R fdrtool package. P value < 0.05 or q value < 0.1 was considered statistically significant. Correlation analysis was carried out by using the Spearman's rank correlation statistical measurement system 51. One RA patient was excluded from the clinical association analyses due to the missing information.
In the experiment part, the differences of arthritis scores and incidence between groups were determined by two-way ANOVA followed by Tukey's multiple comparisons test and Kaplan-Meier analysis with log-rank test, respectively. The difference of cytokines, antibodies, lymphocytes or DNA abundance between two groups was analyzed by Mann-Whitney U nonparametric test. The correlation between variables was evaluated using the Spearman’s rank correlation test. The statistical analyses were performed by the GraphPad prism v8.00 software (GraphPad, San Diego, CA, USA). A two-sided P-value of < 0.05 was considered statistically significant.