We found 19 original research papers matching our criteria; we decided to group the studies into five categories according to the final aim and the main findings as follow: (1) Gut microbiome of children at-risk of CeD before the onset of the disease (2) Gut microbiome of CeD patients in comparison with healthy controls, and other groups of interest, (3) Influence of GFD on the microbiome of healthy people (gut) and CeD and Non-Celiac Gluten Sensitivity patients (saliva and gut) (4) Upper gastrointestinal tract microbiome in CeD and (5) Treatments: dietary interventions, prebiotics and hookworm infections. Table S2 summarise the main findings of the select papers.
3.1.1 Gut microbiome of children at risk of CeD before the onset of the disease.
We found four research in which a longitudinal study of the microbiome profile in stool samples of children at-risk of CeD before the disease's onset was performed. Some of these studies focused on identifying microbiome composition at an early age in at-risk infants that could be predictive of CeD development [33, 51, 52], the effect of the time of first gluten exposure [33] and other environmental factors [53]; such as delivery mode, antibiotic exposure and infant feeding type on microbial gut composition and/or CeD development.
Sellitto et al. (2012) were the pioneers in using the Pyrosequencing of the 16S rRNA gene to study infant faecal microbiome before CeD onset. Their research aimed to know if gut microbiota trajectory in early life may predict the development of celiac disease and if first-time gluten exposure could affect the time of CeD onset. The authors found that the microbiota composition was highly different among at-risk infants before 18 months of age but converged at 24 months. In all children, the microbiota was characterised by a low abundance of members of the phylum Bacteroidetes. Infants were breastfed until six months of age. From 6 to 12 months, eight children followed a GFD and the other eight children followed a gluten-containing diet. From 12 to 24 months, all the children followed a gluten-unrestricted diet. At 12 months of age, more children in the group of early exposure to gluten presented antigliadin antibodies. Only one out of all children belonging to the group of early exposure to gluten developed CeD. The children who developed CeD showed a reduction in bacterial richness from 6 to 10 months. However, no definitive answer can be given to the original questions because only one child is insufficient to make statistical analysis and achieve valid conclusions [33].
Six years later than Sellitto et al. (2012) study was published, Rintala et al. 2018 assessed the same question. May gut microbiota trajectory in early life be a predictor of celiac disease development? But, increasing sample size and follow-up time. They studied changes in microbial communities of 27 infants genetically at risk of CeD from 9 to 12 months of age. Unlike Sellitto et al., they did not test the effect of first-time gluten exposure. The introduction of gluten was initiated from 4.4 to 5.7 months of age in all cases, similar to the early exposure time in Sellito et al. The follow-up time of this study was more extensive than in Sellito et al. Nine children (all girls) out of the 27 developed CeD at the median age of 3.5 years (range 2.6–4.2 years). These observations would not have been made in the followed time studied by Sellito et al. (0–24 months). Rintala et al. study showed a significant increase in children's microbiota alpha diversity in general between 9 and 12 months of age. Still, no differences among healthy controls and the group who develop CeD were detected. The study concludes that the individuals who develop CeD, do not already in the early infancy, have a distinct faecal microbiota composition compared to other infants with risk-HLA-haplotype, suggesting that the onset of CeD is more likely a consequence of a solid external trigger rather than gradual development due to peculiarly vulnerable gut microbiota [52].
Another study by Olivares et al. (2018) aimed to answer the same question about the predictive power of microbial communities' changes at an early age of infants genetically at risk of developing CeD. With a similar sample size and time of gluten exposure, samples were taken earlier in the baby's life. They studied the microbiota of 20 children from 4 to 6 months of age. The introduction of gluten was initiated from 6 months of age onwards in all cases. Nine out of the twenty children developed CeD among 16 and 40 months of age and one at 82 months. Unlike Rintala et al. they found changes among microbiota of cases vs controls. The study showed a significant increase in microbiota alpha diversity of healthy control children (characterised by increases in Firmicutes families) between the ages of 4 and 6 months, which did not occur in the child group who developed CeD. An increased relative abundance of Bifidobacterium longum was associated with control children, while increased proportions of Bifidobacterium breve and Enterococcus spp. were associated with CeD development. Authors conclude that the diversity of microbiome increases among 4 and 6 months of age in children who remained healthy but not in those who develop CeD [51].
Finally, Leonard et al. (2020) attempted to characterise the changes in microbial communities of infants genetically at risk of CeD. However, unlike the studies mentioned above, they did not perform a follow-up, and they did not evaluate the predictive power of the microbiota composition because their question was different: Do environmental factors like delivery mode, antibiotic exposure and infant feeding type impact on the microbial composition of gut in at-risk infants?. Authors found that genetic risk to develop CeD was associated with decreased abundance of Streptococcus and Coprococcus and decreased abundance of Veillonella, Parabacteroides and Clostridium perfringens at 4–6 months of age. Also, with increased abundance of Bacteroides and Enterococcus species at 0 months. Cesarean section delivery was associated with a decreased abundance of Bacteroides vulgatus and Bacteroides dorei and folate biosynthesis pathway. With an increased abundance of hydroxyphenyl acetic acid, alterations implicated in immune system dysfunction and inflammatory conditions. They found differences in the microbiota regarding exposure to breastmilk and formula and between antibiotic exposure (as an environmental risk factor). Despite the study provides insights into taxonomic and functional shifts in the developing gut microbiota of infants at risk of CeD linking genetic and environmental risk factors to detrimental immunomodulatory and inflammatory effects, it is unclear whether they indeed contribute to the future development of CeD [53].
In summary, these studies evidence changes in the microbiome of infants with CeD, particularly modifications in diversity, demonstrating the importance of the inclusion of gluten in diet.
3.1.2 Gut microbiome of CeD patients in comparison with healthy controls and other groups of interest.
We reviewed five papers in which a case-control study of the microbiome profile in stool and/or duodenal samples of CeD patients was carried out.
In 2013, Cheng et al. performed a study analysing the duodenal microbiota from newly diagnosed children with CeD. They did not find significant differences in the abundance of taxa between groups, neither at phylum nor at the genus level. However, using Random Forest (RF) algorithm with a subpopulation of eight genus-like taxa, they were able to separate samples between CeD and healthy controls. Taxa included Prevotella melaninoenica, Haemophilus sp. and Serratia sp. augmented in CeD, whereas Prevotella oralis, Proteus sp., Clostridium stercorarium, Ruminococcus bromii, and Papillibacter cinnamivorans were augmented in controls. This classification system had an error rate of 31.6% [54] This study showed that bacteria from Proteobacteria were elevated in CeD and butyrate producers Firmicutes were diminished despite the sequencing technology used.
A different approach was conducted by Pellegrini et al. (2017). They analysed the duodenal microbiota of CeD and patients with Diabetes Type 1 (T1D), trying to find differences in the inflammatory state compared with healthy controls. The authors did not found differences in the diversity measures across the studied groups. However, they found differences in the inflammatory status of the patients studied according to the type of pathology and some relevant changes in the microbiota. In this regard, authors showed that CeD patients share some similitudes in the microbial profile with T1D i.e., reduction in Bacteroidetes phylum and the class Clostridia, but the microbial profile differed according to the abundance of Proteobacteria, being increased in CeD, particularly Gammaproteobacteria from the family Pasteurallaceae, and the genus Haemophilus spp. [55].
Later, Garcia-Mazcorro et al. (2018) analysed duodenal and stool microbiota from gluten-related disorders in Mexican patients. Again, they only found differences in particular taxa abundance but not in the general parameters of diversity. Regarding duodenal samples, a lower abundance of Bacteroidetes and Fusobacteria was found in CeD patients and a higher abundance of Actinobacillus (Gammaproteobacteria), Finegoldia (Clostridia) and TM7 in Non-Celiac gluten sensitivity patients (NCGS), on the other hand, Sphingobacterium (Bacteroidetes) was higher in healthy subjects. The microbial profile of stool samples was different, Firmicutes were predominant regardless of disease status, and Bacteroidetes were less than 1%. For the first time, this study showed particular microbial changes specific to gluten-related disorders in Mexican people [56].
Bodkhe et al. (2019) studied faecal microbiome and duodenal samples from patients with CeD and first-degree relatives. Despite no significant differences in alpha diversity between sample sites were found, a different overall composition of the microbiome was inferred between groups, showing that microbiome composition and diversity were different between duodenum and stool samples for all disease status evidenced by changes in individual taxa abundance [57].
More recently, Panelli et al. (2020) studied also duodenum, stool, and saliva from CeD patients. In stool samples, Bacteroides were predominant, and no differences in taxa abundance were detected between groups. A decrease in Firmicutes and Actinobacteria accompanied by an increase in Proteobacteria was evident in the duodenum and salivary samples of CeD groups. While at the genus level an expansion of Neisseria and a reduction of Streptococcus were found in CeD groups compared to controls was also appreciated. Authors conclude that there are different patterns of diversity across groups and body tissues [58].
These studies demonstrate that alfa-diversity between samples from patients with CeD, and other groups did not show differences. However, specific changes in taxa abundance were found. Some similarities were evident across studies, for example, the increase of Proteobacteria in CeD patients and a decrease in Firmicutes and Actinobacteria, evidencing the existence of a dysbiosis in CeD with a predominance of gram-negative bacteria.
3.1.3 Influence of GFD on the microbiome of healthy people (gut) and CeD and NCGS patients (saliva and gut)
We found five papers studying GFD in different conditions.
Bonder et al. (2016) study the influence of a GFD through time by measuring microbial changes in patients with a GFD for four weeks followed by a washout period. The authors did not observe a time-dependent change in alpha diversity or differential abundance in the microbiome composition. There was a substantial difference when comparing individuals regardless of time point, showing that inter-individual effect on the microbiome variation is stronger than the effect of diet. Carbohydrate and starch metabolisms were altered with differences between GFD and a healthy diet in the abundance of pathways related to tryptophan metabolism, butyrate metabolism, fatty acid metabolism, and seleno-compound metabolism. The authors conclude that change in diet did not influence the bacterial diversity within a sample. Inter-individual differences were more influential than the effect of GFD [59].
Garcia-Mazcorro et al. (2018) the study was partially discussed in Sect. 1.2. They also investigated the potential microbial signatures associated with GFD by consuming certified gluten-free foods in patients with paired samples. Contrary to Bonder et al., they found that the groups studied showed group-specific variation over time after consuming the GFD for four weeks, notably NCGS presented an increase in the duodenal Pseudomonas on the GFD. In contrast, only half of CeD patients showed increases in Pseudomonas, but these increases were pronounced to affect median values. Biomarker analysis of the taxa at the genus level confirmed the results on Pseudomonas and showed that other Proteobacteria (e.g., Stenotrophomonas and Novosphingobium) were significantly more abundant on the GFD. However, these differences were only observed in duodenum samples, and there were no differences regarding these bacterial groups in faecal samples [56].
Ercolini et al. (2016) studied the influence of a change on GFD over the microbiome of celiac children following an African- style GFD during two years before the 60 days of treatment with an Italian-style gluten-free diet. After Italian-style GFD treatment, the microbiome underwent a time-dependent reduction in alpha diversity compared to baseline samples on African-style GFD. Many significant differences in microbiome composition were found. Regardless of changes in abundance, the core microbiome was conserved in children and is composed of fifteen genera belonging to the phylum Firmicutes. The metagenome prediction confirmed the diet's remarkable effects at a possible functional level. At baseline, there was a higher abundance of genes related to carbohydrate metabolism. In contrast, after 60 days of diet change, the saliva was characterised by a higher abundance of genes related to amino acid metabolism, vitamins and cofactors. The change in GFD diet style alters the Saharawi children's salivary microbiome significantly, reducing its alfa diversity, changing their beta diversity and varying the abundance of microbiome members and functional pathways [60], suggesting that diet composition is a critical factor in evaluating the effect of GFD over CeD patients microbiome.
D'Argenio. et al. (2016) compared the microbiome of CeD patients with unrestricted diet, under a GFD, and healthy controls. They found that Actinobacteria and Firmicutes phyla were less abundant in the CeD on gluten-containing diet than in the other two groups. Betaproteobacteria class was highly represented in active CeD patients' gut microbiome, although its levels did not directly correlate with disease severity. Neisseriales order, the Neisseriaceae family, and the Neisseria genus were significantly more abundant in active CeD patients than in the other study groups. Bacterial richness did not differ between CeD and control. The Beta diversity of bacterial communities was statistically significant within the three groups, suggesting that CeD patients' microbiomes either on GFD or unrestricted diet are more similar to each other than either of them to the control microbiomes. Authors conclude that overall microbial community composition is different among CeD and controls, although richness is comparable [61].
In a similar study using samples from duodenum, Nistal. et al. (2016) did not found significant differences in alpha or beta diversity among CeD and controls; however, Asteuralleaceae and Streptococcus members were frequent in CeD patients, mainly on a gluten-containing diet. The authors did not find dysbiosis in the microbiota composition in the duodenum. They suggested that the microbiome's role in the disease must be associated with the function and not merely the composition [62].
In conclusion, there is no consensus in the literature regarding the adherence of CeD patients to a GFD. Some authors mention that inter-individual effect on the microbiome is greater than the effect of diet, whereas some studies report partial changes in the diversity and microbial composition after acquiring this type of diet.
3.1.4 Upper gastrointestinal tract microbiome in CeD.
CeD mainly affects the small intestine (duodenum), its diagnosis often comprises a combination of clinical, serological and histopathological findings, and small-bowel biopsy specimens are fundamental for an accurate diagnosis [23]. However, biopsy-sparing diagnostic guidelines have been proposed and validated in a few recent prospective studies, as the obtention of a duodenal biopsy o duodenal content is an invasive procedure, some studies aim to find microbial markers for other parts of the upper gastrointestinal tract, with minor invasive procedures (saliva. and oropharyngeal exudate) in order to identify possible microbial markers as "surrogate markers" of the duodenum. We found three papers studying the upper gastrointestinal tract microbiome in CeD.
In a study by Francavilla et al. (2014), the authors revealed that the composition of the main bacterial phyla differed between the salivary microbiota of CeD children and controls. Lachnospiraceae, Gemellaceae (genus Gemella), and Streptococcus sanguinis were most abundant in CeD children's saliva. At the same time, the abundance of Streptococcus thermophiles was markedly decreased. Other Firmicutes (e.g., Veillonella parvula) associated with oral health were found at the highest levels in controls' saliva. The saliva of CeD children harboured the highest levels of Bacteroidetes (e.g., Porphyromonas spp., P. endodontalis, and Prevotella nanceiensis) and the lowest levels of Actinobacteria. The authors conclude that the salivary microbiome of CeD children is less diverse, with a different community structure of healthy children's microbiome. CeD microbiome has increased commensal opportunistic pathogens and reduced microbial species associated with health [63].
A similar study conducted by Tian et al. (2017) analysed salivary microbial profiles and protease activities of the microbiome given their potential involvement in gluten digestion and processing in CeD and refractory CeD (R-CeD). Significant differences were found between the CeD and R-CeD groups with respect to Bacteroidetes (CeD > R-CeD), Actinobacteria (CeD < R-CeD), and Fusobacteria (CeD > R-CeD) abundances. The overall microbial diversity was greater in the healthy subjects than in the CeD group, and controls had a higher abundance of TM7 sp., Treponema sp., Simonsiella muelleri, Actinomyces sp., Porphyromonas sp Alloscardovia omnicolens in comparison to CeD and R-CeD. The authors studied gluten degradation rates and found that gluten substrate degradation in saliva was relatively low in healthy controls. However, gluten degradation rates were significantly higher in saliva from CeD patients, regardless of whether the tripeptide substrate or the 33-mer substrates were employed [64].
Finally, Iaffaldano et al. (2018) found that the oropharynx of CeD patients showed a microbial imbalance similar to that previously described in the CeD duodenal microbiome undergoing an unrestricted diet. They observed an almost coincident oropharyngeal microbiome in controls and GFD subjects. Genes associated with polysaccharide metabolism predominated in control and CeD-GFD microbiomes. In contrast, in CeD patients with an unrestricted diet, the more significant metabolic potential for degradation of amino acids, metabolism of lipid and ketone bodies and microbial antioxidant defence mechanisms was observed. Finally, the authors suggest a continuity of CeD microbial composition from mouth to the duodenum, proposing that the characterisation of the microbiome in the oropharynx could contribute to investigating the role of dysbiosis in CeD pathogenesis [65].
Upper intestinal microbiota analysis revealed differences across CeD patients and healthy controls, suggesting the possible use of less invasive samples as a marker. There is a need to perform studies integrating a significant number of samples to determine universal markers related to CeD.
3.1.5 Treatments: dietary interventions, prebiotics and hookworm infections
Due to the implications of the microbiome in CeD onset and progress, dietary modifications aiming to restore microbial eubiosis are of interest. In this sense, probiotics intake and different approaches such as hookworm infections that modify the microbiome and activate the immune response have been proposed as useful tools for modifying gut microbiota in CeD patients. We discuss here two papers evaluating both approaches.
Quagliariello et al. (2016) evaluate the probiotic effect of two Bifidobacterium breve strains on the gut microbiota composition of CeD under a GFD. Firmicutes/Bacteroidetes (F/B) ratio was calculated for each group of subjects; this ratio expresses the relationship between the two dominant phyla found in the gut microbiota and associated with several pathological conditions [66]. CeD subjects had F/B ratio values lower than the control group. However, the administration of the probiotic strains for three months showed to increase this ratio. The authors found that Firmicutes were significantly lower in CeD subjects not receiving the probiotic formulation than the control and probiotic groups. Similar results were found for Actinobacteria, underrepresented in the CeD group, and increased after the administration of probiotic strains. The treatment with the B. breve strains not caused changes at the levels of genus or phylum to which the probiotic belongs, but the intake of the probiotic strains acted as a "trigger" for the increase of Firmicutes and restoring F/B ratio. The authors concluded that three months of administration of B. breve strains could make the intestinal microbiota of CeD patients more similar to that of healthy individuals, restoring the abundance of some microbial communities that characterise the typical physiological condition [67].
In another study, Giacomin et al. (2016) investigated alterations in duodenal microbiota of CeD subjects before and following experimental infection with Necator americanus and following the administration of a low and high dose of dietary gluten. Considering that experimental infections with the human hookworm N. americanus could be used to treat CeD; infection alone increases anti-inflammatory cytokine, hinders colonisation of Proteobacteria, and favours the enrichment of Clostridiales in patients on GFD to improve tolerance to gluten micro-challenges [68]. They revealed that diversity in trial subjects was higher than in control subjects. Moreover, they observed significant microbial diversity changes throughout the trial: at week 24, a significant increase in diversity compared to baseline samples, whereas at week 36, diversity returned to baseline. These results suggest a correlation between infection status and exposure to escalating gluten doses and the gut microbiota composition. Finally, the authors conclude that low doses but not high doses of gluten in hookworm infected CeD patients previously in long-term GFD have beneficial effects in duodenal microbiome diversity (increase). However, the study does not differentiate between gluten and infection effects, do not measure inflammatory markers or mention symptoms and do not go in-depth on analysing individual taxa abundance changes [69].
3.2 Integrative data analysis of publicly available CeD microbiome data sets.
We combine 16S RNA sequencing gene datasets for the first time and perform an analysis following the same pipeline. We compared sequences generated from different regions of the 16S rRNA gene by using a reference mapping protocol for ASV assignment, in which sequences from different regions of the 16S rRNA gene will map to the same full-length reference sequence from SILVA SSU v.138 database [43] if they are from the same species. Achieving an integrative analysis including a high number of samples from different body sites and extensive metadata to find microbial biomarkers characteristic of CeD taking into account the type of diet.
We found nine out of the nineteen selected studies meeting inclusion criteria for the merged data analysis (Table 1). Table 2 shows the number of samples for each study, the clinical classification and tissue of origin (stool, duodenum, pharynx or saliva) for the data included in the analysis. Finally, we included 435 total samples, comprising 190 patients with active or treated CeD and 245 controls.
Table 1
The number of samples for each study, clinical classification and tissue of origin for the data included in the analysis. The cases (n = 190) include patients with celiac disease, on a diet with or without gluten, the controls (n = 245) include non-celiac patients with gluten intolerance, gluten-free diet or without any enteropathy. The total samples add up to 435.
References
|
Accession Nº ENA1
|
Sampled tissue
|
Sequencing technology
|
16S region
|
Bodkhe, et al. (2019)
|
PRJNA385740
|
duodenum and stool
|
Illumina Miseq
|
V4
|
Garcia-Mazcorro et al., (2018)
|
PRJNA401920
|
duodenum and stool
|
Illumina Miseq
|
V4
|
Laffaldano et al., (2018)
|
PRJNA371697
|
pharynx
|
Illumina Miseq
|
V4-V6
|
Olivares et al., (2018)
|
PRJEB23313
|
stool
|
Illumina Miseq
|
V1-V2
|
Tian, et al., (2017)
|
PRJNA321349
|
saliva
|
Illumina Miseq
|
V3-V4
|
Bonder et al., (2016)
|
PRJEB13219
|
stool
|
454
|
V3-V4
|
Giacomin et al., (2016)
|
PRJNA316208
|
duodenum
|
454
|
V1-V3
|
Quagliariello et al., (2016)
|
PRJEB14943
|
stool
|
Illumina Miseq
|
V3-V4
|
Francavilla et al., (2014)
|
PRJNA231837
|
saliva
|
454
|
V1-V3
|
1. Accession numbers in the "European Nucleotide Archive" (ENA) database |
Table 2
Statistics of Input data to analyse (After filtering samples and ASVs).
Tissue
|
Feeding Habit
|
ASVs
|
Samples
|
Cases
|
Controls
|
duodenum
|
GFD
|
475
|
30
|
12
|
18
|
duodenum
|
Unrestricted
|
475
|
89
|
40
|
49
|
stool
|
GFD
|
374
|
89
|
52
|
37
|
stool
|
Unrestricted
|
374
|
107
|
24
|
83
|
saliva
|
Control Unrestricted and Case GFD
|
120
|
78
|
40
|
38
|
pharynx
|
Control Unrestricted and Case GFD
|
71
|
42
|
22
|
20
|
ASV = Amplicon Sequence Variant. GFD = Gluten free diet. Cases = Samples from patients with Celiac disease. |
Table 3
Statistics of significant results obtained on Picrust2 predictions analysis.
Tissue
|
Feeding Habit
|
Gene Differential abundance
|
Gene
Correlation
|
Pathway
Differential abundance
|
Pathway
Correlation
|
duodenum
|
GFD
|
412
|
0
|
13
|
0
|
duodenum
|
Unrestricted
|
404
|
0
|
8
|
0
|
stool
|
GFD
|
31
|
69
|
0
|
1
|
stool
|
Unrestricted
|
261
|
0
|
2
|
0
|
GFD = Gluten-free diet. FC > = 3, p < 0.01 |
3.2.1 Diversity and microbial composition.
Similar to previously discussed studies in Sect. 2, we did not find differences among healthy controls and CeD patients regarding alpha diversity indexes. However, when we consider GFD (for duodenum and stool samples), we found an increment in the diversity in patients as in healthy controls undergoing a GFD. Alpha diversity of the microbiome was estimated using the Chao1, Shannon and Simpson indices (Fig. 2).
For the beta-diversity analysis, we study differences among groups in each tissue sampled by PCoA using weighted UniFrac distance, but we did not find a clear separation when analysing biological variables (age, type of diet or clinical condition), indicating that there are no differences in beta diversity between the microbial communities studied and that the differences in the microbiome are not due to global differences in the abundance of the phylogenetic groups present, but probably due to the abundance of specific taxa.
Our results are in the same line as others that assure that differences in the microbiota across different body sites are enough for grouping samples [72]. Permutational multivariate analysis of variance (PERMANOVA) for each variable by ADONIS function revealed that Study Accession, in the case of the duodenum, saliva, and stool, was a factor influencing the grouping of samples. This may be explained by the experimental protocols used in each study, including differences in the sequencing platform, the region of 16S rRNA targeted, and the DNA extraction technique used, suggesting that some particular protocols may induce some biases, however, the principal coordinates explain a low percentage of the variance between the samples, indicating that other non-technical factors are mainly responsible for the variance.
3.2.2 Differential analysis of the microbiota, correlation and biomarker finding.
To establish microbial biomarkers, first, we conduct a biomarker finding analysis using the LEfSe tool, followed by a differential abundance analysis using DESeq2 to identify ASVs that were differentially expressed according to studied groups. Finally, a correlation analysis looking for an association between CeD and microbial composition was performed. Figure 3 summarise the main findings obtained in each tissue analysed.
3.2.2.1 Microbial changes associated with duodenal microbiota in CeD.
We found that bacteria of the phylum Proteobacteria were characteristic of CeD patients duodenum, with different genus present according to the type of diet.
(i) Microbial changes of duodenal microbiota from untreated CeD patients undergoing an unrestricted diet. For duodenal samples of patients undergoing an unrestricted diet, we found 23 ASV with an LDA score greater than 3 (Fig. 4a). Nine ASV were associated with CeD patients mainly from Proteobacteria phylum, particularly bacteria from the Burkholderia-Paraburkholderia-Caballeronia clade, Alphaproteobacteria, and Enterobacteria, and Actinobacteria from the family Corynebacteriaceae. On the other hand, healthy controls were enriched in Firmicutes phylum class Negativicutes, and phylum Epsilonbacteraeota.
After differential relative abundance analysis, we found 93 ASVs with significant changes in abundance (FDR < 0.01) (Fig. 5a). Among them, 61 were decreased in CeD, and 11 were increased in CeD. Significant ASVs belong to phyla: Firmicutes, Bacteroidetes, Proteobacteria, Epsilonbacteraeota, Actinobacteria, Spirochaetes, Fusobacteria, Synergistetes.
Finally, we found a negative association between CeD and the genus Actinomyces, whereas the top five positive associated genus were Coprococcus 3, Hydrocarboniphaga, Ruminococcaceae UCG_010, Cutibacterium, and Deinococcus.
(ii) Microbial changes of duodenal microbiota from CeD patients undergoing a GFD. We select healthy controls and patients undergoing a GFD to study the microbial composition between the two groups. Biomarker finding analysis revealed 65 ASVs with a LDA score greater than 3 (Fig. 6a). Eight ASV were associated with CeD patients, mainly from Proteobacteria phylum belonging to Burkholderia-Paraburkholderia-Caballeronia clade, and alfaproteobacterias Afipia, and order Rhizobiales. On the other hand, 57 ASV were related with healthy controls following a GFD, particularly from phylum Firmicutes, Fusobacterium, Actinobacteria, and Epsilonbacteraeota comprising the genus Leptotrichia, Fusobacterium, Rothia, and Campylobacter, and other Proteobacteria (Neisseriaceae, Pseudomonales and Haemophilus).
After performing the differential abundance analysis, we found 49 ASVs with significant changes in abundance (FDR < 0.01) (Fig. 5b). Nine were increased in CeD, whereas the other 40 ASV were decreased. Significant ASVs belong to phyla: Actinobacteria, Proteobacteria, Firmicutes, Epsilonbacteraeota, Bacteroidetes, Fusobacteria, and Spirochaetes.
Finally, the top five genus negative associated with CeD duodenum of patients undergoing GFD were Haemophilus, Neisseria, Alloprevotella, Fusobacterium and Delftia. In contrast, the top five positive associated genus were mainly from Proteobacteria, Alfaproteobacteriales: Falsirhodobacter, Asinibacterium, Azonexus, and Blastomonas.
3.2.2.3 Microbial changes associated with stool microbiota in CeD.
Stool samples are more representative of the colonic microbiota; however, they could also indicate changes related to the disease. Like in duodenum samples, we found an increase in bacteria of the phylum Proteobacteria, but different genus was enriched according to the type of diet. Specific changes found were as follows:
(i) Microbial changes of stool microbiota from untreated CeD patients undergoing an unrestricted diet. After biomarker analysis, we found 60 ASVs with an LDA score greater than 3 (Fig. 6a). Of them, 17 were associated with CeD, bacteria from phylum Verrucomicrobia and Firmicutes were characteristics of this group, mainly the genus Akkermansia, Anaerostipes, Faecalibacteria and Dorea, on the other hand in healthy controls undergoing an unrestricted diet ASV from phylum Proteobacteria (Betaproteobacteriales) and Firmicutes mainly order Clostridiales were overrepresented.
Differential abundance analysis revealed 74 ASVs belonging to phyla: Firmicutes, Bacteroidetes, Actinobacteria, Proteobacteria with significant changes in abundance (FDR < 0.01) (Fig. 7a). Among them, 56 were decreased in CeD and seven were increased in CeD. Finally, top five genus positively associated with CeD were: Achromobacter, Flavisolibacter, Geodermatophilus, Candidatus Rubidus, and Tepidimonas.
(ii) Microbial changes of stool microbiota from CeD patients undergoing a GFD. Biomarker finding analysis revealed the presence of 48 ASVs with LDA > 3 (Fig. 6b). Twenty-two were associated with CeD patients, mainly from phylum Verrucomicrobia, Bacteroides, Firmicutes and Proteobacteria, particularly genus Akkermansia, Bacteroides, Romboutsia, and Pseudomonas. On the other hand, 26 ASV were related with controls undergoing GFD, mainly bacteria from phylum Firmicutes and Actinobacteria from genus Lactobacillus, Streptococcus, Ruminoccocous and Bifidobacterium. Differential abundance analysis revealed changes in 77 ASVs (FDR < 0.01) (Fig. 7b). Among them, 41 were decreased in CeD and 19 were increased in CeD. Significant ASVs belong to phyla: Firmicutes, Proteobacteria, Bacteroidetes, Verrucomicrobia, Actinobacteria, Lentisphaerae.
Finally, correlation analysis showed that 15 ASV with a significant correlation with CeD. 5 have a negative correlation with CeD (Ruminococcus 1, Ruminococcaceae UCG_014, Ruminiclostridium 6, Coprococcus 2, Enterococcus) and ten ASV have a positive correlation, and the top five genus positive correlated were Intestinibacter, Akkermansia, UBA1819, Flavonifractor, and Terrisporobacter.
3.2.2.4 Microbial changes associated with pharynx microbiota in CeD.
We found two microbial biomarkers characteristic of the pharynx from patients with CeD. Rothia, a nitrate-reducing bacteria usually found in the oral cavity of humans [70], and Peptosptreptoccus, an oral pathogen recently associated with Colorectal Cancer development [71] (Fig. 8a). On the other hand, differential abundance analysis showed significant changes in abundance (FDR < 0.01) of five ASV. Veillonella, Mogibacterium (p_Firmicutes), Streptobacillus (p_Fusobacteria) and Mannheimia (p_Proteobacteria) where increased in CeD whereas Treponema 2 (p_ Spirochaetes) was decreased.
3.2.2.5 Microbial changes associated with microbiota from saliva samples in CeD.
Despite the most significant changes found in the microbial composition on other parts of the gastrointestinal tract, we only found one genus related to CeD in saliva and was identified after differential abundance analysis and biomarker finding, namely genus Oceanivirga belonging to the phylum Fusobacteria, family Leptotrichiacea.
3.3 Prediction of the metabolic functions profiles in bacterial communities.
The functional potential of the microbiome in the different tissues was predicted using the PICRUSt tool, followed by a differential abundance analysis performed with Deseq2 and correlation analysis.
3.3.1 Prediction of short-chain fatty acid (SCFA) production genes and inflammation-related pathways.
To study the metabolic potential of the microbiome, the analysis was first carried out for all the genes and pathways detected and, since the intestinal microbiome, after its contribution to the digestive process of food, produces short-chain fatty acids that influence the maturation, maintenance, and behaviour of the mucosal immune system, we focus on the genes and pathways involved in the production of these compounds. Figure 3 summarises the main findings observed according to each tissue analysed.
First, we search for the 26 genes reported as potentially involved in the biosynthesis pathways of the main SCFAs produced by the human microbiome (butyrate, acetate, and lactate), given their involvement in the modulation of the immune system. Genes included were those identified by Zhao et al. (2019) by gene deletion and overexpression experiments in E. coli. These authors identified six genes needed in acetate production (menI, tesA, yciA, fadM, tesB, ybgC), eight genes used in butyrate (entH, tesA, ybgC, ybhC, yicA, menI, yigI y tesB) production, and two genes required for lactate (mgsA y lldD) production in addition to the previously known genes pta-ackA, ptb-buk, ldhA, poxB, eutD, tdcD, dld y ykgF [72]. Table 2 shows the number of genes found according to each tissue and the differentially abundant genes/routes between CeD patients and healthy controls.
Differential abundance analysis of metabolic pathways is summarised in Table S3, it revealed an increase in the duodenum of CeD patients, of the degradation of D-glucarate, L-arabinose, D-galactarate and biogenic amines and a decrease in the degradation of lactose and galactose. Regarding genes involved in the production of SCFA, we found a decrease in the abundance of genes involved in the production of acetate (ackA) and lactate (ldhA) and a negative correlation with genes involved in the production of acetate (pta), lactate (lldD, ldh, dld and tesB) and butyrate (tesB). A decrease in the abundance of the fermentation pathway from hexitol to SCFAs involving the ackA and pta genes was consistently found.
In stool samples of patients with CeD, the synthesis routes of lipopolysaccharide (LPS) components of the membrane of gram-negative bacteria, acetate degradation routes, and vitamin B synthesis (B1 and B9) were found to be increased. Regarding genes implied in SCFA synthesis, correlation analysis revealed that two genes related to acetate and lactate production (pta and dld) were negatively associated with CeD (FDR ≤ 0.01; correlation ≥ 0.2).
No statistically significant differences were found in the abundance of any of the 26 genes related to SCFA production regarding saliva and pharynx samples. However, in saliva samples of CeD patients, we found an increase in peptidoglycan synthesis, and the intermediate degradation routes of aromatic compounds and amino acids.
3.3.2 Prediction of genes and genes coding for prolyl peptidase enzymes.
Some components of the microbiota can express enzymes different from those produced by humans and promote the digestion of compounds such as the immunogenic peptides of gluten [14, 15]. We analysed four enzymes involved in the degradation of immunogenic gluten peptides that could be involved in reducing CeD symptoms (general N-type aminopeptidase (PepN), X-prolyl dipeptidyl aminopeptidase (PepX), endopeptidase (PepO) and endoprolyl peptidase (PREP)). However, none of the four enzymes involved in the degradation of gliadin peptides were found to be differentially abundant between cases and controls or associated with CeD in any of the tissues.