Genome-resolved metagenomics of milk microbiomes reveals the influence of maternal dietary fiber on neonatal inheritance of immunoregulatory traits

doi:10.21203/rs.3.rs-2641343/v1

This work is licensed under a CC BY 4.0 License

Version 1

Breastfeeding facilitates vertical transmission of microbes from mothers to infants. Milk microbiome composition is strongly influenced by maternal diet, and this affects which taxa are likely to colonize the infant gut with consequences for host health and immune development. At present, it is unclear how diet influences the composition of the milk microbiome and why these microbes lead to different health outcomes for the infant. Here, we used metagenomics and metabolomics to link microbially-mediated immunoregulatory traits and metabolites to individual milk microbial taxa, and determine how the representation of these traits changes with maternal dietary fiber content. We assembled and annotated genomes accounting for 90% of the milk microbial communities from breastfeeding mice fed high- or low-fiber chow. Diverse carbohydrate and fatty acid content in high-fiber milk was associated with diverse microbes harboring multiple glycoside hydrolases and high redundancy of immunoregulatory metabolite pathways. Low dietary fiber, by contrast, produced milk enriched in amino acids and a low-diversity, peptide-degrading microbiome with limited immunoregulatory traits. Our study indicates that complex milk carbohydrate availability drives assembly of a diverse milk microbiome, and by extension a diverse set of immunoregulatory functions inheritable by the breastfeeding infant. Collectively, our findings highlight how the mother’s diet influences the composition of the milk microbiome and the potential vertical transmission of immunoregulatory traits from mother to infant.

Biological sciences/Microbiology/Microbial communities/Microbial ecology

Biological sciences/Biological techniques/Sequencing/Next-generation sequencing

Gut microbiomes are complex and dynamic microbial communities that influence host health and nutrition starting in early life [1-3]. Indeed, factors that decrease bacterial diversity in this critical period are associated with increased risk of metabolic, neurodegenerative, autoimmune, and allergic diseases [4, 5]. The composition of neonatal gut microbiomes has even been linked to the later development of doctor-diagnosed asthma [6]. It is now recognized that a range of gut microbiome-produced immunoregulatory metabolites directly impact gut health, neonatal immune development, and later life immune function (Table 1). Examples include regulation of intestinal epithelial integrity, homeostasis, and function [7-14]; the expansion of colonic regulatory T cells [15]; immune cell haematopoiesis in the bone marrow [16]; neuroprotection [17, 18]; bile acid production [19, 20]; and cholesterol metabolism [21, 22]. Metabolites such as retinoate, glycine, and taurine also promote assembly of the gut microbiome itself [23-26]. Accordingly, there is a need to understand the processes in the perinatal period that promote the assembly and persistence of a healthy microbiome in infancy and into childhood [27].

Table 1 Microbiome-associated immunoregulatory metabolites and their functions.

Class	Function	Metabolite
Short Chain Fatty Acids (SCFAs)	T-regulatory and B cell expansion and differentiation [87-91]	Acetate, Propionate, Butyrate
	Myeloid cell hematopoiesis [43, 92-95]	Acetate, Propionate, Butyrate
	Intestinal mucosal maintenance [10]	Acetate, Butyrate
	Neutrophil survival, recruitment, and function [96]	Acetate, Propionate, Butyrate
	Downregulation of DC maturation markers CD1a, CD80, CD83, and MHC II; downregulation of TNF-α, IL-1β, and IL-6 production and upregulation of IL-10 production by DCs [97, 98]	Butyrate
	Differentiation of T cells into Th17 and Th1 effector cells; reduced Th2 polarization [95, 99]	Acetate, Propionate, Butyrate
Indoles	Innate lymphoid cell development [8, 11, 14]	Indole, Indole-3-aldehyde
	Intraepithelial lymphocyte maintenance [9]	Indole, Indole-3-aldehyde
	Intestinal inflammation and permeability regulation [7, 12, 13]	Indole, Indole-3-propionate
	Neuroprotection [17, 18]	Indole-3-propionate
	Satiety signal [100]	Indole
Vitamins	T-regulatory cell inflammation response [101-109]	Retinoate, Vitamin B9
	Reduces the expression of IL-1β, IL-6, TNF-α, and MCP-1 by macrophages; promotes differentiation of monocytes towards M2 macrophages [110, 111]	Retinoate, Vitamin B9
	IgE downregulation [112-114]	Retinoate
	Development and differentiation of B cells, neutrophils, macrophages, dendritic cells, and ILC3 [115-122]	Retinoate
	Lymphocyte development and activity, CD4+ T cell effector function, CD8+ T cell expansion and survival, IL-22 synthesis by γδ T cells and ILC, and Th17 differentiation [23, 24, 123-129]	Retinoate, Vitamin B12, Adenosylcobalamin
Bile Salts	Bile synthesis inhibition [130, 131]	Glycine, Taurine
Bile Salts	Microbial overgrowth inhibition [25, 26]	Glycine, Taurine

Seeding and assembly of the neonatal gut microbiome occur primarily via vertical transmission from the mother, and are influenced by factors including the mother’s microbiome composition [28, 29], the mode of birth [29-33], and the composition of microbiome-modifying factors in the breastmilk and formula [34-36]. Hence, environmental and lifestyle factors that affect the maternal microbiome can alter the infant’s microbiome, with consequences for their susceptibility to disease [37-41]. Importantly, this relationship continues post-partum in breast-fed infants. For example, it is known that the mother’s diet influences the breastmilk microbiome and production of fatty acids and vitamins in the milk [42]. Recently, we demonstrated in breastfeeding mice that dietary fiber content in the chow during gestation and lactation significantly affects the composition of the milk microbiome, the specific microbial lineages vertically transmitted from the milk to the neonatal gut, neonatal immune development, and as a consequence, susceptibility to disease [43]. These findings raise two important questions: 1) What are the diet-associated milk characteristics that influence the assembly of the milk microbiome? and 2) How does this affect the resulting inheritable microbial immunoregulatory traits?

To address these questions, we used metabolomics and metagenomics to link dietary fiber-associated microbial substrates with microbiome-associated degradation genes and immunoregulatory traits in milk. Critically, we recovered metagenome-assembled genomes (MAGs) representing 90% of bacteria by relative abundance, including all major lineages, from the milk of breastfeeding mice fed high (HFD) and low-fiber diets (LFD). By assembling near-complete communities, we could then use MAG functional annotation to demonstrate that an enrichment of diverse carbohydrates in HFD milk is associated with a diverse microbiome wherein major lineages possess high glycoside hydrolase copy numbers, and immunoregulatory traits have high redundancy across the community. By contrast, we demonstrate that amino acid enriched LFD milk is associated with a small number of microbial lineages, each with a diverse array of peptidases, but a limited range of immunoregulatory traits. Our findings identify a critical interplay between maternal diet-associated milk metabolites, milk microbiome assembly, and the resulting inheritable immunoregulatory traits of the neonatal gut microbiome.

Mouse strains and diets

Wildtype (WT) C57BL/6J mice (from Animal Resources Centre, WA, Australia) were housed at the specific pathogen-free animal facility at QIMR Berghofer Medical Research Institute, and experiments were approved by the animal ethics committees of QIMR Berghofer Medical Research Institute, or University of Queensland, Australia.

Breeding-age females were fed either a high-fiber diet (HFD) or low-fiber diet (LFD) from three weeks prior to timed mating and until the end of the study. HFD and LFD chow, which were identical except for the carbohydrate makeup (Table S1), were purchased from Specialty Feeds (Western Australia, Australia). Briefly, HFD carbohydrates were a mix of crude fiber, acid detergent fiber, cellulose, and wheat starch, while LFD carbohydrates were a calorically equivalent quantity of glucose monohydrate.

Milk collection and enrichment

Milk was collected from lactating HFD- or LFD-fed mothers at post-partum day seven. Briefly, dams were separated from their litter then injected i.p. with oxytocin (2 IU/kg). After induction of anaesthesia via injection (i.p. route) of a mixture of xylazine (10 mg/kg) and ketamine (100 mg/kg), the fur was removed and the teats and surrounding area cleaned with ethanol swabs to reduce contamination. Gentle pressure was applied to the nipple to release the milk (for microbiome samples, the areola was first punctured with a needle), which was collected into sealed serum bottles containing 30% anaerobic glycerol in PBS (for later enrichment) or without glycerol (for later metabolite and microbiome analysis) and stored at -80°C prior to further processing.

Anaerobic yeast-casitone-fatty-acids broth was prepared as described [52]. The broth was inoculated separately with milk collected from HFD- or LFD-fed mothers and incubated anaerobically at 37°C for 48 hours. Enriched cultures were centrifuged for 5 min at 7,500 g to obtain bacterial pellets, which were then washed twice with anaerobic Ringer’s solution. Final pellets were resuspended with a sufficient volume of sterile 30% (v/v) glycerol in PBS to produce an optical density of 1.0 at 600 nm and were stored at -80°C for future experiments.

Metabolite extraction and gas chromatography-mass spectrometry (GC-MS) analysis

20 µL of mouse milk was added into a 1.5 mL Eppendorf tube on ice. Metabolites were extracted by addition of 100 µL of methyl-tert-butyl-ether:methanol (1:1 v/v) containing internal standard U-¹³C-sorbitol (16.6 µM). Samples were then vortexed for 1 min and centrifuged for 10 min at 16,100 g, 7°C. Supernatant was then transferred to a fresh 1.5 mL Eppendorf tube, and a 20 µL aliquot of the supernatant was dried under vacuum (Eppendorf Concentrator Plus) in glass pulled point inserts. A 10 µL aliquot from each sample was also pooled to generate three pooled biological quality control samples (PBQCs), each containing 20 µL of the vacuum dried mixture. Samples and PBQCs were derivatized by methoxyamination with 25 µL methoxyamine (30 mg/mL in pyridine, 2 h, 37°C, 900 rpm, Eppendorf ThermoMixer C), followed by trimethylsilylation with 25 µL BSTFA + 1% TMCS (45 min, 37°C, 900 rpm, Eppendorf ThermoMixer C).

GC-MS analysis was performed on a Shimadzu GC/MS-TQ8050 NX system. 1 µL of derivatized sample was injected into the GC inlet set at 280°C in 1:10 split mode, and chromatographically separated using an Agilent DB-5 ms capillary column (30 m × 0.25 mm × 1 µm) with helium at 1 mL/min. A 100°C starting temperature was held for 4 min, then ramped at 10°C/min to 320°C and held for 11 min. Compounds were fragmented by electron impact ionization and analyzed in MRM mode using the Shimadzu Smart Metabolites Database containing 475 MRM metabolite targets (https://www.shimadzu.com/an/gcms/metabolites/index.html). A high-quality matrix was manually curated using the Shimadzu LabSolutions Insight GCMS program (v.3.7 SP3), where metabolites not present in all samples were removed from the dataset. Compound abundances were normalized within each sample by first scaling by the percent difference of internal standard abundances between the sample and the pool average, and then subtracting the average blank abundance of each compound.

DNA extraction and sequencing

After thawing on ice, total DNA was extracted from 100 µL of either raw or enriched milk using an adaptation of a previously described method [53]. Briefly, the thawed sample was transferred into a sterile 2 mL screw cap tube containing 0.4 g of sterile zirconia beads. After adding 600 µL of lysis buffer (50 mM EDTA, 50 mM Tris-HCl, pH 8.0, 500 mM NaCl, and 4% SDS), tubes were homogenized in a Precellys 24 Tissue Homogenizer at 4°C and 5000 rpm for 3 x 60 s with 30 s rest. The lysates were incubated for 15 min at 70°C, with mixing by inversion once every 5 min. The lysate was then centrifuged for 5 min at 13,200 g and 4°C, and the supernatant was mixed with 30 µL Proteinase K (Maxwell 16 LEV Blood DNA Kit, Promega) in a new 1.5 mL Eppendorf tube, and vortexed for 30 s. The mixture was then incubated at 56°C for 20 min, and the remaining extraction steps were completed following the Maxwell 16 LEV Blood DNA cartridge protocol. After extraction, the remaining paramagnetic particles were removed by centrifugation for 2 min at 10,000 g at 4°C, and RNA was removed by incubation at 37°C for 15 min with RNase A (10 mg/mL final concentration; PureLink). DNA concentration was measured using the Qubit dsDNA BR Assay Kit (ThermoFisher Scientific), and quality was checked visually by agarose gel electrophoresis.

The V6-V8 hypervariable regions of microbial 16S rRNA genes were amplified using primers 926F (5’-AAA CTY AAA KGA ATT GRC GG-3’) and 1392wR (5’-ACG GGC GGT GWG TRC-3’) [54]. Both the reverse and forward primers were modified to include an Illumina overhang adapter at their 5’ ends to be compatible with Nextera XT indices i5 (Index 2) and i7 (Index 1), respectively, while the forward primers also contained an 8 bp unique molecular identifier (MID). PCR was performed using 2.5 µL template DNA in 1x AmpliTaq Gold 360 master mix (Applied Biosystems) with 200 µM of each primer, made up to a total volume of 20 µL with ultrapure water. PCRs were performed on a SimpliAmp 96-well Thermocycler (Applied Biosystems) using the following thermocycling conditions: 95°C for 8 min; then 35 cycles of 95°C for 20 s, 56°C for 30 s, 72°C for 45 s; followed by 72°C for 7 min. Amplification products were visualized by agarose gel electrophoresis, then purified using an 18% suspension of Sera-Mag Speed-beads Carboxyl Magnetic Beads (GE Healthcare), added in a ratio of 1.8 vol beads : 1 vol PCR product and resuspended in 25 µL water. PicoGreen dsDNA Quantification Kit (Invitrogen) was used to quantify the purified MID-barcoded amplicons, which were pooled in equimolar quantities and then subjected to dual indexing using the Nextera XT Index Kit (Illumina) according to the manufacturer’s guidelines, before purification as described above. The concentrations of the indexed amplicons were normalized and pooled. Finally, 16S rRNA gene amplicon libraries were sequenced using a MiSeq Reagent Kit v3 (600 cycle; Illumina) with 30% PhiX Control v3 (Illumina) at the Queensland Brain Institute (QBI), University of Queensland.

Whole genomic DNA (gDNA) was sequenced to enable genome recovery, species identification, and metabolic reconstruction. Libraries from gDNA were prepared and indexed using IDT® for Illumina Nextera DNA Unique Dual Indexes (Illumina, Australia). Libraries were purified using an 18% suspension of Sera-Mag Speed-beads Carboxyl Magnetic Beads (GE Healthcare) as described above and sequenced by the Australian Centre for Ecogenomics (ACE) Sequencing service.

Processing of 16S rRNA gene amplicon data

The sequencing data was processed via a modified UPARSE approach [55]. Briefly: i) forward reads were quality filtered and clustered to produce an OTU table with USEARCH (v10.0.240) [56] using default parameters; ii) SILVA SSU r138 [57] taxonomy was assigned using BLASTN (v2.3.0+) [58] within QIIME2 (v2017.9) [59]; and iii) the OTU table was filtered for non-bacterial sequences using BIOM [60]. Finally, the OTU table was rarefied to 1000 sequences for each sample, and the number of observed OTUs and Shannon’s diversity index were calculated using QIIME2.
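The rarefaction and alpha diversity step can be illustrated with a minimal sketch. This is not the QIIME2 implementation: the function names, the fixed random seed, the example counts, and the base-2 logarithm for Shannon's index are all assumptions made for illustration.

```python
import math
import random


def rarefy(counts, depth, seed=0):
    """Subsample OTU counts to `depth` reads without replacement."""
    if sum(counts) < depth:
        raise ValueError("sample has fewer reads than the rarefaction depth")
    # One list entry per read, labelled by OTU index.
    reads = [otu for otu, n in enumerate(counts) for _ in range(n)]
    picked = random.Random(seed).sample(reads, depth)
    rarefied = [0] * len(counts)
    for otu in picked:
        rarefied[otu] += 1
    return rarefied


def observed_otus(counts):
    """Number of OTUs with at least one read."""
    return sum(1 for n in counts if n > 0)


def shannon(counts, base=2):
    """Shannon diversity index H = -sum(p * log(p)) over non-zero OTUs."""
    total = sum(counts)
    return -sum((n / total) * math.log(n / total, base) for n in counts if n > 0)


sample = [500, 1500, 0, 3000]          # hypothetical OTU counts for one sample
rarefied = rarefy(sample, depth=1000)  # 1000 reads, the depth given in the text
print(observed_otus(rarefied), round(shannon(rarefied), 3))
```

Rarefying every sample to the same depth before computing these metrics keeps richness and diversity comparable between samples with different sequencing yields.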

Metagenome-assembled genome (MAG) assembly

Raw reads were quality-trimmed to ≥ 10 trailing bases and an average of ≥ Q15 across a four-base window with Trimmomatic (v0.36) [61]. Trimmed reads of < 75 bases were then discarded. Cleaned reads were then assembled using four different methods in parallel (all using default parameters): individual sample assembly with metaSPAdes (v3.11.0) [62]; individual sample assembly with MEGAHIT (v1.1.2) [63]; co-assembly of replicate samples with metaSPAdes; and co-assembly of replicate samples with MEGAHIT. Metagenome-assembled genomes (MAGs) were then produced from each assembly using differential coverage binning, whereby all reads from all samples were mapped to an assembly using BWA mem (v0.7.17-r1188) [64], mapped reads were sorted using Samtools (v1.9) [65] sort, and finally the assembly’s contigs were binned based on sample-to-sample differential coverage with MetaBAT (v2.12.1) [66].

Quality statistics were evaluated for all 568 resulting MAGs using the CheckM (v1.0.7) [67] lineage workflow, and then a final set of high quality (Completeness – 4·Contamination ≥ 80%) and partial (Completeness – 4·Contamination < 80% and Contamination < 10%) MAGs were dereplicated at 99% average nucleotide identity with dRep (v2.6.2) [68], whereby final MAGs were selected from secondary clusters based on the following priority: 1) single-assembled raw milk MAGs; 2) co-assembled raw milk MAGs; 3) single-assembled enriched milk MAGs; 4) co-assembled enriched milk MAGs. Finally, GTDB r202 taxonomy [69] was assigned using the GTDB-Tk (v1.7.0) [70] de-novo workflow.
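The quality thresholds above can be written as a short classification rule. The sketch below assumes CheckM completeness and contamination are given as percentages; the function name and the "rejected" label for MAGs that meet neither criterion are illustrative and not taken from the study.

```python
def classify_mag(completeness, contamination):
    """Classify a MAG using the quality score Completeness - 4 * Contamination."""
    score = completeness - 4 * contamination
    if score >= 80:
        return "high quality"
    if contamination < 10:
        # Score below 80 but contamination under 10%.
        return "partial"
    return "rejected"  # assumed label for MAGs meeting neither criterion


# Hypothetical CheckM estimates: (completeness %, contamination %)
for estimates in [(95, 2), (70, 3), (90, 12)]:
    print(estimates, classify_mag(*estimates))
```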

To check whether dereplicated MAGs represented the same strains in both raw and enriched communities, consensus strain variants for each MAG were called and compared between samples. Trimmed reads from each sample were mapped to a concatenated file containing all final MAGs, ensuring that all read-MAG pairings were unique. Reads were mapped with BWA mem, with off-diagonal X-dropoff of 120 and end clipping penalty of 7, and the resulting sam files were then separated by MAG. Next, genotype likelihoods were assessed with Samtools sort followed by mpileup. These were then used to call variant genomes with BCFtools call, followed by vcfutils.pl vcf2fq, and finally seqtk (v1.2-r94) seq to convert to fasta format. To compare the raw and enriched variants, sample-by-sample fastANI (v1.33) [71] distance matrices were generated for each MAG and compared using PERMANOVA.

Host contamination and community profiling

Community abundance profiles were calculated from the distribution of raw reads both between kingdoms, and within bacteria. For the kingdom-level profile, host reads were first identified and removed from each sample’s raw reads by mapping to the Mus musculus C57BL/6J NCBI reference genome (GCF_000001635.27) using Bowtie2 (v2.3.4.1) [72] with ‘sensitive-local’ settings, and then using Samtools to sort (sort -n), filter (view -f 12 -F 256), and export (fastq) unmapped reads. Raw and filtered reads were counted with grep -c "@", and the number of host reads was calculated as the difference between raw and filtered counts.

Host-filtered reads were then used to assess MAG-associated reads using CoverM (v0.6.0) (genome -m trimmed_mean --bam-file-cache-directory), which scales MAG coverage by MAG size after excluding the 5th and 95th percentile of positional coverages for each MAG. The exported mapping files were used to filter and count MAG-associated reads from each sample using the same workflow used for host reads.
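The trimmed mean described above can be sketched as follows. This illustrates the idea rather than reproducing CoverM's code: the function name, the example depths, and the integer rounding of the trim boundaries are assumptions.

```python
def trimmed_mean(position_coverages, trim_percent=5):
    """Mean per-base coverage after dropping the lowest and highest
    `trim_percent` percent of positions."""
    if not position_coverages:
        raise ValueError("no positions supplied")
    ordered = sorted(position_coverages)
    n = len(ordered)
    k = n * trim_percent // 100  # positions dropped from each tail
    kept = ordered[k:n - k]
    return sum(kept) / len(kept)


# Hypothetical per-position depths for a 100 bp genome: a few uncovered
# positions and a few positions inflated by a repeat.
depths = [0] * 5 + [10] * 90 + [1000] * 5
print(trimmed_mean(depths))
```

Because the result is a per-position average, it is already normalized for genome length, and dropping both tails limits the influence of uncovered regions and of regions that attract reads from other genomes.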

The distribution of the remaining reads among the remaining kingdoms was then assessed using Kraken2 (v2.1.1) [73] with ‘report’ output, where the reference database was built from NCBI nr on 28 July 2021. It was assumed that the kingdom distribution of uncharacterized reads was proportional to that of the characterized reads, and so kingdom-characterized read counts were scaled to the count of total input reads. The bacterial reads measured by Kraken2 were counted as “unbinned” reads. These were summed with the MAG-associated reads to give total bacterial reads. No additional Animalia reads were identified, and so only M. musculus is represented in the results.

Finally, the kingdom-level reads distribution profiles were calculated as the percent distribution of read counts in each sample. Similarly, the MAG relative abundance profiles were calculated as the percent distribution of trimmed mean coverage within each sample, scaled by the relative proportions of MAG-associated and unbinned reads.
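One way to read the scaling of MAG relative abundances is sketched below. It assumes that each MAG's share of trimmed mean coverage is multiplied by the fraction of bacterial reads that are MAG-associated, with the remainder reported as unbinned. The function name, the "unbinned" key, and the example numbers are hypothetical, and the exact calculation used in the study may differ.

```python
def mag_relative_abundance(coverage, mag_reads, unbinned_reads):
    """Percent relative abundance per MAG, scaled by the binned read fraction."""
    binned_fraction = mag_reads / (mag_reads + unbinned_reads)
    total_coverage = sum(coverage.values())
    profile = {
        mag: 100 * binned_fraction * cov / total_coverage
        for mag, cov in coverage.items()
    }
    profile["unbinned"] = 100 * (1 - binned_fraction)
    return profile


# Hypothetical sample: two MAGs, 900 MAG-associated and 100 unbinned reads.
print(mag_relative_abundance({"MAG3": 30.0, "MAG15": 10.0}, 900, 100))
```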

MAG Richness

Assembled community richness was also used to estimate the proportion of raw milk lineages assembled into MAGs. A set of 24 single-copy bacterial marker genes were identified and clustered into OTUs from both the host-filtered reads and the MAGs using SingleM (v0.13.2) pipe. The percent of total lineages assembled into MAGs was then estimated by comparing the reads and MAGs OTUs using SingleM appraise.

MAG functional annotation

High quality, near complete MAGs were functionally annotated using the EnrichM (v0.6.3) annotate workflow. Briefly, open reading frames were identified and translated to protein sequences using Prodigal (v2.6.3) [74] in meta-mode. Protein sequences were then functionally annotated for either carbohydrate degradation, protein degradation, or general metabolic function. Carbohydrate degradation functions were identified using HMMER (v3.1b) [75] hmmsearch against the CAZy database [76]. Protein degradation and general metabolic functions were identified using Diamond (v0.9.36) [77] blastp search against the MEROPS [78] and KEGG [79] databases, respectively. The KEGG database was constructed by KO-annotation of the UniRef100 database [80]. All database alignments applied cutoffs of 1e-5 E-value, 30% identity match, 70% query alignment, and 70% reference alignment. CAZy, MEROPS, and KO gene counts from each MAG were then parsed into MAG-by-CAZy/MEROPS/KO (respectively) gene count matrices.

Key immunoregulatory metabolic pathways were defined based on the KEGG module definition structure using KO accession numbers, whereby genes that perform analogous reactions are separated by commas, and individual reactions are separated by spaces (Table S3). EnrichM classify was then used to compare the MAG-by-KO matrix to the pathway definitions, and a MAG-by-pathway presence/absence matrix was built whereby a pathway was present if the MAG could perform at least 80% of the reactions while missing no more than two. This allowed for the possibility of unassembled genes given the 80% quality score cutoff for “complete” MAGs.
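The presence rule can be illustrated with a minimal sketch. It handles only the flat definition form described above (spaces between reactions, commas between alternative genes) and not the parentheses or operators found in full KEGG module definitions. It is not EnrichM's implementation, and the KO identifiers are placeholders.

```python
def pathway_present(definition, mag_kos, min_fraction=0.8, max_missing=2):
    """Return True if the MAG encodes at least `min_fraction` of the
    reactions and lacks no more than `max_missing` of them."""
    # Each space-separated step is a set of interchangeable KOs.
    steps = [set(step.split(",")) for step in definition.split()]
    performed = sum(1 for alternatives in steps if alternatives & mag_kos)
    missing = len(steps) - performed
    return performed / len(steps) >= min_fraction and missing <= max_missing


# Placeholder five-step pathway; the first step has two alternative genes.
definition = "K90001,K90002 K90003 K90004 K90005 K90006"
print(pathway_present(definition, {"K90002", "K90003", "K90004", "K90005"}))
```

Applying both conditions matters for long pathways: a 15-step pathway with three missing reactions reaches 80% but fails the cap of two missing reactions.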

Cholesterol degradation to coprostanol, associated with the ismA gene, has no KEGG representative and contains a highly conserved active site. As such, a graftM (v0.13.1) [81] package was constructed from the IsmA confirmed, probable, and negative protein sequences reported by Kenny et al. [21] using Clustal Omega (v1.2.4) [82], HMMER hmmbuild, and graftM create. GraftM graft then hmmsearched, aligned, and grafted MAG protein sequences to the IsmA tree, filtering to ≥ 30 aligned residues and 1e-5 E-value.

Statistics and Data Visualization

Metabolite Log₂ Fold Change (L2FC) was calculated as log₂ of the average HFD divided by the average LFD metabolite abundances. P-values were generated via 2-way ANOVA with R (v4.1.2) stats using anova. Significant (p < 0.05) metabolites with L2FC ≥ 0.5 were then plotted with ggplot2 (v3.3.6) [83] using geom_bar, coord_flip, and theme_ipsum.
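The fold-change calculation is simple enough to show directly. The sketch below uses hypothetical abundance values and an illustrative function name; it covers only the L2FC itself, not the ANOVA or the plotting.

```python
import math


def log2_fold_change(hfd_abundances, lfd_abundances):
    """log2 of (mean HFD abundance / mean LFD abundance) for one metabolite."""
    mean_hfd = sum(hfd_abundances) / len(hfd_abundances)
    mean_lfd = sum(lfd_abundances) / len(lfd_abundances)
    return math.log2(mean_hfd / mean_lfd)


# Hypothetical normalized abundances for one metabolite across replicates.
print(log2_fold_change([4.0, 4.0], [1.0, 1.0]))  # positive: higher in HFD milk
print(log2_fold_change([1.0, 3.0], [8.0, 8.0]))  # negative: higher in LFD milk
```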

The 16S rRNA gene amplicon and MAG community relative abundance profiles were Hellinger-transformed, and then visualized as heatmaps in R with pheatmap (v1.0.12) [84]. Gene count and trait presence/absence profiles were also visualized with pheatmap using the row and column cluster features. MAG dendrograms were retained, while genes and traits were re-grouped by classification.

Reads distribution and alpha diversity metrics were plotted with ggplot2 using geom_bar, geom_errorbar, and theme_ipsum. Error bars represent 95% confidence intervals, calculated as the lower 2.5% probability tail t-stat (qt from stats) multiplied by the standard error (se in sciplot (v1.2-0) [85]). Post-hoc group labelling was generated by applying Tukey’s HSD adjustment with aov from stats.

Community beta diversities and functional distributions were assessed with vegan (v2.6-2) [86] using rda and cca after confirming significant group variances via PERMANOVA with adonis2 using Euclidean distances of either Hellinger-transformed (MAGs and traits) or z-score standardized (genes) abundances. The primary and secondary axis site and species scores were then plotted with ggplot2 using geom_point and stat_ellipse. The Mantel test was used to determine whether two sample sets were significantly similar using mantel from vegan on the Euclidean distance matrices of each sample set, generated with vegdist. Significant indicator features were identified by fitting the normalized abundance matrices of those features to the given ordination using envfit from vegan. Z-score standardized abundances were used for indicator metabolites. The features were then added to the ordination plots with ggplot2 with geom_point.
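For readers unfamiliar with the Hellinger transformation, the sketch below shows the transformation and the Euclidean distance computed on it. The study used vegan in R; this Python version is only an illustration with hypothetical abundance vectors and function names.

```python
import math


def hellinger(abundances):
    """Square root of each abundance divided by the sample total."""
    total = sum(abundances)
    return [math.sqrt(x / total) for x in abundances]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Two hypothetical samples over the same three lineages.
sample_a = hellinger([50, 30, 20])
sample_b = hellinger([5, 15, 80])
print(round(euclidean(sample_a, sample_b), 3))
```

After transformation every sample vector has unit length, so the distance between two samples ranges from 0 (identical composition) to the square root of 2 (no shared lineages), which dampens the influence of a few highly abundant lineages.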

Maternal diet associated with distinct milk metabolomes and microbiomes

We first characterized the key milk metabolites associated with maternal high- and low-fiber diets (Table S1). GC-MS open profiling of HFD and LFD milk identified 206 unique metabolites, including aldehydes, amino acids, benzenoids, indoles, long and short chain fatty acids, sterols, sugar alcohols, sugars, and a range of metabolic intermediates. While these metabolites were detected in all samples, their concentrations differed significantly between diets (p = 0.030). Indicator analysis revealed that HFD milk was associated with significantly higher concentrations of SCFAs, sugar alcohols, and sugars relative to LFD milk (Fig. 1A). By contrast, LFD milk was associated with significantly higher concentrations of aldehydes, amino acids, and poly- and un-saturated long chain fatty acids (LCFAs).

We next used 16S rRNA gene amplicon sequencing to profile the total HFD and LFD milk communities and investigate whether the distinct metabolome profiles were associated with changes in the microbiome. Maternal diet was associated with significant differences in the composition of bacterial communities (p = 0.030), where the HFD community was associated with higher concentrations of SCFAs and all classes of sugars, while the LFD community was associated with amino acids, aldehydes, and some SCFAs (Fig. 1B). The two communities had similar total observed OTUs, though the HFD community was significantly more diverse by Shannon Index (Fig. 1C). The largest compositional change between diets was a Streptococcus population (OTU12), which was the dominant LFD lineage (≥ 50% average relative abundance), but a minor lineage (< 5% average relative abundance) in HFD milk (Fig. 1D). By contrast, HFD-dominant lineage Parabacteroides (OTU4) was also a major lineage (5–50% average relative abundance) in LFD milk. Other HFD-major lineages Bacteroides (OTU11) and Erysipelotrichales (OTUs 16 and 115) were also less abundant in LFD milk, while milk from both diets harbored a major Faecalibaculum lineage (OTU17) that represented 15 ± 7% relative abundance irrespective of diet.

Milk culture enrichments enhance metagenome-assembled genome recovery to near-complete

We next shotgun sequenced the high- and low-fiber milk to recover and functionally annotate metagenome-assembled genomes (MAGs). Seven high quality MAGs and one partial MAG were assembled (Table S2), representing 88 ± 6% of the communities by relative abundance (Fig. 2A), but only 43 ± 2% of total estimated lineages (Fig. 2B), indicating that most of the low abundance lineages were not assembled into MAGs. We therefore cultured aliquots of each raw milk sample in YCFA general media to enrich and increase recovery of these minor lineages into MAGs. Before shotgun sequencing, we sequenced 16S rRNA gene amplicons of the enriched communities and verified that they were significantly similar to the raw communities (Mantel p = 0.013; Fig. S1A-C), indicating that shotgun sequencing data from enrichment communities could be used to supplement raw community genome recovery. Enrichment assemblies yielded an additional 14 high quality MAGs and one partial MAG (Table S2), and improved community recovery to 90 ± 10% by relative abundance (Fig. 2A) and 74 ± 4% of estimated lineages (Fig. 2B).

To ensure that the enriched MAGs were representative of the corresponding raw sample lineages, raw reads from each sample were mapped to all MAGs and variant consensus genomes were called. ANI distance matrices of these variant genomes were then compared with PERMANOVA, which confirmed there were no significant differences between raw and enriched variants for any of the MAGs (Fig. S2).

The MAG coverage-based community profile (Fig. 3A) recapitulated the major structural elements of the 16S rRNA gene amplicon profile (Fig. 1D), with Parabacteroides distasonis (MAG15), Phocaeicola vulgatus (MAG9), Erysipelatoclostridiaceae (MAG2), and Dubosiella newyorkensis (MAG19) as the major HFD lineages and Streptococcus acidominimus (MAG3) and Parabacteroides distasonis (MAG15) as the dominant and major LFD lineages (respectively). In fact, all OTUs with ≥ 0.5% relative abundance in at least one sample matched a corresponding MAG, except for three lineages: Actinobacteriota (OTUs 77 and 375) and Rodentibacter (OTU9). Conversely, all MAGs present in raw milk could be linked to a raw milk OTU, either by directly linking GTDB and Silva accession numbers, or by matching taxonomy for those MAGs without a direct Silva link (Table S2). As with the 16S rRNA gene amplicon profile, maternal diet and the diet-associated metabolites were associated with significant differences in the composition of bacterial communities (p = 0.032; Fig. 3B). Relative abundances did differ between individual linked MAGs and OTUs, which is consistent with known copy number and amplification biases associated with 16S rRNA gene amplicon sequencing. This is reflected in the more pronounced differences in alpha diversity for the MAG profile, where the HFD community was > 3x more diverse with 1.7x as many observed MAGs (p < 0.001 each; Fig. 3C). Despite these individual lineage differences, the total MAG and OTU community profiles were significantly similar (Mantel p < 0.001).

In addition to the bacterial community profiles, DNA from Mus musculus C57BL/6J accounted for 6 ± 8% of HFD and 92 ± 6% of LFD raw milk reads (Fig. S3A), while viruses, archaea, and fungi each accounted for 0.04–0.6% (Fig. S3B). There was no significant difference in either the fungal or archaeal distributions between HFD and LFD raw milk. Viruses, on the other hand, were more prevalent in HFD (0.44 ± 0.01% vs 0.05 ± 0.02%) but were not explored further.

Milk metabolomes were associated with distinct profiles of substrate degradation genes

We split the microbiome functional analysis into substrate degradation and immunoregulatory traits. MAG annotation identified 82 unique CAZy and 359 unique MEROPS genes in total (Table S2). To determine which of these degradation genes were significantly associated with each of the two diets, gene abundances were calculated by multiplying MAG relative abundances with their gene counts, and then summing by gene. Indicator genes were then identified by plotting a CCA ordination of the z-score standardized gene abundances as a function of diet (Fig. 4A), and then mapping the z-score vectors back to the CCA. Indicator genes are those whose vectors align to one or the other diet with p < 0.001. The HFD milk, with its greater concentration and diversity of carbohydrates, was associated with a community harboring more glycoside hydrolases (Fig. 4B). By contrast, the LFD milk, which had a greater concentration and diversity of amino acids, was associated with a community encoding higher counts of all classes of peptidases.
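The abundance weighting used here (MAG relative abundance multiplied by gene count, then summed by gene) amounts to a weighted sum over MAGs. A minimal sketch for one sample follows; the MAG names, gene labels, and values are hypothetical.

```python
def community_gene_abundance(mag_abundance, gene_counts):
    """Sum, for each gene, of MAG relative abundance times gene copy number."""
    totals = {}
    for mag, abundance in mag_abundance.items():
        for gene, copies in gene_counts.get(mag, {}).items():
            totals[gene] = totals.get(gene, 0.0) + abundance * copies
    return totals


# Hypothetical sample with two MAGs and their annotated gene copy numbers.
mag_abundance = {"MAG_A": 0.5, "MAG_B": 0.25}
gene_counts = {
    "MAG_A": {"GH2": 4, "S16": 1},
    "MAG_B": {"GH2": 2},
}
print(community_gene_abundance(mag_abundance, gene_counts))
```

Repeating this for every sample gives a sample-by-gene abundance matrix, which is the kind of input the z-score standardization and CCA ordination operate on.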

MAG degradation gene profiles clustered by phylum (Fig. 4C). Within CAZy genes, Bacteroidota lineages generally possessed a greater number and diversity of glycoside hydrolases, especially GH2, GH95, and GH97. GH77, however, was nearly exclusive to the Firmicutes. The MEROPS profiles were more evenly distributed, though with distinct clusters within the phyla. Bacteroidota tended to have more copies of the serine peptidases S01.UPC and S16.UNW. Firmicutes, by contrast, tended to have more copies of the metallo peptidases M15.UPB, M41.UPW, and M79.UPW. LFD-dominant S. acidominimus possessed the most diverse range of peptidase enzymes of all the MAGs, with a particularly distinctive collection of cysteine, metallo, and serine peptidases.

Milk metabolites influence the type and diversity of inheritable immunoregulatory traits

Finally, we explored the distribution of metabolic and immunoregulatory pathways across MAGs by annotating KO genes from the KEGG database. MAG annotation identified 3149 unique KO genes, which were classified into 235 unique KEGG and custom modules (Table S2). To determine whether the abundance profiles of MAG immunoregulatory traits were significantly similar to the concentration profiles of metabolites in those pathways, immunoregulatory traits were defined from KEGG gene annotation and compound IDs (Table S3). The trait abundance profiles were significantly similar to the profile of average concentrations for each pathway (Mantel p = 0.029), indicating that trait abundance is an important marker of active community function.
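For readers unfamiliar with the Mantel test used above, the following Python sketch shows a permutation-based version that correlates two distance matrices. It is an illustration only: it assumes a Pearson correlation with a one-sided test and 999 permutations, and the function name and toy coordinates are hypothetical. The distance metric and software settings used for the reported p-values are not stated in this section.

```python
import numpy as np

def mantel(d1, d2, permutations=999, seed=0):
    """One-sided permutation Mantel test between two square distance matrices."""
    d1 = np.asarray(d1, dtype=float)
    d2 = np.asarray(d2, dtype=float)
    n = d1.shape[0]
    iu = np.triu_indices(n, k=1)                  # upper triangle, diagonal excluded
    r_obs = np.corrcoef(d1[iu], d2[iu])[0, 1]     # observed matrix correlation
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(permutations):
        perm = rng.permutation(n)
        d1_perm = d1[perm][:, perm]               # permute rows and columns together
        r_perm = np.corrcoef(d1_perm[iu], d2[iu])[0, 1]
        if r_perm >= r_obs:
            hits += 1
    p_value = (hits + 1) / (permutations + 1)     # the observed value counts as one permutation
    return r_obs, p_value

# Toy example (hypothetical): eight samples placed at random coordinates
pts = np.random.default_rng(1).normal(size=(8, 2))
dist_a = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
dist_b = 2.0 * dist_a                             # a perfectly proportional second matrix

r, p = mantel(dist_a, dist_b)
print(r, p)
```

Rows and columns are permuted together so that each permutation relabels samples rather than shuffling individual distances, which preserves the dependence structure within each matrix.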

Indicator traits for central metabolism and immunoregulation were identified using a similar approach to that used for indicator degradation genes (Figs. S4A and 5A). Indicator traits for central metabolism were generally evenly distributed between the diets, though, surprisingly, the LFD community had a greater diversity of “other” carbohydrate biosynthesis and degradation pathways (Fig. S4B-C).

While several SCFA and vitamin traits were present in both diets, the HFD community possessed a greater number and diversity of immunoregulatory functions from all trait classes (Fig. 5B). In fact, only the acetyladenylate, propanyladenylate, and indole-3-acetamide pathways were significantly associated with the LFD community.

As with the degradation genes, MAG immunoregulatory trait profiles clustered by phylum (Fig. 5C). Indole and bile acid traits were evenly dispersed, while Bacteroidota lineages possessed a greater number and diversity of SCFA and vitamin classes. Once again, the LFD-dominant S. acidominimus was distinctive in that it had a particularly small assortment of immunoregulatory traits, with one pathway to produce indole-3-aldehyde, two to produce both acetate and propionate, and one to produce vitamin B9.

Notably, the cholesterol degradation gene ismA was not identified in any MAG. This is surprising given the range of other key gut microbiome traits found in the HFD milk community, as well as the exceptionally low cholesterol content in the HFD relative to LFD milk (Fig. 1A). To verify this result, we performed an HMM search of the unassembled metagenomes, but still found no ismA copies.

Carbohydrate and amino acid complexity is associated with milk microbiome community structure

While only the structural complexity of the carbohydrates differentiated the high- and low-fiber diets (Table S1), this difference resulted in distinct metabolite profiles in the milk. Mothers fed the high-fiber diet, which was rich in complex fibers and starches, produced milk with significantly greater and more diverse sugar, sugar alcohol, and SCFA content (Fig. 1A). By contrast, LFD mothers, whose only source of carbohydrates was the monosaccharide glucose, produced milk enriched in amino acids, poly/unsaturated LCFAs, and cholesterol.

Fiber and starch are composed of a complex web of interlinked sugar monomers. Both host and microbes consume these sugars for energy through central carbon metabolism, but only after they have been released from the polysaccharide meta-structure. Different carbohydrate-active enzymes (CAZymes) are required to hydrolyze, de-esterify, or otherwise lyse the array of bonds linking individual monosaccharides. Individual species tend to possess only a limited range of both CAZymes and monosaccharide degradation pathways, and so fiber and starch-rich environments tend to promote high microbial diversity [44] as was observed in this study (Fig. 1C). Moreover, the HFD community possessed a greater diversity of polysaccharide-degrading glycoside hydrolases (Fig. 4B), consistent with the greater carbohydrate content in the HFD milk.

The amino acid-rich LFD milk, by contrast, coincided with a community rich in peptidases (Fig. 4B). This is highlighted by the LFD-dominant S. acidominimus, which possessed 27 of 48 indicator peptidases compared to just 20 or fewer in the other major lineages (Fig. 4C). This complex and diverse collection of peptidases likely allowed S. acidominimus to dominate the amino acid-rich LFD milk community.

Several theories exist about the origin of milk microbiota, including retrograde transfer [45, 46], infant oral contact [47, 48], and the entero-mammary pathway [49, 50]. As such, the time scale for mammary microbial proliferation and metabolite production is not clear, and milk metabolites likely result from a combination of both host and microbial metabolism. For instance, microbial communities tend to fully degrade simple glucose substrates, such as that in the low-fiber chow, much more rapidly than complex mixed fiber and starch substrates, such as that in the high-fiber chow [51]. This slower degradation of fibers and starches by the gut microbiomes of HFD mums therefore likely produced a complex mix of sugars that were absorbed into the bloodstream, transported to the mammary glands, and finally observed in the HFD milk. By contrast, the gut microbiomes of LFD mums likely consumed the majority of the simple glucose substrate before it could be absorbed into the bloodstream. The origin of the increased amino acids in LFD milk is not clear; however, the significant association between enrichment of substrates and corresponding degradation genes in the milk of both diets suggests that the assembly of the milk microbiome is strongly influenced by the substrates available within the milk. Additional experiments testing gradients of substrates in the milk are needed to verify this assertion.

Maternal high fiber diet promotes a milk microbiome with diverse inheritable immunoregulatory traits

The distribution of immunoregulatory traits between the HFD and LFD communities points to potential causal factors for the inhibited immune development associated with the LFD community [43] (Fig. 6). As with the glycoside hydrolase genes, the HFD community possesses a greater number and diversity of immunoregulatory functions, with at least one indicator pathway for every immunoregulatory product except indole-3-propionate and coprostanol. By contrast, the LFD community was limited to only the acetate, propionate, and indole-3-aldehyde traits. The LFD community was also capable of vitamin B9 production, but distinctly lacked other immunoregulatory traits. Much of this was likely due to the near-monoculture of S. acidominimus within the LFD milk. Aside from the acetaldehyde pathway for acetate production, every single non-indicator or LFD indicator trait was reliant on presence in S. acidominimus. By contrast, most HFD indicator traits were spread between the major HFD lineages, while two (indole and methylglyoxal) were in fact only found in the minor lineages. Moreover, S. acidominimus possessed none of the HFD indicator traits.
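The reliance of traits on a single lineage, as described above for S. acidominimus, can be expressed as a simple redundancy count over a MAG-by-trait table. The Python sketch below is a hypothetical illustration: the function names, MAG labels, and trait labels are placeholders and do not correspond to the MAGs or traits in this study.

```python
def carriers_per_trait(trait_table):
    """Map each trait to the set of MAGs that encode it.

    trait_table: dict mapping a MAG name to the set of traits it carries
    """
    carriers = {}
    for mag, traits in trait_table.items():
        for trait in traits:
            carriers.setdefault(trait, set()).add(mag)
    return carriers

def single_carrier_traits(trait_table):
    """Return the traits encoded by exactly one MAG, with that MAG's name."""
    return {
        trait: next(iter(mags))
        for trait, mags in carriers_per_trait(trait_table).items()
        if len(mags) == 1
    }

# Toy table (hypothetical placeholder labels)
toy_table = {
    "MAG_A": {"trait_1"},
    "MAG_B": {"trait_1", "trait_2"},
    "MAG_C": {"trait_3"},
}

print(carriers_per_trait(toy_table))
print(single_carrier_traits(toy_table))
```

Traits with a single carrier would be lost from the community if that lineage were absent, whereas traits with several carriers are functionally redundant.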

Intriguingly, of the detected immunoregulatory metabolites, the majority of those more concentrated in LFD milk were either pathway intermediates (4 of 7) or pathway substrates (2 of 7). This indicates that the LFD community lacked the ability to either take up or completely metabolize compounds from the milk. The low trait complexity of the LFD community suggests that this was at least partly due to lack of metabolic complexity, though the low carbohydrate content of the milk itself indicates that carbon limitation was also a factor.

Meanwhile, the high diversity of immunoregulatory traits in the HFD community likely explains the improved immune development and health outcomes associated with neonates whose mothers consume a fiber-rich diet [43]. The HFD community had an enrichment of pathways for all but two immunoregulatory metabolites, and this pathway diversity was distributed among both major and minor HFD lineages (Fig. 6). All four major HFD lineages could degrade bile and produce acetate, propionate, and vitamin B9. Indole-3-aldehyde and retinoate could be produced by all but P. distasonis, though it could metabolize vitamin B12 alongside P. vulgatus. The indole and methylglyoxal pathways were also indicative of the HFD community, though were only found in minor lineages.

Given the large cholesterol reduction in HFD compared to LFD milk (Fig. 1A), it is striking that the ismA cholesterol degradation gene was not detected in the HFD community. Gut bacterial ismA converts cholesterol to coprostanol, which mediates host bile production and serum cholesterol levels [21, 22]. The highly conserved active site in all known versions of the cholesterol-degrading ismA gene [21] may simply indicate that our annotation model for this gene was incomplete. Further research is therefore needed to describe the full range of microbial cholesterol degradation gene variants.

While our understanding of gut microbiome colonization is still rudimentary [27], we have recently shown that milk lineages (and their traits) directly colonize the neonate gut microbiome and contribute to its immunoregulatory function [43]. The diversity of traits found in the HFD milk community is therefore likely to have positive implications for neonatal gut microbiome assembly, haematopoiesis of lymphoid and myeloid cells, immune cell differentiation, and the production of neuroprotectants, antibiotics, and bile (Table 1).

Here, we have shown that individual milk microbes tend to possess a limited range of immunoregulatory traits. Hence, a diverse milk microbiome is imperative for neonates to inherit a full complement of immunoregulatory traits. This milk microbiome diversity appears to be promoted by the complexity of fiber and starch substrates which are incompletely degraded in the maternal gut, and then transported in the bloodstream to the milk. Experiments linking milk substrate gradients to milk microbiome profiles would confirm this causal link between milk carbohydrate complexity and the complexity of milk microbiomes and their immunoregulatory traits. Together, our findings highlight how the mother’s diet influences the composition of the microbiome and the potential vertical transmission of immunoregulatory traits from mothers to infants.

Acknowledgements

Adam Skarshewski and Young Song are gratefully acknowledged for technical assistance. This work was supported by an NHMRC of Australia project grant awarded to SP and PGD.

Competing interests

The authors declare that they have no competing interests.

Data availability

The raw DNA sequencing data and metagenome-assembled genomes are available at NCBI-SRA as BioProject PRJNA671760, and the raw GC-MS data are supplied in the Supplementary Data.

Author contributions

RDH, RR, MAAS, TS, SP, and PGD participated in designing the experiments. RR and MAAS performed mouse experiments and sampled milk. RDH, RR, and MAAS extracted DNA. RL performed PCRs and prepared libraries for amplicon and shotgun sequencing. TS developed and carried out the GC-MS method. RDH carried out the bioinformatics and statistical analysis with significant input from PGD. RDH wrote the manuscript with significant input from PGD and all other authors.

References

Fan Y, Pedersen O. Gut microbiota in human metabolic health and disease. Nat. Rev. Microbiol. 2020;19(1):55–71.
Lynch SV, Pedersen O. The Human Intestinal Microbiome in Health and Disease. New Engl. J. Med. 2016;375(24):2369–2379.
Thursby E, Juge N. Introduction to the human gut microbiota. Biochem. J. 2017;474(11):1823–1836.
Miyauchi E, Shimokawa C, Steimle A, Desai MS, Ohno H. The impact of the gut microbiome on extra-intestinal autoimmune diseases. Nat. Rev. Immunol. 2022;23:9–23.
Renz H, Skevaki C. Early life microbial exposures and allergy risks: opportunities for prevention. Nat. Rev. Immunol. 2021;21(3):177–191.
Fujimura KE, Sitarik AR, Havstad S, Lin DL, Levan S, Fadrosh D, et al. Neonatal gut microbiota associates with childhood multisensitized atopy and T cell differentiation. Nat. Med. 2016;22(10):1187–1191.
Bansal T, Alaniz RC, Wood TK, Jayaraman A. The bacterial signal indole increases epithelial-cell tight-junction resistance and attenuates indicators of inflammation. P. Natl. Acad. Sci. USA. 2010;107(1):228–233.
Lee JS, Cella M, McDonald KG, Garlanda C, Kennedy GD, Nukaya M, et al. AHR drives the development of gut ILC22 cells and postnatal lymphoid tissues via pathways dependent on and independent of Notch. Nat. Immunol. 2012;13(2):144.
Li Y, Innocentin S, Withers DR, Roberts NA, Gallagher AR, Grigorieva EF, et al. Exogenous stimuli maintain intraepithelial lymphocytes via aryl hydrocarbon receptor activation. Cell. 2011;147(3):629–640.
Macia L, Tan J, Vieira AT, Leach K, Stanley D, Luong S, et al. Metabolite-sensing receptors GPR43 and GPR109A facilitate dietary fibre-induced gut homeostasis through regulation of the inflammasome. Nat. Commun. 2015;6(1):1–15.
Qiu J, Heller JJ, Guo X, Chen ZME, Fish K, Fu YX, et al. The aryl hydrocarbon receptor regulates gut immunity through modulation of innate lymphoid cells. Immunity. 2012;36(1):92.
Shimada Y, Kinoshita M, Harada K, Mizutani M, Masahata K, Kayama H, et al. Commensal bacteria-dependent indole production enhances epithelial barrier function in the colon. PloS One. 2013;8(11):e80604.
Venkatesh M, Mukherjee S, Wang H, Li H, Sun K, Benechet AP, et al. Symbiotic bacterial metabolites regulate gastrointestinal barrier function via the xenobiotic sensor PXR and Toll-like receptor 4. Immunity. 2014;41(2):296–310.
Zelante T, Iannitti RG, Cunha C, De Luca A, Giovannini G, Pieraccini G, et al. Tryptophan catabolites from microbiota engage aryl hydrocarbon receptor and balance mucosal reactivity via interleukin-22. Immunity. 2013;39(2):372–385.
Tanoue T, Atarashi K, Honda K. Development and maintenance of intestinal regulatory T cells. Nat. Rev. Immunol. 2016;16(5):295–309.
McCoy KD, Thomson CA. The Impact of maternal microbes and microbial colonization in early life on hematopoiesis. J. Immunol. 2018;200(8):2519–2526.
Ahmed H, Leyrolle Q, Koistinen V, Kärkkäinen O, Layé S, Delzenne N, et al. Microbiota-derived metabolites as drivers of gut–brain communication. Gut Microbes. 2022;14(1):2102878.
Ahmed S, Busetti A, Fotiadou P, Vincy Jose N, Reid S, Georgieva M, et al. In vitro characterization of gut microbiota-derived bacterial strains with neuroprotective properties. Front. Cell. Neurosci. 2019;13:402.
Paik D, Yao L, Zhang Y, Bae S, D’Agostino GD, Zhang M, et al. Human gut bacteria produce TH17-modulating bile acid metabolites. Nature. 2022;603(7903):907–912.
Song X, Sun X, Oh SF, Wu M, Zhang Y, Zheng W, et al. Microbial bile acid metabolites modulate gut RORγ+ regulatory T cell homeostasis. Nature. 2020;577(7790):410–415.
Kenny DJ, Plichta DR, Shungin D, Koppel N, Hall AB, Fu B, et al. Cholesterol metabolism by uncultured human gut bacteria influences host cholesterol level. Cell Host Microbe. 2020;28(2):245–257.e6.
Kriaa A, Bourgin M, Potiron A, Mkaouar H, Jablaoui A, Gérard P, et al. Microbial impact on cholesterol and bile acid metabolism: current status and future prospects. J. Lipid Res. 2019;60(2):323–332.
Cha HR, Chang SY, Chang JH, Kim JO, Yang JY, Kim CH, et al. Downregulation of Th17 cells in the small intestine by disruption of gut flora in the absence of retinoic acid. J. Immunol. 2010;184(12):6799–6806.
Gaboriau-Routhiau V, Rakotobe S, Lécuyer E, Mulder I, Lan A, Bridonneau C, et al. The key role of segmented filamentous bacteria in the coordinated maturation of gut helper T cell responses. Immunity. 2009;31(4):677–689.
Inagaki T, Moschetta A, Lee YK, Peng L, Zhao G, Downes M, et al. Regulation of antibacterial defense in the small intestine by the nuclear bile acid receptor. P. Natl. Acad. Sci. USA. 2006;103(10):3920–3925.
Lorenzo-Zúñiga V, Bartolí R, Planas R, Hofmann AF, Viñado B, Hagey LR, et al. Oral bile acids reduce bacterial overgrowth, bacterial translocation, and endotoxemia in cirrhotic rats. Hepatology. 2003;37(3):551–557.
Fricke WF, Ravel J. More data needed on neonatal microbiome seeding. Microbiome. 2022;10(1):88.
Ferretti P, Pasolli E, Tett A, Asnicar F, Gorfer V, Fedi S, et al. Mother-to-infant microbial transmission from different body sites shapes the developing infant gut microbiome. Cell Host Microbe. 2018;24(1):133–145.
Podlesny D, Fricke WF. Strain inheritance and neonatal gut microbiota development: A meta-analysis. Int. J. Med. Microbiol. 2021;311(3):151483.
Dominguez-Bello MG, de Jesus-Laboy KM, Shen N, Cox LM, Amir A, Gonzalez A, et al. Partial restoration of the microbiota of cesarean-born infants via vaginal microbial transfer. Nat. Med. 2016;22(3):250–253.
Korpela K, Costea P, Coelho LP, Kandels-Lewis S, Willemsen G, Boomsma DI, et al. Selective maternal seeding and environment shape the human gut microbiome. Genome Res. 2018;28(4):561–568.
Shao Y, Forster SC, Tsaliki E, Vervier K, Strang A, Simpson N, et al. Stunted microbiota and opportunistic pathogen colonization in caesarean-section birth. Nature. 2019;574(7776):117–121.
Stewart CJ, Ajami NJ, O’Brien JL, Hutchinson DS, Smith DP, Wong MC, et al. Temporal development of the gut microbiome in early childhood from the TEDDY study. Nature. 2018;562(7728):583–588.
Azad MB. Infant feeding and the developmental origins of chronic disease in the child cohort: role of human milk bioactives and gut microbiota. Breastfeed. Med. 2019;14(S1):S22–S24.
Fehr K, Moossavi S, Sbihi H, Boutin RCT, Bode L, Robertson B, et al. Breastmilk Feeding Practices Are Associated with the Co-Occurrence of Bacteria in Mothers’ Milk and the Infant Gut: the CHILD Cohort Study. Cell Host Microbe. 2020;28(2):285–297.e4.
McGuire MK, McGuire MA. Got bacteria? The astounding, yet not-so-surprising, microbiome of human milk. Curr. Opin. Biotechnol. 2017;44:63–68.
David LA, Maurice CF, Carmody RN, Gootenberg DB, Button JE, Wolfe BE, et al. Diet rapidly and reproducibly alters the human gut microbiome. Nature. 2013;505(7484):559–563.
Lim AI, McFadden T, Link VM, Han SJ, Karlsson RM, Stacy A, et al. Prenatal maternal infection promotes tissue-specific immunity and inflammation in offspring. Science. 2021;373(6558):eabf3002.
Voreades N, Kozil A, Weir TL. Diet and the development of the human intestinal microbiome. Front. Microbiol. 2014;5:494.
Yatsunenko T, Rey FE, Manary MJ, Trehan I, Dominguez-Bello MG, Contreras M, et al. Human gut microbiome viewed across age and geography. Nature. 2012;486(7402):222–227.
Zou ZH, Liu D, Li HD, Zhu DP, He Y, Hou T, et al. Prenatal and postnatal antibiotic exposure influences the gut microbiota of preterm infants in neonatal intensive care units. Ann. Clin. Microb. Anti. 2018;17(1):1–11.
Bravi F, Wiens F, Decarli A, Dal Pont A, Agostoni C, Ferraroni M. Impact of maternal nutrition on breast-milk composition: a systematic review. Am. J. Clin. Nutr. 2016;104(3):646–662.
Sikder MAA, Rashid RB, Ahmed T, Sebina I, Howard DR, Ullah MA, et al. The maternal microbiome regulates infant respiratory disease susceptibility via intestinal Flt3L expression and plasmacytoid dendritic cell hematopoiesis. bioRxiv. 2023;https://doi.org/10.1101/2023.01.05.522516
Chung WSF, Walker AW, Vermeiren J, Sheridan PO, Bosscher D, Garcia-Campayo V, et al. Impact of carbohydrate substrate complexity on the diversity of the human colonic microbiota. FEMS Microbiol. Ecol. 2019;95(1):fiy201.
Fernández L, Langa S, Martín V, Maldonado A, Jiménez E, Martín R, et al. The human milk microbiota: origin and potential roles in health and disease. Pharmacol. Res. 2013;69(1):1–10.
West PA, Hewitt JH, Murphy OM. Influence of methods of collection and storage on the bacteriology of human milk. J. Appl. Bacteriol. 1979;46(2):269–277.
Moossavi S, Azad MB. Origins of human milk microbiota: new evidence and arising questions. Gut Microbes. 2020;12(1):e1667722.
Moossavi S, Sepehri S, Robertson B, Bode L, Goruk S, Field CJ, et al. Composition and variation of the human milk microbiota are influenced by maternal and early-life factors. Cell Host Microbe. 2019;25(2):324–335.
Rodríguez JM, Fernández L, Verhasselt V. The gut-breast axis: programming health for life. Nutrients. 2021;13(2):606.
Selvamani S, Dailin DJ, Gupta VK, Wahid M, Keat HC, Natasya KH, et al. An insight into probiotics bio-route: Translocation from the mother’s gut to the mammary gland. Applied Sciences. 2021;11(16):7247.
Gupta M, Velayutham P, Elbeshbishy E, Hafez H, Khafipour E, Derakhshani H, et al. Co-fermentation of glucose, starch, and cellulose for mesophilic biohydrogen production. Int. J. Hydrogen Energ. 2014;39(36):20958–20967.
Browne HP, Forster SC, Anonye BO, Kumar N, Neville BA, Stares MD, et al. Culturing of ‘unculturable’ human microbiota reveals novel taxa and extensive sporulation. Nature. 2016;533(7604):543–546.
Yu Z, Morrison M. Improved extraction of PCR-quality community DNA from digesta and fecal samples. BioTechniques. 2004;36(5):808–812.
Engelbrektson A, Kunin V, Wrighton KC, Zvenigorodsky N, Chen F, Ochman H, et al. Experimental factors affecting PCR-based estimates of microbial species richness and evenness. ISME J. 2010;4(5):642–647.
Edgar RC. UPARSE: highly accurate OTU sequences from microbial amplicon reads. Nat. Meth. 2013;10(10):996–998.
Edgar RC, Bateman A. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26(19):2460–2461.
Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41:D590.
Zhang Z, Schwartz S, Wagner L, Miller W. A greedy algorithm for aligning DNA sequences. J. Comput. Biology. 2000;7(1–2):203–214.
Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotech. 2019;37(8):852–857.
McDonald D, Clemente JC, Kuczynski J, Rideout JR, Stombaugh J, Wendel D, et al. The Biological Observation Matrix (BIOM) format or: how I learned to stop worrying and love the ome-ome. GigaScience. 2012;1(1):2047-17X.
Bolger AM, Lohse M, Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120.
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. MetaSPAdes: A new versatile metagenomic assembler. Genome Res. 2017;27(5):824–834.
Li D, Liu CM, Luo R, Sadakane K, Lam TW. MEGAHIT: An ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31(10):1674–1676.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–1760.
Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2):giab008.
Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3:e1165.
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25(7):1043–1055.
Olm MR, Brown CT, Brooks B, Banfield JF. DRep: A tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 2017;11(12):2864–2868.
Parks DH, Chuvochina M, Rinke C, Mussig AJ, Chaumeil PA, Hugenholtz P. GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy. Nucleic Acids Res. 2022;50(D1):D785–D794.
Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019;36:1925–1927.
Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat. Commun. 2018;9(1):1–8.
Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10(3):1–10.
Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20(1):1–13.
Hyatt D, Chen GL, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11(1):1–11.
Wheeler TJ, Eddy SR. nhmmer: DNA homology search with profile HMMs. Bioinformatics. 2013;29(19):2487–2489.
Drula E, Garron ML, Dogan S, Lombard V, Henrissat B, Terrapon N. The carbohydrate-active enzyme database: functions and literature. Nucleic Acids Res. 2022;50(D1):D571–577.
Buchfink B, Reuter K, Drost HG. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Meth. 2021;18(4):366–368.
Rawlings ND, Barrett AJ, Thomas PD, Huang X, Bateman A, Finn RD. The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database. Nucleic Acids Res. 2018;46:D624–D632.
Kanehisa M, Furumichi M, Sato Y, Kawashima M, Ishiguro-Watanabe M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 2023;51(D1):D587–592.
Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH. UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics. 2015;31(6):926.
Boyd JA, Woodcroft BJ, Tyson GW. GraftM: a tool for scalable, phylogenetically informed classification of genes within metagenomes. Nucleic Acids Res. 2018;46(10):e59.
Madeira F, Pearce M, Tivey ARN, Basutkar P, Lee J, Edbali O, et al. Search and sequence analysis tools services from EMBL-EBI in 2022. Nucleic Acids Res. 2022;50(W1):W276–279.
Wickham H. ggplot2: Elegant Graphics for Data Analysis. In: Use R! series. Springer-Verlag. 2016; https://ggplot2.tidyverse.org
Kolde R. Package ‘pheatmap’. R Package. 2018;1
Morales M. Package ‘sciplot’. CRAN web host, online. 2015; https://www.rdocumentation.org/packages/sciplot.
Oksanen J, Blanchet FG, Kindt R, Legendre P, Minchin PR, O’Hara RB, et al. Package ‘vegan’. Community ecology package. 2013;2(9):1-295.
Arpaia N, Campbell C, Fan X, Dikiy S, van der Veeken J, Deroos P, et al. Metabolites produced by commensal bacteria promote peripheral regulatory T-cell generation. Nature. 2013;504(7480):451–455.
Furusawa Y, Obata Y, Fukuda S, Endo TA, Nakato G, Takahashi D, et al. Commensal microbe-derived butyrate induces the differentiation of colonic regulatory T cells. Nature. 2013;504(7480):446–450.
Kim M, Qie Y, Park J, Kim CH. Gut microbial metabolites fuel host antibody responses. Cell Host Microbe. 2016;20(2):202–214.
Smith PM, Howitt MR, Panikov N, Michaud M, Gallini CA, Bohlooly-Y M, et al. The microbial metabolites, short-chain fatty acids, regulate colonic T reg cell homeostasis. Science. 2013;341(6145):569–573.
Thorburn AN, McKenzie CI, Shen S, Stanley D, Macia L, Mason LJ, et al. Evidence that asthma is a developmental origin disease influenced by maternal diet and bacterial metabolites. Nat. Commun. 2015;6(1):1–3.
Alex S, Lange K, Amolo T, Grinstead JS, Haakonsson AK, Szalowska E, et al. Short-chain fatty acids stimulate angiopoietin-like 4 synthesis in human colon adenocarcinoma cells by activating peroxisome proliferator-activated receptor γ. Mol. Cell. Biol. 2013;33(7):1303–1316.
Brown AJ, Goldsworthy SM, Barnes AA, Eilert MM, Tcheang L, Daniels D, et al. The orphan G protein-coupled receptors GPR41 and GPR43 are activated by propionate and other short chain carboxylic acids. J. Biol. Chem. 2003;278(13):11312–11319.
Trompette A, Gollwitzer ES, Pattaroni C, Lopez-Mejia IC, Riva E, Pernot J, et al. Dietary fiber confers protection against flu by shaping Ly6c- patrolling monocyte hematopoiesis and CD8+ T cell metabolism. Immunity. 2018;48(5):992–1005.
Trompette A, Gollwitzer ES, Yadava K, Sichelstiel AK, Sprenger N, Ngom-Bru C, et al. Gut microbiota metabolism of dietary fiber influences allergic airway disease and hematopoiesis. Nat. Med. 2014;20(2):159–166.
Rodrigues HG, Takeo Sato F, Curi R, Vinolo MAR. Fatty acids as modulators of neutrophil recruitment, function and survival. Eur. J. Pharmacol. 2016;785:50–58.
Liu L, Li L, Min J, Wang J, Wu H, Zeng Y, et al. Butyrate interferes with the differentiation and function of human monocyte-derived dendritic cells. Cell. Immunol. 2012;277(1–2):66–73.
Millard AL, Mertes PM, Ittelet D, Villard F, Jeannesson P, Bernard J. Butyrate affects differentiation, maturation and function of human monocyte-derived dendritic cells and macrophages. Clin. Exp. Immunol. 2002;130(2):245–255.
Park J, Kim M, Kang SG, Jannasch AH, Cooper B, Patterson J, et al. Short-chain fatty acids induce both effector and regulatory T cells by suppression of histone deacetylases and regulation of the mTOR-S6K pathway. Mucosal Immunol. 2015;8(1):80–93.
Chimerel C, Emery E, Summers DK, Keyser U, Gribble FM, Reimann F. Bacterial metabolite indole modulates incretin secretion from intestinal enteroendocrine L cells. Cell Rep. 2014;9(4):1202–1208.
Bergstrom KSB, Kissoon-Singh V, Gibson DL, Ma C, Montero M, Sham HP, et al. Muc2 protects against lethal infectious colitis by disassociating pathogenic and commensal bacteria from the colonic mucosa. PLoS Pathog. 2010;6(5):e1000902.
Coombes JL, Siddiqui KRR, Arancibia-Cárcamo CV, Hall J, Sun CM, Belkaid Y, et al. A functionally specialized population of mucosal CD103+ DCs induces Foxp3+ regulatory T cells via a TGF-β– and retinoic acid–dependent mechanism. J. Exp. Med. 2007;204(8):1757–1764.
Hall JA, Cannons JL, Grainger JR, dos Santos LM, Hand TW, Naik S, et al. Essential role for retinoic acid in the promotion of CD4+ T cell effector responses via retinoic acid receptor alpha. Immunity. 2011;34(3):435.
Hall JA, Grainger JR, Spencer SP, Belkaid Y. The role of retinoic acid in tolerance and immunity. Immunity. 2011;35(1):13.
Kang SG, Lim HW, Andrisani OM, Broxmeyer HE, Kim CH. Vitamin A metabolites induce gut-homing FoxP3+ regulatory T cells. J. Immunol. 2007;179(6):3724–3733.
Kunisawa J, Hashimoto E, Ishikawa I, Kiyono H. A pivotal role of vitamin B9 in the maintenance of regulatory T cells in vitro and in vivo. PLoS One. 2012;7(2):e32094.
Mora JR, Iwata M, Eksteen B, Song SY, Junt T, Senman B, et al. Generation of gut-homing IgA-secreting B cells by intestinal dendritic cells. Science. 2006;314(5802):1157–1160.
Mucida D, Park Y, Kim G, Turovskaya O, Scott I, Kronenberg M, et al. Reciprocal TH17 and regulatory T cell differentiation mediated by retinoic acid. Science. 2007;317(5835):256–260.
Sun CM, Hall JA, Blank RB, Bouladoux N, Oukka M, Mora JR, et al. Small intestine lamina propria dendritic cells promote de novo generation of Foxp3 T reg cells via retinoic acid. J. Exp. Med. 2007;204(8):1775.
Kolb AF, Petrie L. Folate deficiency enhances the inflammatory response of macrophages. Mol. Immunol. 2013;54(2):164–172.
Vellozo NS, Pereira-Marques ST, Cabral-Piccin MP, Filardy AA, Ribeiro-Gomes FL, Rigoni TS, et al. All-trans retinoic acid promotes an M1-to M2-phenotype shift and inhibits macrophage-mediated immunity to Leishmania major. Front. Immunol. 2017;8:1560.
Dilillo DJ, Matsushita T, Tedder TF. B10 cells and regulatory B cells balance immune responses during inflammation, autoimmunity, and cancer. Ann. NY. Acad. Sci. 2010;1183:38–57.
Heine G, Hollstein T, Treptow S, Radbruch A, Worm M. 9-cis retinoic acid modulates the type I allergic immune response. J. Allergy Clin. Immun. 2018;141(2):650–658.
Seo GY, Lee JM, Jang YS, Kang SG, Yoon SI, Ko HJ, et al. Mechanism underlying the suppressor activity of retinoic acid on IL4-induced IgE synthesis and its physiological implication. Cell. Immunol. 2017;322:49–55.
Blair PA, Noreña LY, Flores-Borja F, Rawlings DJ, Isenberg DA, Ehrenstein MR, et al. CD19(+)CD24(hi)CD38(hi) B cells exhibit regulatory capacity in healthy individuals but are functionally impaired in systemic Lupus Erythematosus patients. Immunity. 2010;32(1):129–140.
di Caro V, Phillips B, Engman C, Harnaha J, Trucco M, Giannoukakis N. Retinoic acid-producing, ex-vivo-generated human tolerogenic dendritic cells induce the proliferation of immunosuppressive B lymphocytes. Clin. Exp. Immunol. 2013;174(2):302–317.
Hiemstra IH, Beijer MR, Veninga H, Vrijland K, Borg EGF, Olivier BJ, et al. The identification and developmental requirements of colonic CD169+ macrophages. Immunology. 2014;142(2):269–278.
Iwata Y, Matsushita T, Horikawa M, DiLillo DJ, Yanaba K, Venturi GM, et al. Characterization of a rare IL-10-competent B-cell subset in humans that parallels mouse regulatory B10 cells. Blood. 2011;117(2):530–541.
Klebanoff CA, Spencer SP, Torabi-Parizi P, Grainger JR, Roychoudhuri R, Ji Y, Sukumar M, et al. Retinoic acid controls the homeostasis of pre-cDC-derived splenic and intestinal dendritic cells. J. Exp. Med. 2013;210(10):1961–1976.
Mauri C, Gray D, Mushtaq N, Londei M. Prevention of arthritis by interleukin 10-producing B cells. J. Exp. Med. 2003;197(4):489–501.
Shrestha S, Kim SY, Yun YJ, Kim JK, Lee JM, Shin M, et al. Retinoic acid induces hypersegmentation and enhances cytotoxicity of neutrophils against cancer cells. Immunol. Lett. 2017;182:24–29.
Spencer SP, Wilhelm C, Yang Q, Hall JA, Bouladoux N, Boyd A, et al. Adaptation of innate lymphoid cells to a micronutrient deficiency promotes type 2 barrier immunity. Science. 2014;343(6169):432–437.
Guo Y, Pino-Lagos K, Ahonen CA, Bennett KA, Wang J, Napoli JL, et al. A retinoic acid-rich tumor microenvironment provides clonal survival cues for tumor-specific CD8(+) T cells. Cancer Res. 2012;72(20):5230–5239.
Guo Y, Lee YC, Brown C, Zhang W, Usherwood E, Noelle RJ. Dissecting the role of retinoic acid receptor isoforms in the CD8 response to infection. J. Immunol. 2014;192(7):3336–3344.
Kjer-Nielsen L, Patel O, Corbett AJ, le Nours J, Meehan B, Liu L, et al. MR1 presents microbial vitamin B metabolites to MAIT cells. Nature. 2012;491(7426):717–723.
le Bourhis L, Mburu YK, Lantz O. MAIT cells, surveyors of a new class of antigen: development and functions. Curr. Opin. Immunol. 2013;25(2):174–180.
Mielke LA, Jones SA, Raverdeau M, Higgs R, Stefanska A, Groom JR, et al. Retinoic acid expression associates with enhanced IL-22 production by γδ T cells and innate lymphoid cells and attenuation of intestinal inflammation. J. Exp. Med. 2013;210(6):1117–1124.
Pino-Lagos K, Guo Y, Brown C, Alexander MP, Elgueta R, Bennett KA, et al. A retinoic acid-dependent checkpoint in the development of CD4+ T cell-mediated immunity. J. Exp. Med. 2011;208(9):1767–1775.
Tamura J, Kubota K, Murakami H, Sawamura M, Matsushima T, Tamura T, et al. Immunomodulation by vitamin B12: augmentation of CD8+ T lymphocytes and natural killer (NK) cell activity in vitamin B12-deficient patients by methyl-B12 treatment. Clin. Exp. Immunol. 1999;116(1):28.
Kim I, Ahn SH, Inagaki T, Choi M, Ito S, Guo GL, et al. Differential regulation of bile acid homeostasis by the farnesoid X receptor in liver and intestine. J. Lipid Res. 2007;48(12):2664–2672.
Potthoff MJ, Boney-Montoya J, Choi M, He T, Sunny NE, Satapati S, et al. FGF15/19 regulates hepatic glucose metabolism by inhibiting the CREB-PGC-1α pathway. Cell Metab. 2011;13(6):729–738.
