Evolutionary relationships among bifidobacteria and their hosts and environments

doi:10.21203/rs.2.16551/v1

Download PDF

Research article

Evolutionary relationships among bifidobacteria and their hosts and environments

https://doi.org/10.21203/rs.2.16551/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 08 Jan, 2020

Read the published version in BMC Genomics →

You are reading this older preprint version

Read the latest preprint version →

Background The assembly of animal microbiomes is influenced by multiple environmental factors and host genetics, although the relative importance of these factors remains unclear. Bifidobacteria (genus Bifidobacterium , phylum Actinobacteria) are common first colonizers of gut microbiomes in humans and inhabit other mammals, social insects, food, and sewages. In humans, the presence of bifidobacteria in the gut has been correlated with health-promoting benefits. Here, we compared the genome sequences of a subset of the over 400 Bifidobacterium strains publicly available to investigate the adaptation of bifidobacteria diversity. We tested 1) whether bifidobacteria show a phylogenetic signal with their isolation sources (hosts and environments) and 2) whether key traits encoded by the bifidobacteria genomes depend on the host or environment from which they were isolated. We analyzed Bifidobacterium genomes available in the PATRIC and NCBI repositories and identified the hosts and/or environment from which they were isolated. A multilocus phylogenetic analysis was conducted to compare the genetic relatedness the strains harbored by different hosts and environments. Furthermore, we examined differences in genomic traits and genes related to amino acid biosynthesis and degradation of carbohydrates.

Results We found that bifidobacteria diversity appears to have evolved with their hosts as strains isolated from the same host were non-randomly associated with their phylogenetic relatedness. Moreover, bifidobacteria isolated from different sources displayed differences in genomic traits such as genome size and accessory gene composition and on particular traits related to amino acid production and degradation of carbohydrates. In contrast, when analyzing diversity within human-derived bifidobacteria, we observed no phylogenetic signal or differences on specific traits (amino acid biosynthesis genes and CAZymes).

Conclusions Overall, our study shows that bifidobacteria diversity is strongly adapted to specific hosts and environments and that several genomic traits were associated with their isolation sources. However, this signal is not observed in human strains alone. Looking into the genomic signatures of bifidobacteria strains in different environments can give insights into how this bacterial group adapts to their environment and what types of traits are important for these adaptations.

Epigenetics & Genomics

Bifidobacterium

Pan-genome

CAZymes

host-trait associations

Bacteria are central to the evolution and ecology of animals influencing their genomes, development, and physiology (1). The composition of bacterial communities in the animal gut are thought to be shaped by host physiology and diet on daily timescales, but also by host evolutionary history over much longer timescales (2–4). A major challenge in animal microbiome research is therefore to disentangle the ecological and evolutionary processes underlying the variation in gut communities. One approach to tackling these questions is to focus on a specific bacterial group within the larger gut community (5,6).

A widespread and abundant group of bacteria in mammalian guts is bifidobacteria. Bifidobacteria are gram-positive, anaerobic, saccharolytic bacteria, members of the genus Bifidobacterium of the phylum Actinobacteria (7). Their presence in the gut has been correlated with health-promoting benefits in humans and mouse models including the production of metabolites like vitamins and antioxidants, immune system development, and protection from certain gut diseases such as enterocolitis and acute diarrhea (8). In newborns, specific species of bifidobacteria are important for degrading human milk oligosaccharides (HMOs) derived from breast milk (9,10). The fermentation of HMOs promotes the wellness of infants and prevents colonization from potential pathogenic bacteria (11,12). Bifidobacteria also excel at degrading and fermenting carbohydrates (13,14). This process produces short-chain fatty acids (SCFAs) such as butyrate, acetate, and propionate, which have been linked to reducing the risk of inflammatory diseases, heart disease, type II diabetes, and other adverse conditions such as cancer (15).

Here, we take a comparative genomics approach to investigate the relationship between bifidobacteria diversity and their hosts and environments. Bifidobacteria are ubiquitous inhabitants of the gastrointestinal tract, vagina, and mouth of mammals, including humans and are also present in guts of insects such as bees (16,17). They have also been found in human blood, breast milk, and sewage (18–20). The genomic signatures of bifidobacteria strains in different environments can give insights into how this bacterial group adapts to their environment and what types of traits are important for these adaptations. The few studies that have considered the association between bifidobacteria diversity and their hosts and environments have found contradictory results. Some studies observe no relationship between hosts and the type of genes bifidobacteria carry (21,22), while others do (23–25).

We analyzed a subset of the 400 bifidobacteria genomes publicly available to answer two questions: 1) Do bifidobacteria show a phylogenetic signal with their isolation sources (hosts and environments)? and 2) Do key traits encoded by the bifidobacteria genomes depend on the host or environment from which they were isolated? The term “phylogenetic signal” generally refers to the tendency of related species to resemble one another more than they would resemble a species drawn randomly from the same phylogenetic tree (26,27).

Since most bacterial traits are phylogenetically conserved (28), our first hypothesis was that bifidobacteria are adapted to the hosts (and other environments) from which they are isolated. We predicted that this adaptation would be reflected in the phylogeny of bifidobacteria, despite horizontal gene transfer (HGT) and rapid evolution. Secondly, we hypothesized that bifidobacteria strains would further adapt to their environment through genomic signatures like genome size and overall composition of accessory genes, as well as the composition of particular traits. Genome size is broadly associated with different bacterial lifestyles (29–31), and accessory gene composition can capture horizontally transferred regions of the genome, which are thought to allow for rapid adaptation to a specific environment (32). We specifically focused on two particular classes of genes: amino acid biosynthesis genes and carbohydrate-active enzymes (CAZymes). The abundance and diversity of amino acid biosynthesis genes may vary as amino acids can be exchanged between different hosts and bacteria (33,34), allowing for the loss or gain of these genes. Bacterial CAZyme profiles are also known to vary by environment, suggesting a mechanism for bacteria to adapt to the local carbohydrate supply (35,36). Moreover, bifidobacteria are key degraders of carbohydrates in host guts, and we expected that strains might adapt to host diet.

Phylogenetic relationships between bifidobacteria strains and isolation sources

To investigate the phylogenetic relationships between bifidobacteria strains isolated from

different environments and hosts, two phylogenetic trees were constructed based on 107 concatenated core genes. These trees included one with 60 human-derived strains (Fig. 1A) and one with 129 strains from different environments and hosts (Fig. 1B).. In both trees, members of the same taxonomic species clustered closely, and the phylogenetic structure of the trees was similar to previous reports based on 16S rRNA sequences and based on various core genes (16,24,37–39). For instance, B. breve and B. longum strains were found to be closely related as well as B. bifidum and B. scardovii. One difference was thatthe B. asteroides phylogroup has been previously shown to be positioned in the deepest branches of the bifidobacteria lineage (16,24,40); however, in our human-derived strains phylogenetic tree the deepest branch corresponded to a member of the B. thermophilum species. In the larger tree, the deepest branches corresponded to strains from the B. simiarum, B. primatium, B. vansinderenii, and B. tissieri species followed by B. asteroides group.

The strains isolated from a variety of human stages and body locations showed no phylogenetic signal (ANOSIM: R = 0.022, p>0.05). For example, strains isolated from infants were not more genetically similar to one another than those isolated from adults (Fig. 1A).. Similarly, strains isolated from the blood were not more genetically similar to one another than those found in milk or in the urogenital tract.

By contrast, when comparing across multiple host species and environments, the habitat from which the strains were isolated was strongly associated with the bacteria’s phylogenetic distribution (Fig. 1B; ANOSIM:R = 0.420, p<0.001). For instance, bee, primate, and rodent derived strains are tightly clustered in the phylogenetic tree within their categories (Fig. 1B).. These broader evolutionary patterns seem particularly robust for strains isolated from the orders Artiodactyla (pig and cattle-derived strains), Hymenoptera (bee-derived strains), and Primates (human and non-human primate-derived strains) as they clustered mostly within the same branches (Fig. 1B).

Genomic features and content among isolation sources

Genome size analysis

Within the human-derived strains, genome size did not differ by the particular human habitat (e.g., urogenital or gut) or between different human stages (e.g., infant or elderly) (Fig. 2A; Kruskal-Wallis H = 10.428, p>0.05, df = 7). Conversely, strains isolated from diverse animal hosts and environments (e.g. primates, bees, wastewater, etc.) differed significantly in genome size (Fig. 2B; H = 26.244, p<0.01, df = 9). Strains isolated from non-human primates had the highest genome size (2.9 Mb + 0.19 SD), whereas strains isolated from bees had the lowest genome size (2.0 Mb + 0.21 SD).

Pangenome analysis

The analysis on 129 bifidobacteria strains revealed that their pangenome is composed of 438 core genes, 115 soft core genes, 1,802 shell genes, and 24,550 cloud genes, for a total of 26,905 gene clusters (Fig. 3). This resonates with previous studies with fewer genomes that found this genus to have between 400–500 core genes (16,24). The composition of accessory genes excluding the core genome and singletons (~6,400 genes), was associated with both the bacteria’s isolation source (ANOSIM: R = 0.394, p<0.001), and the phylogeny of the bifidobacteria strains (based on 107 core genes; RELATE test, Spearman’s ρ = 0.52, p<0.001).

Amino acid biosynthesis analysis

Beyond general genomic characteristics, we investigated how a variety of specific traits,

such as amino acid biosynthesis genes varied among the strains. There was a significant difference in abundance of amino acid biosynthesis genes between different animal hosts and environments (Fig. 4A; H = 62.216, p<0.001, df = 11) (post hoc Dunn’s test). For instance, bees showed the lowest abundance of amino acid biosynthesis genes (87 genes + 13 SD) while non-human primates showed the highest number (100 genes + 2.9 SD) (Fig. 4A)..

Furthermore, the diversity of amino acid biosynthesis genes also differed among hosts and environments (Fig. 4B; H = 76.594, p<0.001, df = 11) (post hoc Dunn’s test); the bee-derived strains showed the lowest diversity of amino acid biosynthesis genes (78 genes + 12 SD). Strains isolated from the other host categories carried between 86 and 90 genes (Fig. 4B)..

Carbohydrate-active enzymes (CAZymes)

Since bifidobacteria are known to be excellent degraders of complex carbohydrates, we

also searched for CAZymes in their genomes. On the one hand, the abundance of CAZymes among the different human-derived strains did not differ significantly (Fig. 5A; H = 9.6557, p>0.5, df = 7). On the other hand, when comparing strains derived across different hosts and environments, we found a significant difference between categories (Fig. 5B; H = 60.9, p<0.001, df = 11). In the human environments, the oral-derived strains encoded the highest number of CAZymes (103 genes + 2.8 SD), whereas strains derived from adults (gut-derived) encoded the lowest number (55.8 genes + 12 SD). Across all hosts and environments, non-human primates carried more CAZymes than any other host (84 genes + 20 SD), while wastewater exhibited the fewest (42 genes + 10 SD) (Fig. 5)..

Studying the diversity of bifidobacteria and their trait associations provides insights into the mechanisms that underlie their assembly within a larger microbial community. Bifidobacteria strains isolated from the same host or environment were non-randomly associated with their phylogenetic relatedness. This pattern is consistent with the hypothesis that bifidobacteria specialize, or at least prefer, particular hosts, in agreement with several other studies (19,24,41). For example, Lamendella et al. (19) found that bifidobacteria strains from the same host, including those isolated from birds and pigs, tended to cluster by clade. We also observed that all B. pseudolongum subsp. pseudolongum strains were isolated from pigs as previously noted (42). Similarly, bee-derived bifidobacteria clustered within two relatively deep branches (24). Notably, this clustering was not perfect; for instance, some primate-derived strains clustered with more ancient branches than the bee-derived strains, and rodent-isolated strains could be found within several clades. This pattern of imperfect clustering suggests that host-specialization of bifidobacteria has occurred several times within different branches of the genus. In addition, the clades of strains from mixed isolation sources may indicate that many bifidobacteria are not strict specialists but are capable of colonizing non-preferred host types (21).

The bifidobacteria genomes also reveal adaptation to their host environment through genomic signatures like accessory genes and specific gene sets, supporting our second hypothesis. Sun et al. (24) also observed that bifidobacteria isolated from bees, pigs, and humans shared unique sets of genes. However, the correlation we observed between accessory genes and isolation sources was weaker than the association with the phylogeny based on core genes to the whole genus. Thus, it appears that specialization by bifidobacteria to a host species is primarily determined by vertically inherited traits, whereas horizontal gene transfer of traits captured through accessory gene composition plays a secondary role.

More specifically, bifidobacteria strains isolated from different hosts differed in the abundance and diversity of amino acid biosynthesis genes. Notably, bee-derived strains encoded the lowest abundance and diversity of amino acid biosynthesis genes, while non-human primates encoded the highest. Similarly, the bee strains also showed the smallest genome size. Given that species isolated from bees dominate the more ancient lineages, bifidobacteria may have coevolved longer with bees than with other hosts (40). One might speculate a longer coevolutionary history allowed bee-derived bifidobacteria to lose genes by evolving to use amino acids and other nutrients produced by the host or other gut bacteria, similar to the selection for smaller genome sizes observed in obligate bacterial symbionts (30,34).

Bifidobacteria are also known to degrade a range of carbohydrates ranging from simple to complex molecules, and there was genomic evidence of carbohydrate specialization by bifidobacteria isolated from different hosts. In particular, strains isolated from primates (including humans) carry relatively high abundances of CAZyme encoding genes. This difference could be due to more varied, plant diets of primates as well as the complexity and diversity of their milk oligosaccharides (43).

While bifidobacteria strains appear to be adapted to different hosts, there was little evidence that they are adapted to particular habitats and life stages within humans. In particular, we expected that different strains might be adapted to adults or infants, as bifidobacteria composition varies over age (44,45). Indeed, some subspecies such as B. longum subps. infantis are specialized to breakdown human milk oligosaccharides (10). Perhaps we could not see the pattern at this finer scale due to the limited diversity within each bifidobacteria species in our analysis. However, a recent study also found that strains within just two species, B. breve and B. longum, isolated from the vagina and gut of humans were indistinguishable based on phylogenetic and genomic trait analyses (22). Thus, at least for these two habitats, that may be connected by dispersal, there are not specialized strains even when focusing on a finer genetic scale.

The lack of differences in CAZyme abundance among human categories was also surprising. This is contrary to previous studies that have found the highest abundance of CAZymes in gut bacterial communities (8,35,36). In particular, we expected high numbers of CAZymes from infant strains as some bifidobacteria can degrade HMOs in the babies’ gut allowing the modulation of the immune system and succession of the microbiome in the infants (10,35,46). A point worth noting is the blood-derived strains, which we suspect are not specialized in their isolation source but instead are transient. Indeed, the strain classified as B. scardovii JCM 12489^T = DSM 13734^T (accession number AP012331) has been reported to have one of the largest genomes consisting of 3,158,347 bp with no plasmids and with the largest number of glycosyl hydrolase genes (47).

Our conclusions are limited by data issues inherent to the reanalysis of publicly available genomes that could be addressed in future research. First, the sampling among host animals is quite uneven, and larger sample sizes among a broader range of hosts would strengthen the results. Second, signals of host or habitat adaptation will be stronger at a higher genetic resolution (i.e. within bifidobacteria species), and thus there is a need for deeper sampling of strains to resolve finer-scale adaptation. Related to this, we had to exclude many human-derived genomes that were not accompanied by information about the specific isolation site and age stage of the host. Lastly, it is unclear whether some of the observed patterns might have been influenced by different isolation methods, which likely varied across different studies.

This comparative genomic analysis reveals that bifidobacteria are adapted to their hosts. This adaptation is reflected in the evolutionary history of the shared core genome as well as their accessory gene composition and specific gene sets. At the same time, there is little evidence within the genus for specialization on particular human habitats or stages, which may be due to sampling limitations or a higher degree of bacterial dispersal within humans than appreciated. In sum, the assembly of bifidobacteria in their habitats appears to be determined by a mix of ecological (host filtering) and evolutionary (host adaptation) forces (48). Bifidobacteria thus offers a model to study these processes in animal microbiomes.

Genome sequences and annotation

Genome sequences of all Bifidobacterium strains were downloaded from the Pathosystems Resource Integration Center (PATRIC) and the National Center for Biotechnology Information (NCBI) databases on March 14th, 2018 (n = 497). Duplicate sequences were removed from further analysis. We identified the hosts for each of the strains by searching the PATRIC and NCBI databases or associated publications (n = 449). Based on the concatenation of 107 core genes (see phylogenetic analysis below for details), we removed sequences with many gaps in the core genes from further analysis and only kept unique strains (n = 400). The vast majority of the strains in the databases were derived from human hosts followed by primates, cattle, pigs and bees. For strains isolated from humans (n = 272), we assigned each strain to the most specific category possible, acknowledging that some categories are subsets of other categories: infant (n = 117), adult (n = 20), human blood (n = 13), human milk (n = 10), urogenital (n = 9), elderly (n = 5), child (n = 4), probiotic (n = 3), oral (n = 2), human unspecified (n = 89). Child refers to 2–6 years old while infant usually refers to children anywhere from birth to 1 year old (or reported as infant in their respected studies). A subset of 60 human-derived strains from diverse environments were used for genomic comparisons based on their descriptive isolation source (Additional file 1)..

To compare strains among hosts, we focused on a subset of 129 bifidobacteria strains. These strains included the majority of the non-human bifidobacteria strains in addition to a subset of human strains from adult and infant feces (n = 13), blood (n = 1), vagina (n = 1), and mouth (n = 1). The categories were the following: primate (n = 18), human (n = 16), cattle (n = 15), pig (n = 16), bee (n = 16), rodent (n = 12), probiotic (n = 8), wastewater (n = 7), rabbit (n = 7), chicken (n = 6), other mammals (n = 4; including giraffe, hippopotamus, llama, and wallaby), dairy products (n = 3), soil-plant-associated (n = 1). We recognize that not all the host categories are at the same phylogenetic level (Additional file 2)..

To ensure uniform annotation, we reannotated all the genomes using Prodigal v2.6.3 in Normal Mode to predict Open Reading Frames (ORF) (49). We then used Prokka v1.13 (50) to annotate the sequences.

Phylogenetic analysis

Multilocus phylogenetic trees were constructed using the bcgTree pipeline (51) with the protein fasta files (.*faa) derived from Prodigal v2.6.3. Each of the genome sequences was searched for 107 conserved single-copy genes defined by Dupont et al. 2012 (52) using hmmsearch v3.1b2. The extracted genes were then each aligned using muscle v3.8.31 (53) and polished using Gblocks v0.91b (54) by eliminating poorly aligned areas. The 107 genes were then concatenated, and a phylogenetic tree was built using RAxML v8.2.10 with PROTGAMMABLOSUM62 substitution model and 100 rapid Bootstrap searches (55). We visualized the phylogenetic trees using the iTOL v3 interactive tool (56).

Comparative genomic analysis

We next tested whether some of the variation in the traits encoded by bifidobacteria genomes could be explained by the host or environment from which they were isolated. We used the genome size values provided by the PATRIC metadata to compare the genome size among isolates. For human-derived strains we used the same 60 sequences used in the phylogenetic analysis since they were carefully chosen to encompass variable human environments and tried to keep similar samples sizes when possible between categories; however, for the comparison among multiple hosts and environments we used a subset of the 129 strains to keep sample sizes the same for each category (n = 6); hence, we did not include isolates from the dairy, mammal, and soil categories since their sample sizes were less than 6 strains.

The pan-genome and gene ontology of the 129 selected bifidobacteria strains were established with Roary v3.12.0 (57) using the annotated genome assemblies obtained from Prokka v1.13 (.gff files). To account for the relatively high diversity of this genus, we used a 50% sequence identity for the blastp cutoff (58). The Roary software was able to detect core genes (present in 99%–100% of the strains), soft core genes (present in 95%–99% of the strains), shell genes (present in 15%–95% of the strains), and cloud genes (present in 0%–15%). The presence-absence table given by Roary, depicting the 26,905 gene clusters, was curated by deleting the following genes: core genes present in all 129 strains (minus 352 = total: 26,553), singletons (minus 10,967 = total: 15,586), genes with an average sequence per isolate higher than 1, due to splitting errors (minus 189 = total: 15,397), and genes with hypothetical annotation with no identifiable gene name (minus 9,000 = total: 6,397). The final table containing 6,397 accessory genes was converted into a matrix for further comparisons between core genes and phylogenetic distance against accessory gene composition. We used Phandango (59) to construct the pan-genome alignment by incorporating the RAxML inferred tree and the presence-absence table given by Roary.

Toassess the abundance (number of genes) and diversity (number of different genes) of amino acid biosynthesis genes, the automatic annotation server Ghostkoala was used to obtain gene function assignments based on the KEGG Orthology (60). To identify the CAZymes encoded in each genome, we used the dbCAN2 meta server based on the CAZy database updated on July 13th, 2018 (61,62). The input files for the webserver were protein fasta files (.*faa) derived from Prodigal v2.6.3. This server has the option to utilize three tools to predict CAZymes: i) HMMER search against the dbCAN HMM (hidden MArkov model) database; ii) DIAMOND search against pre-annotated CAZyme sequence database; iii) Hotpep search against the CAZyme short peptide database. We used all three tools at the default parsing thresholds and only considered the CAZymes found by all three tools.

Statistical Analyses

We used ANOSIM in PRIMER–6 Software (63) to test whether the isolation source categories were associated with phylogenetic relatedness and accessory genes of the bifidobacteria strains. To test for a correlation between the similarity in accessory and core gene content, we used the Relate test in PRIMER–6. We used the Tree and reticulogram REConstruction (T-REX) web server (64) to create the distance matrices used in the ANOSIM and Relate tests using the Netwick phylogenetic tree from RAxML. We assessed normality of data using Shapiro-Wilk normality test and its variance with Levene’s test incorporated in RStudio version 1.1.453. To account for the non-normal data and non-equal sample sizes, we used the Kruskal-Wallis (with a calculated significance level of p > 0.05) and Dunn’s post hoc tests (RStudio version 1.1.453) to compare genome size, amino acid biosynthesis genes, and CAZymes between the different strains belonging to varying hosts and environments.To construct heatmaps and boxplots, RStudio version 1.1.453 (http://www.rstudio.com/) was implemented and to help with the optimization of the images created, Adobe® Acrobat® Pro 2017 was used.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its additional information files.

Competing interests

The authors declare that they have no competing interests.

Funding

This work was supported by a UCI Chancellor’s Fellow award to JBHM and a UCI Faculty Mentor Program (FMP) fellowship to CIR.

Authors’ contributions

JBHM and CIR conceived the project, wrote the manuscript, and interpreted the data. CIR conducted the bioinformatic analyses. All authors read and approved the final manuscript.

Acknowledgements

We would like to thank and acknowledge Katrine Whiteson, Brandon Gaut, and the members of the Martiny lab for their feedback while working on the manuscript.

McFall-Ngai M, Hadfield MG, Bosch TCG, Carey HV, Domazet-Lošo T, Douglas AE, et al. Animals in a bacterial world, a new imperative for the life sciences. Proc Natl Acad Sci U S A. 2013 Feb 26;110(9):3229–36.
David LA, Materna AC, Friedman J, Campos-Baptista MI, Blackburn MC, Perrotta A, et al. Host lifestyle affects human microbiota on daily timescales. Genome Biol. 2014 Jul 25;15(7):R89.
David LA, Maurice CF, Carmody RN, Gootenberg DB, Button JE, Wolfe BE, et al. Diet rapidly and reproducibly alters the human gut microbiome. Nature. 2014 Jan 23;505(7484):559–63.
Muegge BD, Kuczynski J, Knights D, Clemente JC, González A, Fontana L, et al. Diet Drives Convergence in Gut Microbiome Functions Across Mammalian Phylogeny and Within Humans. Science. 2011 May 20;332(6032):970–4.
Groussin M, Mazel F, Sanders JG, Smillie CS, Lavergne S, Thuiller W, et al. Unraveling the processes shaping mammalian gut microbiomes over evolutionary time. Nat Commun. 2017 Feb 23;8:14319.
Moeller AH, Caro-Quintero A, Mjungu D, Georgiev AV, Lonsdorf EV, Muller MN, et al. Cospeciation of gut microbiota with hominids. Science. 2016 Jul 22;353(6297):380–2.
Klijn A, Mercenier A, Arigoni F. Lessons from the genomes of bifidobacteria. FEMS Microbiol Rev. 2005 Aug 1;29(3):491–509.
O’Callaghan A, van Sinderen D. Bifidobacteria and Their Role as Members of the Human Gut Microbiota. Front Microbiol. 2016 Jun 15;7.
Ruiz-Moyano S, Totten SM, Garrido DA, Smilowitz JT, German JB, Lebrilla CB, et al. Variation in Consumption of Human Milk Oligosaccharides by Infant Gut-Associated Strains of Bifidobacterium breve. Appl Env Microbiol. 2013 Oct 1;79(19):6040–9.
LoCascio RG, Ninonuevo MR, Freeman SL, Sela DA, Grimm R, Lebrilla CB, et al. Glycoprofiling of Bifidobacterial Consumption of Human Milk Oligosaccharides Demonstrates Strain Specific, Preferential Consumption of Small Chain Glycans Secreted in Early Human Lactation. J Agric Food Chem. 2007 Oct;55(22):8914–9.
Bode L. Human milk oligosaccharides: Every baby needs a sugar mama. Glycobiology. 2012 Sep;22(9):1147–62.
Marcobal A, Sonnenburg JL. Human milk oligosaccharide consumption by intestinal microbiota. Clin Microbiol Infect Off Publ Eur Soc Clin Microbiol Infect Dis. 2012 Jul;18(0 4):12–5.
Liu S, Ren F, Zhao L, Jiang L, Hao Y, Jin J, et al. Starch and starch hydrolysates are favorable carbon sources for Bifidobacteria in the human gut. BMC Microbiol. 2015 Mar 1;15(1):54.
Rivière A, Moens F, Selak M, Maes D, Weckx S, De Vuyst L. The Ability of Bifidobacteria To Degrade Arabinoxylan Oligosaccharide Constituents and Derived Oligosaccharides Is Strain Dependent. Appl Environ Microbiol. 2014 Jan 1;80(1):204–17.
Slavin J. Fiber and Prebiotics: Mechanisms and Health Benefits. Nutrients. 2013 Apr;5(4):1417–35.
Milani C, Lugli GA, Duranti S, Turroni F, Bottacini F, Mangifesta M, et al. Genomic Encyclopedia of Type Strains of the Genus Bifidobacterium. Appl Environ Microbiol. 2014 Oct;80(20):6290–302.
Turroni F, van Sinderen D, Ventura M. Genomics and ecological overview of the genus Bifidobacterium. Int J Food Microbiol. 2011 Sep 1;149(1):37–44.
Esaiassen E, Hjerde E, Cavanagh JP, Simonsen GS, Klingenberg C. Bifidobacterium Bacteremia: Clinical Characteristics and a Genomic Approach To Assess Pathogenicity. J Clin Microbiol. 2017 Jul 1;55(7):2234–48.
Lamendella R, Domingo JWS, Kelty C, Oerther DB. Bifidobacteria in Feces and Environmental Waters. Appl Env Microbiol. 2008 Feb 1;74(3):575–84.
Martín R, Jiménez E, Heilig H, Fernández L, Marín ML, Zoetendal EG, et al. Isolation of Bifidobacteria from Breast Milk and Assessment of the Bifidobacterial Population by PCR-Denaturing Gradient Gel Electrophoresis and Quantitative Real-Time PCR. Appl Environ Microbiol. 2009 Feb;75(4):965–9.
Milani C, Mangifesta M, Mancabelli L, Lugli GA, James K, Duranti S, et al. Unveiling bifidobacterial biogeography across the mammalian branch of the tree of life. ISME J. 2017 Dec;11(12):2834–47.
Freitas AC, Hill JE. Bifidobacteria isolated from vaginal and gut microbiomes are indistinguishable by comparative genomics. PLoS ONE. 2018 Apr 23;13(4).
Sharma V, Mobeen F, Prakash T. Exploration of Survival Traits, Probiotic Determinants, Host Interactions, and Functional Evolution of Bifidobacterial Genomes Using Comparative Genomics. Genes. 2018 Oct;9(10):477.
Sun Z, Zhang W, Guo C, Yang X, Liu W, Wu Y, et al. Comparative Genomic Analysis of 45 Type Strains of the Genus Bifidobacterium: A Snapshot of Its Genetic Diversity and Evolution. Riedel CU, editor. PLOS ONE. 2015 Feb 6;10(2):e0117912.
Turroni F, Milani C, Duranti S, Ferrario C, Lugli GA, Mancabelli L, et al. Bifidobacteria and the infant gut: an example of co-evolution and natural selection. Cell Mol Life Sci. 2018 Jan 1;75(1):103–18.
Münkemüller T, Lavergne S, Bzeznik B, Dray S, Jombart T, Schiffers K, et al. How to measure and test phylogenetic signal. Methods Ecol Evol. 2012;3(4):743–56.
Kamilar JM, Cooper N. Phylogenetic signal in primate behaviour, ecology and life history. Philos Trans R Soc B Biol Sci. 2013 May 19;368(1618).
Martiny AC, Treseder K, Pusch G. Phylogenetic conservatism of functional traits in microorganisms. ISME J. 2013 Apr;7(4):830–8.
Cobo-Simón M, Tamames J. Relating genomic characteristics to environmental preferences and ubiquity in different microbial taxa. BMC Genomics. 2017 Jun 29;18.
McCutcheon JP, Moran NA. Extreme genome reduction in symbiotic bacteria. Nat Rev Microbiol. 2012 Jan;10(1):13–26.
Dini-Andreote F, Andreote FD, Araújo WL, Trevors JT, van Elsas JD. Bacterial Genomes: Habitat Specificity and Uncharted Organisms. Microb Ecol. 2012 Jul;64(1):1–7.
Hall JPJ, Brockhurst MA, Harrison E. Sampling the mobile gene pool: innovation via horizontal gene transfer in bacteria. Philos Trans R Soc B Biol Sci. 2017 Dec 5;372(1735).
Neis EPJG, Dejong CHC, Rensen SS. The Role of Microbial Amino Acid Metabolism in Host Metabolism. Nutrients. 2015 Apr 16;7(4):2930–46.
Graf J, Ruby EG. Host-derived amino acids support the proliferation of symbiotic bacteria. Proc Natl Acad Sci. 1998 Feb 17;95(4):1818–22.
Cantarel BL, Lombard V, Henrissat B. Complex Carbohydrate Utilization by the Healthy Human Microbiome. PLoS ONE. 2012 Jun 13;7(6).
Berlemont R, Martiny AC. Glycoside Hydrolases across Environmental Microbial Communities. PLoS Comput Biol. 2016 Dec 19;12(12).
Ventura M, Canchaya C, Casale AD, Dellaglio F, Neviani E, Fitzgerald GF, et al. Analysis of bifidobacterial evolution using a multilocus approach. Int J Syst Evol Microbiol. 2006;56(12):2783–92.
Lugli GA, Milani C, Turroni F, Duranti S, Ferrario C, Viappiani A, et al. Investigation of the Evolutionary Development of the Genus Bifidobacterium by Comparative Genomics. Appl Environ Microbiol. 2014 Oct;80(20):6383–94.
Turroni F, Berry D, Ventura M. Bifidobacteria and their role in the human gut microbiota. Frontiers Media SA; 2017. 244 p.
Bottacini F, Milani C, Turroni F, Sánchez B, Foroni E, Duranti S, et al. Bifidobacterium asteroides PRL2011 Genome Analysis Reveals Clues for Colonization of the Insect Gut. PLOS ONE. 2012 Sep 20;7(9):e44229.
Milani C, Turroni F, Duranti S, Lugli GA, Mancabelli L, Ferrario C, et al. Genomics of the Genus Bifidobacterium Reveals Species-Specific Adaptation to the Glycan-Rich Gut Environment. Appl Environ Microbiol. 2016 Feb 15;82(4):980–91.
Lugli GA, Duranti S, Albert K, Mancabelli L, Napoli S, Viappiani A, et al. Unveiling Genomic Diversity among Members of the Species Bifidobacterium pseudolongum, a Widely Distributed Gut Commensal of the Animal Kingdom. Appl Environ Microbiol. 2019 Apr 15;85(8):e03065–18.
Tao N, Wu S, Kim J, An HJ, Hinde K, Power ML, et al. Evolutionary Glycomics: Characterization of Milk Oligosaccharides in Primates. J Proteome Res. 2011 Apr 1;10(4):1548–57.
Arboleya S, Watkins C, Stanton C, Ross RP. Gut Bifidobacteria Populations in Human Health and Aging. Front Microbiol. 2016 Aug 19;7.
Kato K, Odamaki T, Mitsuyama E, Sugahara H, Xiao J, Osawa R. Age-Related Changes in the Composition of Gut Bifidobacterium Species. Curr Microbiol. 2017 Aug 1;74(8):987–95.
Thomson P, Medina DA, Garrido D. Human milk oligosaccharides and infant gut bifidobacteria: Molecular strategies for their utilization. Food Microbiol. 2018 Oct 1;75:37–46.
Toh H, Oshima K, Nakano A, Yamashita N, Iioka E, Kurokawa R, et al. Complete Genome Sequence of Bifidobacterium scardovii Strain JCM 12489T, Isolated from Human Blood. Genome Announc. 2015 Apr 9;3(2).
Moran NA, Ochman H, Hammer TJ. Evolutionary and Ecological Consequences of Gut Microbial Communities. Annu Rev Ecol Evol Syst. 2019;50(1).
Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010 Mar 8;11:119.
Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinforma Oxf Engl. 2014 Jul 15;30(14):2068–9.
Ankenbrand MJ, Keller A. bcgTree: automatized phylogenetic tree building from bacterial core genomes. Chain F, editor. Genome. 2016 Oct;59(10):783–91.
Dupont CL, Rusch DB, Yooseph S, Lombardo M-J, Alexander Richter R, Valas R, et al. Genomic insights to SAR86, an abundant and uncultivated marine bacterial lineage. ISME J. 2012 Jun;6(6):1186–99.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004 Mar 1;32(5):1792–7.
Castresana J. Selection of Conserved Blocks from Multiple Alignments for Their Use in Phylogenetic Analysis. Mol Biol Evol. 2000 Apr 1;17(4):540–52.
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014 May 1;30(9):1312–3.
Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res. 2016 08;44(W1):W242–245.
Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MTG, et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015 Nov 15;31(22):3691–3.
Chase AB, Gomez‐Lunar Z, Lopez AE, Li J, Allison SD, Martiny AC, et al. Emergence of soil bacterial ecotypes along a climate gradient. Environ Microbiol. 2018;20(11):4112–26.
Hadfield J, Croucher NJ, Goater RJ, Abudahab K, Aanensen DM, Harris SR. Phandango: an interactive viewer for bacterial population genomics. Bioinformatics. 2018 Jan 15;34(2):292–3.
Kanehisa M, Sato Y, Morishima K. BlastKOALA and GhostKOALA: KEGG Tools for Functional Characterization of Genome and Metagenome Sequences. J Mol Biol. 2016 Feb 22;428(4):726–31.
Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012 Jul 1;40(W1):W445–51.
Zhang H, Yohe T, Huang L, Entwistle S, Wu P, Yang Z, et al. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2018 Jul 2;46(W1):W95–101.
Clarke KR, Gorley RN. Primer V6: User Manual - Tutorial. Plymouth Marine Laboratory; 2006. 190 p.
Boc A, Diallo AB, Makarenkov V. T-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks. Nucleic Acids Res. 2012 Jul 1;40(W1):W573–9.

Download PDF

Journal Publication

published 08 Jan, 2020

Read the published version in BMC Genomics →

Review #1 received at journal
12 Nov, 2019
Review #3 received at journal
12 Nov, 2019
Editorial decision: Minor revision
12 Nov, 2019
Review #2 received at journal
08 Nov, 2019
Reviewer #3 agreed at journal
29 Oct, 2019
Reviewer #2 agreed at journal
25 Oct, 2019
Reviewer #1 agreed at journal
24 Oct, 2019
Submission checks completed at journal
23 Oct, 2019
Editor assigned by journal
23 Oct, 2019
Reviewers invited by journal
23 Oct, 2019
Editor invited by journal
22 Oct, 2019
First submitted to journal
21 Oct, 2019

You are reading this older preprint version

Read the latest preprint version →

Evolutionary relationships among bifidobacteria and their hosts and environments

Status:

Journal Publication

Version 1

Abstract

Figures

introduction

results

Phylogenetic relationships between bifidobacteria strains and isolation sources

Genome size analysis

Pangenome analysis

Amino acid biosynthesis analysis

Carbohydrate-active enzymes (CAZymes)

Discussion

conclusions

methods

Genome sequences and annotation

Phylogenetic analysis

Comparative genomic analysis

Statistical Analyses

declarations

Ethics approval and consent to participate

Consent for publication

Availability of data and materials

Competing interests

Funding

Authors’ contributions

Acknowledgements

references

Supplementary Files

Status:

Journal Publication

Version 1