Anaerobic fungi in the tortoise alimentary tract illuminate early stages of host-fungal symbiosis and Neocallimastigomycota evolution

doi:10.21203/rs.3.rs-3335237/v1

Download PDF

Article

Anaerobic fungi in the tortoise alimentary tract illuminate early stages of host-fungal symbiosis and Neocallimastigomycota evolution

https://doi.org/10.21203/rs.3.rs-3335237/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 28 Mar, 2024

Read the published version in Nature Communications →

Version 1

posted

You are reading this latest preprint version

The anaerobic gut fungi (AGF, Neocallimastigomycota) reside in the alimentary tract of herbivores. While their presence in mammals is well documented, evidence for their occurrence in non-mammalian hosts is currently sparse. Here we report on AGF communities in tortoises (family Testudinidae). Culture-independent surveys of tortoise fecal samples identified a unique AGF community, with three novel deep-branching genera representing > 90% of sequences in most samples. Representatives of all genera were successfully isolated under strict anaerobic conditions at 30^ºC or 39^ºC. Transcriptomics-enabled phylogenomic and molecular dating analysis indicated an ancient, deep-branching position in the AGF tree for these genera, with an evolutionary divergence time estimate of 104–112 million years ago (Mya). Such estimates push the establishment of animal- Neocallimastigomycota symbiosis from the early Paleogene (67 Mya) to the early Cretaceous (112 Mya). Further, compared to their mammalian counterparts, tortoise-associated isolates exhibited a more limited capacity for plant polysaccharides metabolism and lacked genes encoding several carbohydrate active enzyme (CAZyme) families mediating their degradation. Finally, we demonstrate that the observed curtailed degradation capacities and reduced CAZyme repretoire in tortoise-associated AGF is driven by the paucity of horizontal gene transfer (HGT) in tortoise-associated AGF genomes, compared to the massive HGT occurrence in mammalian AGF taxa. The reduced CAZyome and overall secretory machinery observed is also reflected in an altered cellulosomal production capacity in tortoise-associated AGF. Our findings provide novel insights into the scope of phylogenetic diversity, ecological distribution, evolutionary history, evolution of fungal-host nutritional symbiosis, and dynamics of genes and traits acquisition in Neocallimastigomycota.

Biological sciences/Ecology/Microbial ecology

Biological sciences/Evolution/Phylogenetics

Biological sciences/Microbiology/Fungi/Fungal ecology

Microbial communities play a crucial role in the digestive process in herbivores by mediating the breakdown of substrates recalcitrant to their hosts’ enzymes (1–3). The establishment of herbivore-microbiome nutritional symbiosis was associated with the evolution of dedicated digestive chambers e.g. enlarged hindgut, diverticula, and rumen, and longer feed retention times to improve the efficiency of the digestion process (4–7). A complex community of microorganisms in the herbivorous gastrointestinal tract (GIT) breaks down plant biomass to absorbable end products (3). So far, greater emphasis has been placed on the study of bacterial and archaeal members of the community (8–14, 15, 16), compared to microbial eukaryotes (protozoa and fungi). Nevertheless, the role of eukaryotes in the herbivorous gut is increasingly being recognized (17–21).

The anaerobic gut fungi (AGF, Neocallimastigomycota) are integral and ubiquitous constituents of the GIT community in mammalian herbivores (22–27). Notably, while chiefly investigated in mammalian hosts, microbiome-enabled herbivory and associated GIT structural features conducive to AGF establishment also occur in multiple non-mammalian herbivores. One of the potential non-mammalian AGF hosts are tortoises, members of the family Testudinidae, order Testudines (28). Tortoises are terrestrial herbivores that feed on grains, leaves, and fruits, possess an enlarged caecum (29), retain food for extremely long time frames (12–14 days) (29), and rely on hindgut fermentation (30, 31). While the extant Family Testudinidae have evolved only 38–39 million years ago (Mya) (32, 33), the Order Testudines (with 13 other families encompassing side-necked turtles, softshell turtles, sea turtles, and others) is much older, evolving in the Late Triassic (237 − 201 Mya); and some of its extinct members (e.g. Proganochelys) were known to be herbivores, with a digestive process highly similar to extant tortoises (34).

Here, we challenge the prevailing mammalian-centric narrative of AGF distribution and evolutionary history by the identification, isolation, and characterization of ancient, deep-branching AGF taxa from the tortoise GIT. The discovery of these novel tortoise-associated AGF demonstrate that AGF evolution as a distinct fungal phylum predates the rise of mammalian herbivory post the K-Pg extinction event; previously regarded as the defining event driving Neocallimastigomycota evolution and establishment of herbivores-fungal nutritional symbiosis (27, 35). Finally, we assess trait evolution patterns in these novel taxa in comparison to other mammalian-associated AGF and demonstrate that massive horizontal gene transfer (HGT) events driving CAZyme expansion in the Neocallimastigomycota have occurred mostly in mammalian-, but not tortoise-associated AGF lineages.

Samples. Fecal samples from 11 tortoises belonging to 8 genera and 9 species were obtained between November 2020 and March 2022 (Table S1). All samples originated from animals kept at the Oklahoma City Zoo (Oklahoma City, Oklahoma, USA), except one Sulcate (African Spurred) tortoise sample (Centrochelys sulcate), which was obtained from a local farm near Walters, OK, USA (34°28'43.3"N 98°13'33.0"W). Specimen collection from wild tortoise populations is exceedingly difficult, since many of the sampled tortoises spp. are critically endangered, e.g., ploughshare tortoise (36), and/or have a very limited geographic range (Table S1). Freshly deposited samples were placed in 15- or 50-mL conical centrifuge tubes and transferred on ice to the laboratory, where they were stored at -20ºC.

Amplicon-based diversity surveys. DNA extraction from fecal samples was conducted using DNeasy Plant Pro Kit (Qiagen Corp., Germantown, MD, USA) according to the manufacturer’s instructions and as previously described (27). PCR amplification targeting the D2 region of the LSU rRNA utilized the DreamTaq Green PCR Master Mix (ThermoFisher, Waltham, Massachusetts), and AGF-specific primers AGF-LSU-EnvS For: 5’-GCGTTTRRCACCASTGTTGTT-3’, and AGF-LSU-EnvS Rev: 5’-GTCAACATCCTAAGYGTAGGTA-3’ (37). The primers target a ~ 370 bp region of the LSU rRNA gene (corresponding to the D2 domain), hence allowing for high throughput sequencing using the Illumina MiSeq platform. Primers were modified to include the Illumina overhang adaptors. PCR reactions contained 2 µl of DNA, 25 µl of the DreamTaq 2X Master Mix (Life Technologies, Carlsbad, California, USA), 2 µl of each primer (10 µM) in a 50 µl reaction mix. The PCR protocol consisted of an initial denaturation for 5 min at 95°C followed by 40 cycles of denaturation at 95°C for 1 min, annealing at 55°C for 1 min and elongation at 72°C for 1 min, and a final extension of 72°C for 10 min. PCR products were individually cleaned to remove unannealed primers using PureLink® gel extraction kit (Life Technologies), and the clean product was used in a second PCR reaction to attach the dual indices and Illumina sequencing adapters using Nexterra XT index kit v2 (Illumina Inc., San Diego, California). These second PCR products were then cleaned using PureLink® gel extraction kit (Life Technologies, Carlsbad, California), individually quantified using Qubit® (Life Technologies, Carlsbad, California), and pooled using the Illumina library pooling calculator (https://support.illumina.com/help/pooling-calculator/pooling-calculator.htm) to prepare 4–5 nM libraries. Pooled libraries were sequenced at the University of Oklahoma Clinical Genomics Facility using the MiSeq platform.

Sequence data analysis. Forward and reverse Illumina reads were assembled using make.contigs command in mothur (38), followed by screening to remove sequences with ambiguous bases, sequences with homopolymer stretches longer than 8 bases, and sequences that were shorter than 200 or longer than 380 bp. Sequence assignment to AGF genera was conducted using a two-tier approach as recently described (27). Briefly, sequences were first compared by Blastn to the curated D1/D2 LSU rRNA AGF database (www.anaerobicfungi.org), and were classified as their first hit taxonomy if the percentage similarity to the first hit was > 96% and the two sequences were aligned over > 70% of the query sequence length. For all sequences that could not be confidently assigned to an AGF genus by Blastn, insertion into a reference LSU tree (with representatives from all cultured and uncultured AGF genera and candidate genera) was used to assess novelty. These genus-level assignments were then used to build a taxonomy file in mothur, which was subsequently used to build a shared file using the mothur commands phylotype and make.shared. The genus-level shared file was used for all downstream analyses as detailed below.

Alpha diversity estimates (observed number of genera, Shannon, Simpson, and Inverse Simpson diversity indices) were calculated using the command estimate_richness in the Phyloseq R package (39). To compare beta diversity and community structure between AGF communities in tortoises and AGF communities in canonical mammalian hosts, the AGF genus-level shared file from the 11 tortoise samples studied here was combined with the AGF genus-level shared file from a subset of mammalian hosts previously studied (27) (Dataset 1). The subset of mammalian AGF hosts included a comparable size from each of the five most-commonly sampled and numerous mammalian hosts: cattle (n = 25) (Bos taurus), sheep (n = 25) (Ovis aries), goat (n = 25) (Capra hircus), white-tail deer (n = 24) (Odocoileus virginianus), and horse (n = 25) (Equus caballus) generated in a prior study (27) that utilized the same DNA extraction, D2 LSU region amplification, and Illumina sequencing chemistry used in this study (Dataset 1). With this combined AGF genus-level shared file, we used the ordinate command in the Phyloseq R package to calculate weighted Unifrac beta diversity indices, and used the obtained pairwise values to construct ordination plots (both PCoA and NMDS) using the function plot_ordination in the Phyloseq R package.

Quantitative PCR. We quantified total AGF load in the 11 tortoise samples and compared it to a subset of the samples from ten cattle, ten goats, ten sheep, and ten horses (sample names in red text in Dataset 1) using quantitative PCR. The same primer pair (AGF-LSU-EnvS and AGF-LSU-EnvS Rev) used in the amplicon-based diversity survey described above was also used for qPCR quantification. The 25-µl PCR reaction volume contained 1 µl of extracted DNA, 0.3 µM of primers AGF-LSU-EnvS primer pair, and SYBR GreenER™ qPCR SuperMix for iCycler™ (ThermoFisher, Waltham, MA, USA). Reactions were run on a MyiQ thermocycler (Bio-Rad Laboratories, Hercules, CA). The reactions were heated at 95ºC for 8.5 min, followed by 40 cycles, with one cycle consisting of 15 sec at 95ºC and 1 min at 55ºC. A pCR 4-TOPO or pCR-XL-2-TOPO plasmid (ThermoFisher, Waltham, Massachusetts) containing an insert spanning ITS1-5.8S rRNA-ITS2-D1/D2 region of 28S rRNA from a pure culture strain was used as a positive control, as well as to generate a standard curve. The efficiency of the amplification of standards (E) was calculated from the slope of the standard curve and was found to be 0.89.

Isolation of AGF from Tortoises. Isolation of AGF from fecal samples of tortoises was conducted using established enrichment and isolation procedures in our laboratory (25, 40). A sequence-guided strategy, where samples with the highest proportion of novel, yet-uncultured AGF taxa were prioritized, was employed. To account for the poikilothermic (ectothermic) nature of the host, and the fact that the tortoise gut community is often exposed to lower and variable temperatures, we enriched for tortoise-associated AGF at a range of temperatures (30ºC, 39ºC, and 42ºC). Finally, the rumen fluid medium used for enrichment and isolation was amended with cellobiose (RFC medium) in addition to an insoluble substrate (switchgrass) and antibiotics (50 µg ml^− 1 chloramphenicol, 20 µg ml^− 1 streptomycin, 50 µg ml^− 1 penicillin, 50 µg ml^− 1 kanamycin, and 50 µg ml^− 1 norfloxacin).

Transcriptomic sequencing. Transcriptomic sequencing of 7 representative tortoise-associated AGF isolates (Testudinimyces strains BO1, EO1, GO1, NOS1, T130A.3, and Astrotestudinimyces strains B1.1, and B1.2) was conducted as described previously(27, 35). Briefly, biomass from cultures grown in RFC medium was vacuum filtered and used for total RNA extraction using an Epicentre MasterPure Teast RNA purification kit (Epicentre, Madison, WI) according to manufacturer’s instructions. RNA-seq was conducted on an Illumina HiSeq2500 platform using 2 × 150 bp paired-end library at the Oklahoma State University Genomics and Proteomics Core Facility. RNA-seq reads were quality trimmed and de novo assembled using Trinity (v2.14.0) and default parameters. Assembled transcripts were clustered using CD-HIT (41) (identity parameter of 95% (–c 0.95)) to identify unigenes. Following, peptide and coding sequence prediction was conducted on the unigenes using TransDecoder (v5.0.2) with a minimum peptide length of 100 amino acids (https://github.com/TransDecoder/TransDecoder). BUSCO (42) was used to assess transcriptome completeness using the fungi_odb10 dataset modified to remove 155 mitochondrial protein families as previously suggested (22).

Phylogenomic analysis and molecular dating. Phylogenomic analysis was conducted as previously described (27, 43) using the 7 transcriptomic datasets generated in this study, as well as 52 transcriptomic datasets from 14 AGF genera previously generated by our group (27, 35, 44), and others (22, 45–47), in addition to 5 outgroup Chytridiomycota genomes (Chytriomyces sp. strain MP 71, Entophlyctis helioformis JEL805, Gaertneriomyces semiglobifer Barr 43, Gonapodya prolifera JEL478, and Rhizoclosmatium globosum JEL800) to provide calibration points. The final alignment file included 88 genes that were gap free and comprising more than 150 nucleotide sites. This refined alignment was further grouped into 20 partitions, each assigned with an independent substitution model, suggested by a greedy search using PartitionFinder v2.1.1. All partition files, along with their corresponding models, were imported into BEAUti v1.10.4 for conducting Bayesian and molecular dating analyses. Calibration priors were set as previously described (35) including a direct fossil record of Chytridiomycota from the Rhynie Chert (407 Mya) and the emergence time of Chytridiomycota (573 to 770 Mya as 95% HPD)). The Birth-Death incomplete sampling tree model was employed for interspecies relationship analyses. Unlinked strict clock models were used for each partition independently. Three independent runs were performed for 30 million generations each. Tracer v1.7.1 (48) was used to confirm that sufficient effective sample size (ESS > 200) was obtained after the default burn-in (10%). The maximum clade credibility (MCC) tree was compiled using TreeAnnotator v1.10.4 (49).

Transcriptomic gene content analysis and comparative transcriptomics. Transcriptomic datasets obtained from tortoise AGF isolates (n = 7) were compared to the 52 previously generated transcriptomic datasets from mammalian AGF isolates (22, 27, 35, 44–47). Gene content comparison was conducted via classification of the predicted peptides against COG, KOG, GO, and KEGG classification schemes, as well as prediction of the overall CAZyme content. COG and KOG classifications were carried out via Blastp comparisons of the predicted peptides against the most updated databases downloaded from NCBI ftp server (https://ftp.ncbi.nih.gov/pub/COG/COG2020/data/ for COG 2020 database update, and https://ftp.ncbi.nih.gov/pub/COG/KOG/ for KOG database). GO annotations were obtained by first running Blastp comparisons of the predicted peptides against the SwissProt database. The first SwissProt hit of each peptide was then linked to a GO number by awk searching the file idmapping_selected.tab available from the Uniprot ftp server (https://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping/idmapping_selected.tab.gz). GO numbers corresponding to the first hits were then linked to their GO aspect (one of: Molecular function, Cellular component, or Biological process) by awk searching the file “goa_uniprot_all.gaf” available from GOA ftp site (ftp://ftp.ebi.ac.uk/pub/databases/GO/goa/UNIPROT). KEGG classification was conducted by running GhostKOALA (50) search on the predicted peptides. The overall CAZyme content was predicted using run_dbcan4 (https://github.com/linnabrown/run_dbcan), the standalone tool of the dbCAN3 server (http://bcb.unl.edu/dbCAN2/) to identify GHs, PLs, CEs, AAs, and CBMs in the 7 transcriptomic datasets.

To identify predicted functions that are unique to tortoise-associated or mammalian-associated AGF, predicted peptides from all 59 transcriptomes were compared in an all versus all Blastp followed by MCL clustering. Clusters obtained were then examined to identify these clusters that are unique to both tortoise isolates, unique to one of them, or present in mammalian associated genera but absent from both tortoise genera (thereafter “GroupD” clusters). KEGG classifications of predicted peptides belonging to each of these groups were then compared.

Quantifying horizontal gene transfer (HGT). We implemented an HGT detection pipeline that was previously developed and extensively validated (44) to identify patterns of HGT in the 7 tortoise-associated AGF transcriptomic datasets. The pipeline involved a combination of BLAST similarity searches against UniProt databases (downloaded January 2023), comparative similarity index (HGT index, h_U), and phylogenetic analyses to identify potential HGT candidates. The downloaded Uniprot databases encompassed Bacteria, Archaea, Viruses, Viridiplantae, Opisthokonta-Chaonoflagellida, Opisthokonta-Metazoa, the Opisthokonta-Nucleariidae and Fonticula group, all other Opisthokonta, and all other non-Opisthokonta, non-Viridiplantae Eukaryota. Each predicted peptide from the 7 tortoise isolates transcriptomic datasets was searched against each of these databases, as well as against the Opisthokonta-Fungi (without Neocallimastigomycota representatives). Candidates with a Blastp bit-score against a nonfungal database that was > 100 and an HGT index h_U that was ≥ 30 were further evaluated via phylogenetic analysis to confirm HGT occurrence, and to determine the potential donor. All potential candidates were first clustered using CD-HIT and 95% similarity cutoff. Representatives of each cluster were then queried against the nr database using web Blastp once against the full nr database and once against the Fungi (taxonomy ID 4751) excluding the Neocallimastigomycetes (taxonomy ID 451455) with an E value below e^− 10. The first 100 hits obtained using these two Blastp searches were downloaded and combined in one FASTA file that was then combined with the AGF representative sequences and aligned using MAFFT multiple sequence aligner, and the alignment was subsequently used to construct maximum likelihood phylogenetic trees using FastTree. At this level, candidates that showed a nested phylogenetic affiliation that was incongruent to organismal phylogeny with strong bootstrap supports were deemed horizontally transferred.

Predicted secretome in transcriptomic datasets. To identify the predicted secretome, DeepLoc 2.0 (51) was used to predict the subcellular location of all predicted peptides from the transcriptomes of a representative of two different tortoise-associated AGF genera (strain T130A.3 and B1.1), as well as one representative of mammalian-associated AGF genera (Orpinomyces joyonii strain AB3). All transcripts encoding peptides predicted to be extracellular (henceforth predicted secretome) were then subjected to run_dbcan4 (https://github.com/linnabrown/run_dbcan) to identify GHs, PLs, and CEs in the predicted secretome. In addition, the predicted secretome was searched for the presence of scaffoldin homologues via Blastp comparison against a scaffoldin database (319 proteins downloaded June 2023 from Uniprot and created by searching the UniprotKB for Scaffoldin and filtering the output by taxonomy using taxid Neocallimastigomycetes [451455]). Finally, the predicted secretome was also searched for the presence of non-catalytic dockerin domains (NCDD) via the NCBI Batch CD-search online tool and identifying the predicted peptides with hits to the CBM_10 pfam02013. All predicted extracellular peptides with NCDD were further subjected to run_dbcan4 to identify co-existing GH, PL, and CE domains.

Proteomics sequencing and analysis. In addition to secretome prediction from transcriptomic datasets, we conducted proteomic analysis on the same tortoise-associated AGF described above (Testudinimyces gracilis strains T130A.3 and Astrotestudinimyces divisus strains B1.1). Proteomic analysis was conducted on two fractions: biomass, and cellulose-bound. Briefly, cultures were grown in RFC media until mid-exponential phase (typically 3 days for Astrotestudinimyces strain B1.1, and 1 week for Testudinimyces strain T130A). Biomass fraction was first collected by centrifugation (3220 xg for 10 minutes at 4ºC). The cellulosomal fraction (in the supernatant) was separated using cellulose precipitation as previously described (52). Briefly, the supernatant pH was adjusted to 7.5, followed by adding cellulose (Sigmacell type 50) (0.4% w/v) and gently stirring at 4°C for 2 hours. Low-speed centrifugation (3220 xg for 10 minutes at 4ºC) was then used to separate the cellulosomal (pellet) fraction. Proteins bound to cellulose in the cellulosomal fraction (pellet) were then eluted in water by agitation at room temperature for 1 hour, followed by removal of cellulose by centrifugation. The soluble eluates were collected by centrifugation, and frozen at -80ºC. For both fraction (biomass, and cellulose-bound), the frozen samples were dried by vacuum centrifugation. Dried samples were redissolved for 30 min at RT in reducing buffered guanidine (6M guanidine HCl, 0.1M Tris HCl, tris(2-carboxyethyl)phosphine, pH 8.5). Debris were removed by centrifugation, and the solutes were alkylated by adding iodoacetamide to 10 mM and incubation for 30 min in the dark at RT. The alkylation reactions were then digested with trypsin using a filter aided sample preparation (FASP) approach (53). For FASP, the samples were loaded into 30-kDa spin filter devices (Sigma®), and subjected to three buffer exchanges into 8 M urea, 0.1M TrisHCl, pH 8.5, followed by three additional buffer exchanges into digestion buffer (100 mM TrisHCl, pH 8.5). For the final buffer exchange, samples were concentrated to ~ 10 µl in digestion buffer, followed by dilution with 75 µl of digestion buffer containing 0.75 µg of trypsin/LysC mix (Promega). Reactions were digested overnight at 37ºC, and the trypsinolysis products were recovered by centrifugation of the FASP device. Recovered peptides were desalted using centrifugal devices loaded with C18 resin following the manufacturer’s recommendations (HMMS18R, The Nest Group). The desalted peptides were frozen and dried by vacuum centrifugation, and redissolved in 0.1% aqueous formic acid immediately prior to analysis by LC-MS/MS.

For LC-MS/MS, peptides were injected onto a 75 µm x 50 cm nano-HPLC column packed with 1.9-micron C18 beads (Thermo PN 164942) connected to an Easy-nLC 120 nano-HPLC system configured for two-column vented trap operation. Peptides were separated by gradient chromatography using 0.1% aqueous formic acid as mobile phase A and 80:20:0.1 acetonitrile/water/formic acid as mobile phase B. Peptides separations used a gradient of 4–32% mobile phase B delivered over a period of 120 minutes. Eluted peptides were ionized in a Nanospray Flex Ion source using stainless steel emitters (Thermo). Peptide ions were analyzed in a quadrupole-Orbitrap mass spectrometer (Fusion model, Thermo) using a “high/low” “top-speed” data-dependent MS/MS scan cycle that consisted of an MS1 scan in the Orbitrap sector, ion selection in the quadrupole sector, high energy collision in the ion routing multipole, and fragment ion analyses in the ion trap sector. The details regarding the MS programming are provided (Table S3).

RAW files from the mass spectrometer were searched against the corresponding transcriptome predicted peptides database using the MaxQuant application (v2.0.2.0, (54)). Searches utilized MaxQuant defaults, supplemented with two additional peptide modifications: deamidation of N/Q residues, and Q cyclization to pyro-glutamate. The MaxQuant “match between runs” algorithm was not used. Sequences for reversed-sequence decoy proteins and common contaminants proteins were utilized for the database searches, but were removed from the final MaxQuant protein results.

Data availability. Illumina amplicon reads generated in this study have been deposited in GenBank under BioProject accession number PRJNA997953, and BioSample accession numbers SAMN36694530- SAMN36694536. RNA-seq reads from tortoise isolates have been deposited in GenBank under BioProject accession number PRJNA997953, and BioSample accession numbers SAMN36694608- SAMN36694614.

Anaerobic gut fungal diversity and community structure in tortoise. Culture-independent analysis identified the occurrence of AGF in all tortoise samples examined (n = 11, Table S1). A distinct community composition pattern was observed, with sequences affiliated with three AGF genera (NY54, NY56, and NY36) being highly prevalent, representing (either individually or collectively) > 90% of sequences encountered in 9/11 samples (Dataset 2, Fig. 1a). Candidate genus NY54 was the most ubiquitous being identified in all tortoise samples, as well as the most abundant making up the > 90% of the AGF community in 6 samples (Pancake, Impressed, Egyptian, Indian star, one Galapagos, and one Sulcata tortoises) and > 50% of the AGF community in one sample (Burmese star tortoise). Candidate genus NY56 was encountered only in 3/11 samples, and in only one of these (the Texas tortoise) it constituted > 90% of the AGF community, while making up only a minor fraction (< 1%) in the other two samples. Similarly, candidate genus NY36 was less ubiquitous, being only encountered in two samples, and constituting > 90% of the community in one sample). In contrast to their collective abundance in tortoise fecal samples, two out of the three tortoise-associated genera (NY36 and NY56) were seldom identified in reference mammalian fecal samples, while the third genus (NY54) exhibited a higher level of occurrence (Fig. 1b). Regardless of their observed pattern of ubiquity, the three tortoise-associated genera constituted an extremely minor component of the community in mammalian samples, when identified (Fig. 1b).

Phylogenetic analysis using the D2 domain of the LSU rRNA placed NY54, NY56, and NY36 as three distinct, deeply-branching lineages within the Neocallimastigomycota tree (Fig. 1c), with the closest relative being the genus Khoyollomyces} (Fig. 1c), recently identified as the earliest evolving AGF genus (55). Identical sequences across various tortoises were identified for each of the three genera (Figure S1) suggesting ready cross-host colonization. The observed low intra-genus sequence divergence estimates for all three genera suggests an extremely low level of speciation (Fig. 1d). Quantitative PCR was conducted to estimate the abundance of AGF in tortoise fecal samples. AGF load (expressed as ribosomal copy number per gram feces) was invariably low in all tortoise fecal samples examined. Loads were much higher in AGF canonical mammalian hosts, e.g. cattle, sheep, goat, and horse (Fig. 1e).

Assessment of alpha diversity patterns indicated that tortoises harbored a significantly less diverse AGF community when compared to placental mammals (p-value < 0.04) (Fig. 2a). Community assessment using PCoA constructed using phylogenetic similarity-based weighted Unifrac confirmed the clear distinction between tortoise and mammalian AGF mycobiomes (Fig. 2b-c). Host class (Mammalia versus Reptilia) explained 47.6% of the variance (adonis p-value = 0.001), while the host species explained 24.6% of the community variance (adonis p-value = 0.001). DPCoA ordination plots (Fig. 2d) showed that the abundance of the tortoise-associated genera NY36, NY54, and NY56, and the paucity of all other AGF was responsible for the observed pattern of community structure distinction.

Isolation of tortoise-associated AGF genera. Isolation efforts from tortoise fecal samples yielded twenty-nine different isolates (Table S2, Figure S2). Amplification and sequencing of the D1/D2 region of the LSU rRNA confirmed that the obtained isolates are identical to sequences encountered in culture-independent surveys. Representative isolates belonging to the candidate genera NY54 and NY36 have been successfully maintained and characterized as Testudinimyces gen. nov, and Astrotestudinimyes gen. nov, respectively (40). On the other hand, despite repeated successful isolation rounds, isolates belonging to candidate genus NY56 have been extremely hard to maintain as viable cultures for subsequent analysis. The names Testudinimyces and Astrotestudinimyes will henceforth be used to describe cultured strains belonging to NY54 and NY36 the manuscript.

Phylogenomic and molecular clock timing analysis. Transcriptomics-enabled phylogenomic analysis placed isolates belonging to the genera Testudinimyces and Astrotestudinimyces

as two distinct, early-branching basal lineages in the Neocallimastigomycota tree (Fig. 3). Molecular clock timing suggests that tortoise-associated AGF (T-AGF) have evolved in the early Cretaceous, with a divergence time estimate of 112.19 Mya, with the 95% Highest Probability Density (HPD) interval at 95.94-129.98 Mya, and 104.43 Mya (95% HPD: 89.37-120.82 Mya) for Astrotestudinimyces, and Testudinimyces, respectively. Such estimates push the Neocallimastigomycota evolution by approximately 45 Mya, since prior efforts timed the phylum evolution at 67 Mya in the early Paleogene (27, 35), and indicate that Neocallimastigomycota evolution predates the K-Pg extinction event and subsequent evolution of mammalian herbivorous families (e.g. Equidae, Bovidae, Cervidae) as well as grasses (Poaceae), previously regarded as the defining process forging AGF evolution into a distinct fungal phylum (27, 35).

A curtailed carbohydrate active enzyme machinery in tortoise-associated AGF. Preliminary comparative transcriptomic analysis (Supp. text, Figure S3) indicated that T-AGF lack many gene clusters (n = 1699) encountered in all currently available, mammalian-associated AGF (henceforth M-AGF) isolates (Figure S4). Interestingly, a significant proportion (55.13%) of these gene clusters encoded metabolic functions, with a high proportion of carbohydrate metabolism (49.63% of metabolic functions gene clusters) and, more specifically, an enrichment of Carbohydrate Active enZymes (CAZymes) (46% of carbohydrate metabolism clusters) (Figure S4). Further, representatives of the genera Testudinimyces (strain T130A.3) and Astrotestudinimyces (strain B1.1) demonstrated a slower ability to grow on (microcrystalline) cellulose, failed to grow on xylan, and exhibited a relatively more limited capacity for carbohydrates metabolism compared to reference M-AGF (Figure S5 and (40)). Such pattern strongly suggests a curtailed machinery for plant biomass degradation in tortoise-associated, compared to M-AGF isolates.

Comparative analysis demonstrated that T-AGF harbor a significantly reduced CAZyome compared to M-AGF (Student t-test p-value = 0.0011), with only 0.5±0.11% of the predicted peptides assigned to a GH, CE, and PL families, compared to 1.3 ±0.61% in mammalian isolates transcriptomes (Dataset 3). Specifically, T-AGF transcriptomes harbored significantly lower number of distinct transcripts assigned to the families primarily associated with cellulose and hemicellulose metabolism, e.g. cellulase GH families GH5, GH9, the xylanase families GH10, GH11, GH16, GH45, the cellobiohydrolase families GH6, GH48, the β-glucosidase family GH3, the β-xylosidase family GH43, the α-amylase family GH13, the acetyl xylan esterases families CE1, CE2, CE4, and CE6 (Wilcoxon adjusted p-value < 0.03) (Fig. 4).

Limited horizontal gene transfer in tortoise-associated AGF. Interestingly, many of the CAZyme families lacking or severely curtailed in T-AGF have previously been shown to be acquired by AGF via horizontal gene transfer (HGT) (Figure S4, (44)). To determine whether this reflects a broader pattern of sparse HGT occurrence in the entirety of T-AGF genomes, we quantified HGT occurrence and frequency in T-AGF transcriptomes. Our analysis (Table 1) identified a total of only 35 distinct HGT events (with an average of 0.16±0.05% of transcripts in the 7 sequenced T-AGF transcriptomes). This value is markedly lower than the 277 distinct HGT events previously reported from M-AGF transcriptomes (44). Interestingly, within the limited number of HGT events identified in T-AGF, the majority (30/35) were also identified in M-AGF (44); and virtually all of which (29/30) had the same HGT donor (Table 1). Such pattern indicates that most HGT events in T-AGF occurred prior to the phylum’s Neocallimastigomycota diversification into tortoise- and mammalian-associated lineages.

Table 1
COG/ KEGG Classification	Function imparted/ Pathway	Affiliation of donor		Distribution in Tortoise genera (NY54 and NY36)	Occurrence In mammalian AGF	Same donor?
COG/ KEGG Classification	Function imparted/ Pathway	Phylum/ Class	Kingdom/ Clade	Distribution in Tortoise genera (NY54 and NY36)	Occurrence In mammalian AGF	Same donor?
Cellular Processes and Signaling
[O] Posttranslational modification, protein turnover, chaperones	DsbA	Cnidaria	Metazoa	Both	Yes	No
Information Storage and Processing
[L] Replication, recombination and repair	methylated DNA protein-cysteine methyl transferase	Firmicutes	Bacteria	Both	Yes	Yes
[J] Translation, ribosomal structure and biogenesis	GTP-binding	Mixed phyla	Bacteria	Both	Yes	Yes
Metabolism
[E] Amino acid transport and metabolism	ADP-ribosyl arginine hydrolases	Firmicutes	Bacteria	NY54	Yes	Yes
	Aspartate-ammonia ligase	Bacteroidetes	Bacteria	Both	Yes	Yes
	Cysteine synthase	Firmicutes	Bacteria	Both	Yes	No
	Tryptophan synthase (trpB)	Verrucomicrobia	Bacteria	NY54	Yes	Yes
[H] Coenzyme transport and metabolism	dephospho CoA kinase		Amoebozoa	NY54	Yes	Yes
[H] Coenzyme transport and metabolism	NadB (L-aspartate oxidase)	Myxococcota	Bacteria	Both	Yes	Yes
[C] Energy production and conversion	Lactate dehydrogenase and 2-hydroxyacid dehydrogenase	Firmicutes	Bacteria	Both	Yes	Yes
	lipoamide dehydrogenase	Mixed phyla	Bacteria	Both	Yes	Yes
	bifunctional aldehyde/alcoholDH family of Fe-ADH	Cyanobacteria	Bacteria	Both	Yes	Yes
[F] Nucleotide transport and metabolism	guanine deaminase	Firmicutes	Bacteria	NY36	No	N/A
	thymidine kinase	Alpha-Proteobacteria	Bacteria	NY54	Yes	Yes
	anaerobic ribonucleoside triphosphate reductase	TM6	Bacteria	Both	Yes	Yes
[Q] Secondary metabolites biosynthesis, transport and catabolism	Dehydrogenases	Firmicutes	Bacteria	Both	Yes	Yes
	Dehydrogenases	Firmicutes	Bacteria	NY54	Yes	Yes
[KO] Metabolism of terpenoids and polyketides	Squalene-hopene cyclase	Ciliophora	Alveolata	Both	Yes	Yes
[G] Carbohydrate transport and metabolism
Glycoside Hydrolases	GH13	Firmicutes	Bacteria	Both	Yes	Yes
	GH1	Firmicutes	Bacteria	Both	Yes	Yes
	GH5	Firmicutes	Bacteria	Both	Yes	Yes
	GH43	Firmicutes	Bacteria	Both	Yes	Yes
	GH10	Firmicutes	Bacteria	Both	Yes	Yes
	GH11	Fibrobacteres	Bacteria	NY36	Yes	Yes
	GH88	Mollicutes/ Firmicutes	Bacteria	Both	Yes	Yes
	GH3	Firmicutes	Bacteria	NY36	Yes	Yes
	GH53	Firmicutes	Bacteria	Both	No	N/A
	GH48	Firmicutes and Actinobacteria	Bacteria	NY54	Yes	Yes
Carbohydrate Esterases	CE1	Firmicutes and Fibrobacteres	Bacteria	NY36	Yes	Yes
Carbohydrate Esterases	CE15	Bacteroidetes	Bacteria	Both	Yes	Yes
Polysaccharide Lyases	PL4	Bacteroidetes	Bacteria	Both	Yes	Yes
Poorly characterized
[KO] Not Included in Pathway or Brite	aminopeptidase (Peptidase MEROPS Family M18)	Firmicutes	Bacteria	Yes	No	N/A
No COG/KEGG	TerD_like	Firmicutes	Bacteria	Yes	No	N/A
	uncharacterized protein		Metazoa	Yes	No	N/A
	uncharacterized protein	Firmicutes	Bacteria	Yes	No	N/A

Prior work has suggested the prevalence of metabolic functions in genes acquired by HGT in M-AGF (44). Such pattern held true for T-AGF, with (29/35) of the identified HGT events encoding a metabolic function. Several HGT-acquired metabolic genes in T-AGF were involved in processes enabling anaerobiosis. Specifically these genes mediated functions such as recycling reduced electron carriers via fermentation (aldehyde/alcohol dehydrogenases and d-lactate dehydrogenase for ethanol and lactate production from pyruvate), de novo synthesis of NAD via the bacterial pathway (L-aspartate oxidase NadB), the acquisition of the oxygen-sensitive ribonucleoside-triphosphate reductase class III (anaerobic ribonucleoside triphosphate reductase nrdD) and of squalene-hopene cyclase, catalyzing the cyclization of squalene into hopene during biosynthesis of tetrahymanol (that replaced the molecular O₂-requiring ergosterol in the cell membranes of AGF) (Table 1). Few additional HGT-acquired metabolic genes encoded CAZymes (Table 1). However, the number of HGT-acquired CAZyme genes in T-AGF was extremely minor (13 events representing an average of 10.81±4.17% of the total CAZYome in the 7 sequenced transcriptomes) compared to the massive acquisition of CAZymes by HGT previously reported in M-AGF (a total of 72 events representing 24.62–40.41% of the overall CAZyome) (44).

Cellulosomal production capacity in tortoise versus mammalian AGF. Anaerobic fungi produce cellulosomes, extracellular structures that function as multienzyme complexes that synergistically break down plant biomass into fermentable sugars. Many AGF-produced CAZymes localize to the cellulosomes. A non-catalytic dockerin domain (NCDD) similar to carbohydrate-binding module family 10 (CBM10) is usually associated with cellulosome-bound genes in anaerobic fungi and typically docks the enzymes to cohesin domains housed in a large scaffolding protein (scaffoldin), that in-turn anchors the entire structure to the cell wall (26). We hypothesized that the observed differences in gene content (Figure S4), CAZyme repertoire (Fig. 4, Dataset 3), secretome content (Supp. text, Figure S6), and HGT frequency (Table 1) between T-AGF and M-AGF would result in a differential cellulosomal production capacity (assessed as all peptides predicted to be extracellular and harbor an NCDD, as previously suggested (52, 56, 57)). Within the transcriptomes of representatives of T-AGF genera Testudinimyces (strain T130A) and Astrotestudinimyces (strain B1.1), predicted peptides with high sequence similarity (> 27.34% aa identity), and close phylogenetic affiliation (Fig. 5a) to Neocallimastigomycota scaffoldin protein ScaA were identified (5 copies in T130A, and 38 copies in B1.1 transcriptomes equivalent to 0.03, and 0.14% of total transcripts). As well, a total of 91, and 183 transcriptome predicted peptides possessing a non-catalytic dockerin domain (NCDD) and predicted to be extracellular were identified in T130A, and B1.1, respectively (equivalent to 1.16, and 0.34% of total transcripts). NCDD-harboring predicted peptides encoded CAZymes (n = 43, and 72, respectively), spore coat protein CotH (n = 6, and 9, respectively), carbohydrate binding modules (n = 34, and 87, respectively), expansins (n = 1, and 3, respectively), and other functions including hydrolases, and phosphatases (Fig. 5b). For comparative purposes, we sequenced and analyzed the transcriptome of a reference M-AGF isolate (Orpinomyces joyonii strain AB3). Similar to previously reported M-AGF, e.g. Pecoramyces (56), Caecomyces (58), Piromyces, Neocallimastix, and Anaeromyces (57), strain AB3 harbored a larger number of scaffoldin predicted peptides (n = 24, equivalent to 0.146% of total transcripts), and extracellular predicted peptides possessing a NCDD (n = 316, equivalent to 1.92% of total transcripts). Extracellular NCDD-harboring predicted peptides in strain AB3 encoded CAZymes (n = 134), spore coat protein CotH (n = 30), carbohydrate binding modules (n = 118), and expansins (n = 3) (Fig. 5b). Further, in addition to the overall lower number of NCDD-harboring peptides in T-AGF compared to M-AGF, clear differences were also observed in the relative composition of the CAZyme component of their predicted cellulosomes. In general, an extremely minor representation of CEs and GH10 component in T-AGF cellulosomes, when compared with the M-AGF representative strain AB3, was observed (Fig. 5B). On the other hand, an exclusive representation of GH45 in strain B1.1, and high and exclusive representation of PL1 in strain T130A, in comparison to strain AB3 cellulosome was observed (Fig. 5c).

To confirm the translation, secretion, and cellulose-binding affinity of predicted cellulosomal proteins (scaffoldins and NCDD-containing peptides), shotgun proteomics was conducted on the total, and cellulose-bound fractions of representatives of Astrotestudinimyces (strain B1.1), and Testudinimyces (strain T130A) (Dataset 4). Of the 221, and 96 proteins predicted to be cellulosomal-bound in transcriptomics analysis of strains B1.1, and T130A, respectively, 172, and 57 proteins were identified in the proteomics dataset, confirming their translation (Dataset 5, Figure S7). Of these, 169, and 50 proteins were identified in the cellulose-bound fraction. Further, all or the majority of these proteins were identified in higher intensity in the cellulose-bound fraction (169, and 46 proteins, respectively), with intensity ratios (intensity in cellulose-bound fraction: intensity in the biomass fraction) exceeding 5 in 94%, and 90% of the proteins (Dataset 5, Figure S7).

Assessment of the AGF community in tortoises demonstrated that three genera (Testudinimyces, Astrotestudinimyces, and NY56) represent the majority of AGF in most samples examined (Fig. 1a). The collective predominance and ubiquity of these genera is in stark contrast to their rarity and low relative abundance, when encountered, in mammalian hosts (Fig. 1b, (27)). Phylogenomic and molecular clock timing analysis estimated an evolutionary time of 112 Mya, and 104 Mya for the genera Astrotestudinimyces, and Testudinimyces, respectively. Such estimates predate the evolution of all current mammalian families known to harbor AGF (59–65). More importantly, it coincides with an increased diversification in fossil records of extinct turtle lineages during the lower Cretaceous (145 − 100 Mya), a process spurred by an increase in climatically suitable geographic areas (66). We hence posit that tortoise-associated AGF lineages evolved in now-extinct Testudines ancestor(s) during the Middle Cretaceous, and were successfully retained throughout subsequent evolutionary events leading to the evolution of their current host (herbivorous land-dwelling tortoises, family Testudinidae) during the Eocene (38–39 Mya) (32, 33).

T-AGF genera were identified in 8 genera and 9 tortoise species that collectively encompass multiple feeding strategies and geographical ranges (Table S1). The observed strong pattern of tortoise-fungal phylosymbiosis indicates that T-AGF possess distinct properties enabling successful colonization and propagation in the tortoise GIT. The lower temperature optima (for Testudinimyces) and wider temperature range (for both Testudinimyces and Astrotestudinimyces) compared to M-AGF aids in their survival and growth in the poikilothermic (cold-blooded) tortoises, where lower and wider variation in internal temperature prevail (40). As well, the slower growth of representatives of the genus Testudinimyces mirrors the slower basal metabolic rate and the extremely long food retention time in tortoises (12–14 days) (29), allowing ample time for substrate colonization.

However, while a rationale for their successful establishment in the tortoise GIT over their more robust mammalian AGF counterparts could be discerned; the exact ecological role and services rendered by T-AGF to their hosts, if any, are currently unclear. Given their relatively low numbers (Fig. 1e) in the ecosystem, as well as their relatively curtailed CAZyme repertoire (Fig. 4 and Dataset 3), their relative contribution to substrate depolymerization in their hosts appears minor. As well, a role in oligomer conversion to monomers, followed by monomer fermentation to absorbable VFA could be postulated. Alternatively, T-AGF could be rendering ecological services unrelated to food digestion in their host, e.g., modulating community and preventing pathogenic microbes’ establishment via secondary metabolites production and niche competition, akin to the gut and skin microbiome role in colonization resistance in human (67, 68). Finally, the possibility that T-AGF are dispensable commensals, rather than indispensable symbionts could not be discounted, given their extremely low loads in the tortoise GIT (Fig. 1e).

Regardless of their adaptive strategies and putative role in the tortoise GIT, our findings have important implications on our understanding of the evolutionary history of Neocallimastigomycota. Prior efforts based on available M-AGF taxa estimated an AGF divergence time of 67 Mya (27, 35). Such estimate post-dates the K-Pg extinction event and coincides with the evolution of mammalian AGF host families (59–65) and the associated evolutionary innovations in hosts’ alimentary tract architecture, as well as the evolution of grasses in the family Poaceae (69). Our results describe two distinct, deep branching lineages that independently evolved 37–45 Mya prior to these events in a non-mammalian host. The described genera hence represent the earliest known form of host-anaerobic fungal associations known to date; and demonstrate that AGF evolution predates the events previously recognized as the driving force behind forging Neocallimastigomycota evolution as a distinct fungal phylum. It is currently unclear whether T-AGF are the direct ancestors providing the seed for M-AGF, or whether M-AGF evolved from other yet-undiscovered extinct or extant ancestors. Nevertheless, it is important to note that despite such discovery, our results do not challenge the key role played by the rise of mammalian herbivory post the K-Pg extinction event and the evolution of mammalian families with dedicated fermentation chambers in AGF evolution. Establishment of AGF in the mammalian herbivorous gut has spurred an impressive wave of AGF family- and genus-level diversification (27), and acquisition of genes enabling efficient cellulose and hemicellulose degradation via HGT (44) to fully utilize the newly evolved grasses (family Poaceae) as a primary food source (35, 44). These innovations, in turn, enabled the establishment of AGF as indispensable members of the GIT tract of herbivorous mammals (23, 70–72). Indeed, most of the AGF identified diversity and biomass on earth currently resides in mammalian, rather than non-mammalian, herbivores.

Finally, comparative analysis between M- and T-AGF clearly indicates a significantly lower frequency of gene acquisition via HGT in T-AGF compared to M-AGF (Table 1). Our analysis suggests that a primary purpose of HGT in T-AGF is to enable their transition from an aerobic ancestor to an anaerobic lineage, a prerequisite for their establishment in the tortoise AGF tract. Only few (13 out of 35) HGT events were associated with improving plant degradation capacity in T-AGF lineages, which is, in turn, reflected in the slower cellulose-degradation ability and the lack of xylan degradation abilities in T-AGF taxa (Figure S5). Further, most of the HGT events identified in T-AGF in this study were also observed in M-AGF indicating ancient acquisition events prior to T-AGF and M-AGF split (Fig. 3). However, these relatively few ancient HGT events were followed by a more extensive second wave of HGT-mediated gene acquisition that occurred solely in M-AGF, and was mostly responsibly for equipping M-AGF with a powerful plant biomass degradation machinery enabling their propagation, establishment, and competition in the highly competitive, bacteria and archaea dominated rumen and hindgut in mammalian herbivores (35, 44).

What prevented T-AGF genera from undergoing a similar massive acquisition of CAZymes to improve their competitive advantage in the tortoise GI tract and beyond? We provide two possible explanations for the observed deficiency. First, such difference could be niche related. Mammalian rumen and hindguts are characterized by higher temperatures, larger food intake, rapid digestion and substrate turnover, higher microbiome density and diversity and higher overall metabolic activity. Such conditions provide for a more active milieu of cells, extracellular DNA, and viruses with higher opportunities for HGT through transduction, natural uptake, and transformation. Second, such differences in HGT frequency could be related to the hyphal growth pattern of AGF taxa. T-AGF genera identified appear to be polycentric, and such taxa produce lower number of zoospores, and can depend on hyphal propagation as means of reproduction. In contrast, most M-AGF genera (16/20 genera), including earliest evolving ones (e.g. Khoyollomyces, Piromyces) are monocentric, with strict dependency on sporangial development and free zoospore release followed by encystment and growth. Given the fact that fungal zoospores are naturally competent (73, 74) and represent the most appropriate and logical stage for DNA uptake by the AGF, such differential prevalence could contribute to the observed differences in HGT between both groups.

Funding. This work has been supported by the NSF grant number 2029478 to MSE and NHY. Computational resources used at Oklahoma State University was supported by NSF grant OAC-1531128.

Authors Contributions. Conceptualization: MSE, NHY, CJP; Methodology: MSE, NHY, CJP; Formal analysis: NHY, CJP; Investigation: CJP, DKJ, EEE, ALJ, CHM; Resources: MSE, NHY; Data Curation: NHY; Writing - Original Draft: MSE, NHY; Writing - Review & Editing: MSE, NHY, CJP, EEE, CHM; Visualization: NHY, CJP; Supervision: NHY, MSE; Project administration: NHY, MSE; Funding acquisition: NHY, MSE.

Acknowledgments. We thank Drs. Jennifer D’Agostino, and Rebecca Snyder (Oklahoma City Zoo) for collecting and providing tortoise fecal samples.

Competing interests. The authors declare no competing interest.

I. Rowland et al., Gut microbiota functions: metabolism of nutrients and other food components. Eur J Nutr 57, 1-24 (2018).
A. H. Moeller, J. G. Sanders, Roles of the gut microbiota in the adaptive evolution of mammalian species. Philos Trans Royal Soc B Biol Sci 375, 20190597 (2020).
G. Wunderlich, M. Bull, T. Ross, M. Rose, B. Chapman, Understanding the microbial fibre degrading communities & processes in the equine gut. Anim Microbiome 5, 3 (2023).
H.-D. Sues, R. R. Reisz, Origins and early evolution of herbivory in tetrapods. Trend Ecol Evol 13, 141-145 (1998).
G. King, Reptiles and herbivory (Chapman & Hall, London, UK., 1996).
M. E. Collinson, J. J. Hooker, Fossil evidence of interactions between plants and plant-eating mammals. Philos Trans R Soc Lond B Biol Sci 333, 197-207 (1991).
I. D. Hume, A. C. I. Warner, "Evolution of microbial digestion in mammals" in Digestive physiology and metabolism in ruminants: Proceedings of the 5th International Symposium on Ruminant Physiology, held at Clermont — Ferrand, on 3rd–7th September, 1979, Y. Ruckebusch, P. Thivend, Eds. (Springer Netherlands, Dordrecht, 1980), 10.1007/978-94-011-8067-2_32, pp. 665-684.
H. J. Flint, E. A. Bayer, M. T. Rincon, R. Lamed, B. A. White, Polysaccharide utilization by gut bacteria: potential for new insights from genomic analysis. Nat Rev Microbiol 6, 121-131 (2008).
R. J. Gruninger, T. A. McAllister, R. J. Forster, Bacterial and archaeal diversity in the gastrointestinal tract of the North American beaver (Castor canadensis). PLoS One 11, e0156457 (2016).
R. E. Ley et al., Evolution of mammals and their gut microbes. Science 320, 1647-1651 (2008).
B. D. Muegge et al., Diet drives convergence in gut microbiome functions across mammalian phylogeny and within humans. Science 332, 970-974 (2011).
B. St-Pierre, A. D. G. Wright, Diversity of gut methanogens in herbivorous animals. Animal 7, 49-56 (2013).
D. M. Stevenson, P. J. Weimer, Dominance of Prevotella and low abundance of classical ruminal bacterial species in the bovine rumen revealed by relative quantification real-time PCR. Appl Microbiol Biotechnol 75, 165-174 (2007).
C. M. Thomas, E. Desmond-Le Quéméner, S. Gribaldo, G. Borrel, Factors shaping the abundance and diversity of the gut archaeome across the animal kingdom. Nat Commun 13, 3358 (2022).
N. D. Youngblut et al., Vertebrate host phylogeny influences gut archaeal diversity. Nat Microbiol 6, 1443-1454 (2021).
M. A. García-Amado et al., Bacterial diversity in the cecum of the world's largest living rodent (Hydrochoerus hydrochaeris). Microb Ecol 63, 719-725 (2012).
T. O. Andersen et al., Metabolic influence of core ciliates within the rumen microbiome. ISME J 17, 1128-1140 (2023).
G. Gürelli, S. Canbulat, N. Aldayarov, B. A. Dehority, Rumen ciliate protozoa of domestic sheep (Ovis aries) and goat (Capra aegagrus hircus) in Kyrgyzstan. FEMS Microbiol Lett 363 (2016).
G. Gürelli, B. Göçmen, Intestinal ciliate composition found in the feces of racing horses from Izmir, Turkey. Eur J Protist 48, 215-226 (2012).
C. J. Newbold, G. de la Fuente, A. Belanche, E. Ramos-Morales, N. R. McEwan, The role of ciliate protozoa in the rumen. Front Microbiol 6, 1313 (2015).
R. Solomon et al., Protozoa populations are ecosystem engineers that shape prokaryotic community structure and function of the rumen microbial ecosystem. ISME J 16, 1187-1197 (2022).
R. J. Gruninger et al., Application of transcriptomics to compare the carbohydrate active enzymes that are expressed by diverse genera of anaerobic fungi to degrade plant cell wall carbohydrates. Front Microbiol 9, 1581 (2018).
R. J. Gruninger et al., Anaerobic fungi (phylum Neocallimastigomycota): Advances in understanding their taxonomy, life cycle, ecology, role and biotechnological potential. FEMS Microbiol Ecol 90, 1-17 (2014).
L. H. Hagen et al., Proteome specialization of anaerobic fungi during ruminal degradation of recalcitrant plant fiber. ISME J 15, 421-434 (2021).
R. A. Hanafy, B. Johnson, N. H. Youssef, M. S. Elshahed, Assessing anaerobic gut fungal diversity in herbivores using D1/D2 large ribosomal subunit sequencing and multi-year isolation. Environ Microbiol 22, 3883-3908 (2020).
M. Hess et al., Anaerobic fungi: past, present, and future. Front Microbiol 11, 584893 (2020).
C. H. Meili et al., Patterns and determinants of the global herbivorous mycobiome. Nat Commun 14, 3798 (2023).
M. Le, C. J. Raxworthy, W. P. McCord, L. Mertz, A molecular phylogeny of tortoises (Testudines: Testudinidae) based on mitochondrial and nuclear genes. Mol Phylogenet Evol 40, 517-531 (2006).
P. S. Barboza, Digesta passage and functional anatomy of the digestive tract in the desert tortoise (Xerobates agassizii). J Comp Physiol B 165, 193-202 (1995).
J.-M. Hatt, M. Clauss, R. Gisler, A. Liesegang, M. Wanner, Fiber digestibility in juvenile Galapagos tortoises (Geochelone nigra) and implications for the development of captive animals. Zoo Biol 24, 185-191 (2005).
M. L. Yuan, S. H. Dean, A. V. Longo, B. B. Rothermel, T. D. Tuberville, K. R. Zamudio, Kinship, inbreeding and fine-scale spatial structure influence gut microbiota in a hindgut-fermenting tortoise. Mol Ecol 24, 2521-2536 (2015).
E. Vlachos, M. Rabi, Total evidence analysis and body size evolution of extant and extinct tortoises (Testudines: Cryptodira: Pan-Testudinidae). Cladistics 34, 652-683 (2018).
R. C. Thomson, P. Q. Spinks, H. B. Shaffer, A global phylogeny of turtles reveals a burst of climate-associated diversification on continental margins. Proc Natl Acad Sci USA 118 (2021).
R. R. Schoch, H.-D. Sues, A Middle Triassic stem-turtle and the evolution of the turtle body plan. Nature 523, 584-587 (2015).
Y. Wang, N. H. Youssef, M. B. Couger, R. A. Hanafy, M. S. Elshahed, J. E. Stajich, Molecular dating of the emergence of anaerobic rumen fungi and the impact of laterally acquired genes. mSystems 4, e00247-00219 (2019).
C. B. Stanford et al., Turtles and tortoises are in trouble. Curr Biol 30, R721-R735 (2020).
D. Young et al., Simultaneous metabarcoding and quantification of Neocallimastigomycetes from environmental samples: Insights into community composition and novel lineages. Microorganisms 10, 1749 (2022).
P. D. Schloss et al., Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol 75, 7537-7541 (2009).
P. J. McMurdie, S. Holmes, phyloseq: An R package for reproducible interactive analysis and graphics of microbiome census data. PLOS ONE 8, e61217 (2013).
C. J. Pratt, E. E. Chandler, N. H. Youssef, M. S. Elshahed, Testudinimyces gracilis gen. nov, sp. nov. and Astrotestudinimyces divisus gen. nov, sp. nov., two novel, deep-branching anaerobic gut fungal genera from tortoise faeces. Int J Syst Evol Microbiol 73, doi: 10.1099/ijsem.1090.005921. (2023).
L. Fu, B. Niu, Z. Zhu, S. Wu, W. Li, CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28, 3150-3152 (2012).
M. Manni, M. R. Berkeley, M. Seppey, F. A. Simão, E. M. Zdobnov, BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol 38, 4647-4654 (2021).
R. A. Hanafy, Y. Wang, J. E. Stajich, C. J. Pratt, N. H. Youssef, M. H. Elshahed, Phylogenomic analysis of the Neocallimastigomycota: Proposal of Caecomycetaceae fam. nov., Piromycetaceae fam. nov., and emended description of the families Neocallimastigaceae and Anaeromycetaceae. bioRxiv 2022.07.04.498725; doi: https://doi.org/10.1101/2022.07.04.498725 (2022).
C. L. Murphy et al., Horizontal gene transfer forged the evolution of anaerobic gut fungi into a phylogenetically distinct gut-dwelling fungal lineage. . Appl Environ Microbiol 85, e00988-00919 (2019).
C. H. Haitjema et al., A parts list for fungal cellulosomes revealed by comparative genomics. Nat Microbiol 2, 17087 (2017).
S. E. Wilken et al., Experimentally validated reconstruction and analysis of a genome-scale metabolic model of an anaerobic Neocallimastigomycota fungus. mSystems 16, e00002-00021 (2021).
Y. Li et al., Combined genomic, transcriptomic, proteomic, and physiological characterization of the growth of Pecoramyces sp. F1 in monoculture and co-culture with a syntrophic methanogen. Front Microbiol 10, 435 (2019).
A. Rambaut, A. J. Drummond, D. Xie, G. Baele, M. A. Suchard, Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst Biol 67, 901-904 (2018).
M. A. Suchard, P. Lemey, G. Baele, D. L. Ayres, A. J. Drummond, A. Rambaut, Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol 4, vey016 (2018).
M. Kanehisa, Y. Sato, K. Morishima, BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J Mol Biol 428, 726-731 (2016).
V. Thumuluri, J. J. Almagro Armenteros, Alexander R. Johansen, H. Nielsen, O. Winther, DeepLoc 2.0: multi-label subcellular localization prediction using protein language models. Nucl Acids Res 50, W228-W234 (2022).
C. H. Haitjema et al., A parts list for fungal cellulosomes revealed by comparative genomics. Nat Microbiol 2, 17087 (2017).
J. R. Wiśniewski, A. Zougman, N. Nagaraj, M. Mann, Universal sample preparation method for proteome analysis. Nat Methods 6, 359-362 (2009).
J. Cox, M. Mann, MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 26, 1367-1372 (2008).
R. A. Hanafy, Y. Wang, J. E. Stajich, C. J. Pratt, N. H. Youssef, M. S. Elshahed, Phylogenomic analysis of the Neocallimastigomycota: proposal of Caecomycetaceae fam. nov., Piromycetaceae fam. nov., and emended description of the families Neocallimastigaceae and Anaeromycetaceae. Int J Syst Evol Microbiol 73 (2023).
M. B. Couger, N. H. Youssef, C. G. Struchtemeyer, A. S. Liggenstoffer, M. S. Elshahed, Transcriptomic analysis of lignocellulosic biomass degradation by the anaerobic fungal isolate Orpinomyces sp. strain C1A. Biotechnol Biofuels 8, 208 (2015).
K. V. Solomon et al., Early-branching gut fungi possess a large, comprehensive array of biomass-degrading enzymes. Science 351, 1192-1195 (2016).
J. L. Brown et al., Co‑cultivation of the anaerobic fungus Caecomyces churrovis with Methanobacterium bryantii enhances transcription of carbohydrate binding modules, dockerins, and pyruvate formate lyases on specific substrates. Biotechnol Biofuels 14, 234 (2021).
L. Shoemaker, A. Clauset, Body mass evolution and diversification within horses (family Equidae). Ecol Lett 17, 211-220 (2014).
E. Eizirik, W. J. Murphy, S. J. O'Brien, Molecular dating and biogeography of the early placental mammal radiation. J Hered 92, 212-219 (2001).
T. J. Hackmann, J. N. Spain, Ruminant ecology and evolution: Perspectives useful to ruminant livestock research and production. J Dairy Sci 93, 1320-1334 (2010).
N. S. Heckeberg, The systematics of the Cervidae: a total evidence approach. PeerJ 8, e8114 (2020).
B. G. Lovegrove, K. D. Lobban, D. L. Levesque, Mammal survival at the Cretaceous-Palaeogene boundary: metabolic homeostasis in prolonged tropical hibernation in tenrecs. Proc Biol Sci 281, 20141304 (2014).
K. D. Rose, The beginning of the age of mammals (Johns Hopkins University Press, Baltimore, 2006).
R. I. Mackie, Mutualistic fermentative digestion in the gastrointestinal tract: diversity and evolution. Integr Comp Biol 42, 319-326 (2002).
D. B. Nicholson, P. A. Holroyd, R. B. J. Benson, P. M. Barrett, Climate-mediated diversification of turtles in the Cretaceous. Nat Commun 6, 7848 (2015).
J. Claesen et al., A Cutibacterium acnes antibiotic modulates human skin microbiota composition in hair follicles. Sci Translat Med 12, eaay5445 (2020).
I. Khan et al., Mechanism of the gut microbiota colonization resistance and enteric pathogen infection. Front Cell Infect Microbiol 11, 716299 (2021).
T. Hodkinson, "Evolution and taxonomy of the grasses (Poaceae): A model family for the study of species-rich groups". (2018), 10.1002/9781119312994.apr0622, pp. 1-39.
R. Elliott, A. J. Ash, F. Calderon-Cortes, B. W. Norton, T. Bauchop, The influence of anaerobic fungi on rumen volatile fatty acid concentrations in vivo. J Agri Sci 109, 13-17 (1987).
S. A. Huws et al., Addressing global ruminant agricultural challenges through understanding the rumen microbiome: past, present, and future. Front Microbiol 9, 2161 (2018).
J. V. Nolan, R. A. Leng, D. I. Demeyer, The roles of protozoa and fungi in ruminant digestion: Proceedings of an International Seminar Held at the University of New England, Armidale, Australia, 26-29th September, 1988 (Penambul Books, 1989).
S. Calkins, N. C. Elledge, R. A. Hanafy, M. S. Elshahed, N. Youssef, A fast and reliable procedure for spore collection from anaerobic fungi: Application for RNA uptake and long-term storage of isolates. J Microbiol Meth 127, 206-213 (2016).
C. A. Hooker, R. Hanafy, E. T. Hillman, J. Muñoz Briones, K. V. Solomon, A genetic engineering toolbox for the lignocellulolytic anaerobic gut fungus Neocallimastix frontalis. ACS Synth Biol 12, 1034-1045 (2023).

There is NO Competing Interest.

DS1copy.xlsx
Dataset 1
DS2copy.xlsx
Dataset 2
DS3copy.xlsx
Dataset 3
DS4copy.xlsx
Dataset 4
DS5copy.xlsx
Dataset 5
SuppDoc7NoMethodsy.pdf
Supplementary document

Download PDF

Journal Publication

published 28 Mar, 2024

Read the published version in Nature Communications →

Version 1

posted

You are reading this latest preprint version

Anaerobic fungi in the tortoise alimentary tract illuminate early stages of host-fungal symbiosis and Neocallimastigomycota evolution

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Materials and Methods

Results

Discussion

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1