Metagenome Functional Profiles
Mt. Erebus has evolved over time, starting with seafloor rifting and growing as a subaerial volcano into a modern-day stratovolcano (7, 8). The current conditions at Tramway Ridge differ remarkably from other extant non-Antarctic terrestrial and marine hydrothermal systems, being primarily driven by the unique phonolite magmatic source resulting in alkaline fumaroles with low sulphidic content. It is unknown how long geothermal features such as the fumaroles at Tramway Ridge have been present on Mt. Erebus; however, volcanism was once widespread across the Ross Island massif (6). Extant geothermal features may represent a once widespread ecosystem of similar features. Given this complex history of Mt. Erebus, we questioned whether the Tramway Ridge community has retained a legacy signature of its origin on the seafloor or if the modern-day Tramway Ridge community better resembles other terrestrial hydrothermal sites.
To answer this question, we compared functional profiles of assembled Tramway Ridge metagenomes to a large set of publicly available metagenomes. Metagenomes from Tramway Ridge were distinct from all others (Fig. 2c) but showed the most similarity to terrestrial hydrothermal environments such as hot springs and associated sediments. This indicates that the legacy of a sea floor origin is less important than the extant conditions at Tramway ridge in defining the microbial community. This is also reflected in the prevalence of taxa at Tramway Ridge that are found in other terrestrial hydrothermal environments but not in seafloor hydrothermal systems, such as the genera Mastigocladus, Caldithermus, Meiothermus and phyla such as the Chloroflexota, Armatimonadota and Actinobacteriota.
The unique nature of Tramway Ridge metagenomic profiles likely reflects the site’s relatively low diversity (2) and the type of functional profile analysis used here. First, pfams were used, which are very coarse functional units. Second, gene abundance was not available for most metagenomes, so we were forced to use presence-absence as data. Third, we found that the use of pfam presence-absence was sensitive to metagenome assembly size as well as the total number of unique pfams detected. In combination, these features had the potential to obfuscate any meaningful relationships. Our analysis was optimized to be as permissive as possible, allowing as many metagenomes into the analysis while still comparing like against like. Using this strategy, we were able to compare Tramway Ridge metagenomic datasets to 4513 out of 7652 publicly available and unrestricted metagenomes (see 10.5281/zenodo.10928974).
Within continental Antarctica, only three known surface-expressed active geothermal areas exist (Mt. Rittmann, Mt. Melbourne and Mt Erebus), each of which is separated by vast ice fields (15). It is thought that intercontinental transport of microbe-bearing particulates into Antarctica occurs much less frequently than intracontinental transport (62), suggesting that the introduction of exogenous microbes is relatively infrequent. Recent studies have used database searches to identify endemic species (63), and a lack of sequence identity to database entries has been used in the past to suggest that novel sequences indicate endemism (2). However, defining endemism based on whether sequence matches exist within a database can be problematic as this definition is sensitive to database composition. Conclusions drawn may not withstand the inevitable growth in database size and the diversity it holds. For this study, we took an alternative approach and defined the degree of endemism of a given taxon as being proportional to the in situ diversity of that taxon. For these inferences, we assumed that rare colonization events have been limited to single clones due to the extreme isolation of Mt. Erebus. Therefore, populations arising from recently introduced taxa would be expected to exhibit relatively low levels of genetic polymorphism and endemic microbial populations would be expected to show high levels of genetic polymorphism. In particular, we focused on accumulations of synonymous mutations, which were assumed to be under reduced selection pressure.
We developed the endemicity index (EI) (Fig. 3F) to assess the diversity of a microbial population represented by a MAG. Similar calculations have been successfully applied to approximate effective population size and genomic fluidity (64). High values (e.g. 10− 2) of EI indicate high diversity which we interpreted as reflecting a neutral evolutionary process occurring under minimal contemporary selection pressures. Low values (e.g. 10− 6) indicate low diversity, which we interpreted as possible evidence of a relatively recent arrival, a local population bottleneck, or a recent selective sweep.
A cyanobacterium belonging to either the Fischerella or Mastigocladus genus recovered the lowest median EI value (1 x 10− 5) for a single species. This species dominated the near-surface. Although difficult to distinguish these two genera based on 16S rRNA gene sequence and GTDB classifies all members of the genera Fischerella and Mastigocladus as Fisherella, the distinction of Mastigocladus is recognized as a distinct genus by the List of Prokaryotic names with Standing in Nomenclature (LPSN). Therefore we classified this MAG as a member of the Mastigocladus genus to be consistent with the classification of specimens collected from Tramway Ridge in the past (65). The low endemicity index observed for this taxon was consistent with previous studies that showed that the global phylogeography of Mastigocladus reflects a geologically recent radiation from Yellowstone National Park, USA (66) and that the surface-associated microbial community at Tramway Ridge is likely dominated by aeolian-distributed cosmopolitan members of non-Antarctic temperate and terrestrial hydrothermal soil communities (2).
We identified Candidatus Australlarchaeum erebusii (ERB_5_1) and Candidatus Fervidibacteria erebusii (ERB_15_1) as two MAGs with relatively high EI values and abundance in the subsurface at depths greater than 2 cm (Fig. 3f, Fig. 3g). In our earlier amplicon-based study, these two taxa were similarly identified as dominant, endemic, and associated with the subsurface, but were referred to as “Thaumarchaeota-like archaeon” and OCtSpA1-106 respectively (2).
The most abundant endemic, subsurface-associated organism recovered was Candidatus A. erebusii (ERB_5_1), a member of the Nitrososphaeria (formerly Thaumarchaeota). This class of Archaea is a globally distributed group that is best known for the chemolithoautotrophic oxidation of ammonia (67) and for the apparent universal synthesis of cobalamin (68). However, several deeply divergent lineages of Nitrososphaeria identified through the construction of MAGs (45, 69, 70) and a single cultivated species, Candidatus Conexivisphaera calidus NAS-02 (71) have been shown to lack these hallmark attributes. These deeply diverging lineages have been predicted to be predominantly anaerobic heterotrophs (69, 70) capable of beta oxidation of fatty acids and protein/peptide degradation (71, 72). Candidatus A. erebusii (ERB_5_1) is one of the deepest-branching members of the Nitrososphaeria (Fig. 3b) and like other deeply diverging lineages, it encodes genes for the beta oxidation of fatty acids and peptide degradation while lacking marker genes for cobalamin biosynthesis and ammonia oxidation (Supplementary Table 5). Candidatus A. erebusii may also employ amino acid-based oxidation, similar to a pathway used by Thermococcus kodakarensis (73) and a proposed alternative metabolism for Ca. Nitrosocaldus islandicus (74), a representative of thermophilic ammonia oxidizing archaea (AOA). In this proposed metabolism, glutamate could be utilized to generate both reducing power and ATP to power the cell.
However, Ca. A. erebusii is also predicted to respire oxygen, a unique prediction among its closest, presumably anaerobic relatives. It encodes two aa3-type (low-affinity) cytochrome C-oxidases which could presumably drive beta oxidation of fatty acids. However it is unclear whether aerobic respiration could be coupled with an amino acid degradation pathway, which is typically thought to be an anaerobic metabolism (73, 74). It is unclear if these energy-generating pathways are mutually exclusive and operate under specific oxidative conditions. At least during summer, oxygen levels in the subsurface are around 30% saturation (2) and therefore, no organisms inhabiting the fumaroles are likely to be obligate anaerobes. However, it is also reasonable to assume that the wet, steamy subsurface experiences anoxia at least at small spatial scales. Therefore, we hypothesize that Ca. A. erebusii switches between metabolic pathways depending on the oxic environment, using beta-oxidation of fatty acids when oxygen levels are sufficiently high and peptide fermentation when oxygen levels are low. CO metabolism is likely to be used for maintenance during times of nutritional stress, as has previously been shown for Antarctic soil microbes (49).
Another abundant and endemic subsurface MAG examined in detail belongs to the thermophilic genus Candidatus Fervidibacteria, which was first discovered in Octopus Spring, Yellowstone National Park (clone OctSpA1-106, (75). The genus is named after Ca. Fervidibacter sacchari, which encodes a large repertoire of carbohydrate-active enzymes (CAZymes) (76) and which has recently been isolated and described (59). Like other Ca. Fervidibacter, Ca. F. erebusii (ERB_15_1) encodes a significant number and diversity of CAZymes including a large, diverse cohort of the unusual and poorly characterized GH109 family. Previous findings suggest that these novel enzymes are involved in extracellular polysaccharide metabolism unique to thermophilic systems (59, 77, 78) with the diversity of Ca. F. erebusii GH109s suggesting a diverse and unique polysaccharide utilization profile typical of this genus. Interestingly, Ca. Fervidibacter genomes, including Ca. F. erebusii appear to also encode mechanisms that enable growth on casamino acids (59). The encoded hydrogenase and CODH are likely maintenance mechanisms to enable survival under carbon limitation with growth primarily supported through aerobic heterotrophic growth on saccharides and anaerobic growth on amino acids.