3.1. Overview of mitogenomic organization in amphibians
Numerous errors in the annotation were identified and reviewed. Error types include but are not limited to abnormal reading direction (strand), erroneous gene designations, missing gene or other feature annotations, mistaken identity of trnL1/trnL2 and trnS1/trnS2 tRNAs, and inconsistencies in gene names. The most misleading situation is when the errors were included in RefSeq [45]. After data cleaning, error checking and taxonomic reconciliation, we obtained a dataset containing 1777 samples representing 710 species in 244 genera under 65 families, yielding 88 gene rearrangement types (Table S1). Species coverage within each genus was variable, with a mean of 35% and a median of 25%. Among them, Caudata was the richest (mean 58%, median 50%), and Neobatrachia was the least abundant (mean 23%, median 13%) (Fig. 1 and Table S3).
The unexpected complexity and variability among amphibian mitogenomes were revealed (e.g., Fig. 2 and Table S2). The major lineages of both Gymnophiona and Archaeobatrachia, as well as a few Caudata, were identical to the typical vertebrate gene order (orange circle in Fig. 1a), which was named the Amphibian ground pattern herein and labeled type 1 in additional tables. In addition, both Caudata and Neobatrachia have their own autapomorphies. The Caudata ground pattern (type 2, red circle) features one more CR derived between trnT and trnP. The Neobatrachian ground pattern (type 21, blue circle) has a strongly rearranged gene order, involving the long-term translocation of trnL1 to form the LTPF tRNA gene cluster located upstream of the 12S rRNA gene [46].
In contrast to all 10 Caudata families, which have consistently retained the ground pattern among their members, only 24 of 35 Neobatrachia families show the Neobatrachia ground pattern. Among these, 9 families have evolved new types. The remaining 11 families exhibited completely derived types, leading to a total of 69 rearranged types. This makes Neobatrachia the most variable group, followed by Archaeobatrachia with 10 types, Caudata with 8, and Gymnophiona with 5, as shown in Fig. 2a.
The most common rearrangement events observed in this study involved nad5, trnTP, trnL1 and CR [46]. Inversions are rare relative to translocations. Notably, consistent with findings from long-read sequencing, multiple rearrangement types involved duplication of genes or gene clusters, as well as repetition of CR or a combination of both [47]. Among them, trnM duplications were the most common, followed by trnP duplications, as detailed in Fig. 2b and Table S2.
3.2. Mitogenomic structural changes across four lineages
Mosaic evolution and lineage specificity coexisted throughout mitogenome evolution in amphibians. In particular, the ground patterns of vertebrate (n = 162) and Caudata (n = 415) are shared among distantly related species of Gymnophiona, Caudata, and Archaeobatrachia. However, both patterns are notably absent in Neobatrachia. Moreover, the 67 derived types that evolved from the Neobatrachian ground pattern (n = 756) are unique to this lineage and are not observed in the other three lineages. These observations challenge our understanding of the organization of the ancestral amphibian mitogenome. Both Rhinatrematidae and Ichthyophiidae (Uraeotyphlus only), the two most basal families of Gymnophiona, possess the Caudata ground pattern [48]. Given the rearrangements found in Caudata and Archaeobatrachia, it is difficult to rule out the possibility that the ancestral state of the amphibians was a Caudata ground pattern. However, the majority of Gymnophiona, with the exception of the aforementioned families, exhibit this vertebrate pattern. Therefore, applying the principle of parsimony, the most plausible hypothesis is that the ground pattern of amphibian mitogenomes is a vertebrate pattern.
Almost all mitogenomic structural changes in Gymnophiona are genus specific, except for Boulengerula, which has interspecific variation, with one of the two species having a duplicated trnP (Type 3). There are also two types (types 4 and 5) concentrated in the trnWANCY clusters that are restricted to two genera in Siphonopidae. The Caudata ground pattern is shared by all 10 Caudata families, yet two of them have derived taxa. Among the 168 Caudata species examined, there is intrageneric differentiation in the genus Tylototriton, and two of these species are type 11, involving the expansion of the CR and its flanking genes [14]. The seven Plethodontidae species from Tylototriton, Aneides, Hydromantes and Plethodon are all intragenerially diverse to types 6–9, and the single-sample Stereochilus is type 10. Archaeobatrachia comprises four successively evolved lineages, all of which can be traced back to the ancestral amphibian type. Two out of four genera of Pipidae, Pipa and Xenopus, possess both Type 1 and Type 2. Derived types 20 (swaps of nad6-trnE and cob-trnT) were found in both examined species of Leiopelmatida. The more diverse Megophryidae show an additional seven derived types occurring in Leptobrachium (Type 14), Leptobrachella (Type 13) [49, 50], Scutiger (Type 17) [51] and Oreolalax [49, 52, 53], which intragenetically diversified into five derived types. These changes primarily involve the duplication of trnM [46] and/or the remote transposition of trnW.
There is very high variation in the genomic structure of Neobatrachia. Whether derived in terms of majority rules or the status of the outgroups, the ancestral type should contain a unique rearrangement in which trnLTP forms a cluster of genes located between the CR and trnF. In addition to type 21, which is shared by 24 families and is therefore considered to be the ground pattern, there is another type shared by four distant families (type 48), which differs from the ground pattern only by an additional CR between nad5 and nad6. Another pair of sister groups shares type 27, which has an additional interchange of trnA and trnN on top of the ground pattern. The remaining 66 types other than these three are unique to a given family, meaning that they are no longer shared between families [46]. The clade comprising Dicroglossidae [17, 46, 54–56], Ranidae [55, 57–63] and Rhacophoridae [64–66] exhibits the strongest changes, encompassing nearly half of the derived types (32 in total) with Breviceps, Hyperolius, Ptychadena, Cornufer, Limnonectes, Amolops, Nidirana, Odorrana, Rana, Polypedates, Nanorana, Quasipaa, and Nanorana presenting intrageneric changes and the latter two even presenting intraspecific changes [17, 18]. Nested within this clade, Mantellidae and Ranixalidae each harbor family-specific types. Other intrageneric changes occur in Cornufer, Ptychadena [67], Hyperolius [9], Breviceps [9, 68], Ischnocnema [69], Brachycephalus, Boana, and Bokermannohyla and reach an extreme of nine types in Pristimantis. The changes are varied and involve almost all regions of the mitochondria. The most notable variable regions involved the tRNA gene cluster WANCY [70]; in particular, multiple OLs were identified in the clusters [71].
3.3. Quantitative analysis of mitogenomes changes across amphibians
The hotspot is depicted in Fig. 1b on the heatmap plotted against the relative frequency (RF), detailed in additional table S5. Each amphibian group is represented by a different color, with purple indicating the entire amphibian group. The gradient of color shading reflects the relative RF score. Among all the genes, trnL1 exhibited the highest score due to its long-range translocation across various amphibian taxa. However, when examined at a finer taxonomic level, trnL1 rarely undergoes changes within that specific taxon. It should be noted that hotspots differ between taxa. For instance, Gymnophiona primarily experiences changes in the trnWANCY gene cluster, while Archaeobatrachia scores highest for the flanking genes of nad2. Neobatrachia displayed significantly more variable regions than did the other taxa, with almost all the genes exhibiting some degree of rearrangement except for cox2, atp6, and nad4l, which had RF scores of 0. Notable hotspots common to all three taxa may be confined to the region between nad6 and trnF, including CR [16].
Further analysis of the RS distribution (Fig. 1c and Table S2) highlighted the significant variation in mitogenomes architecture among amphibians. Gymnophiona, the most primitive group, exhibited the lowest median RS intensity, suggesting a relatively low level of genomic change. Caudata has a greater median RS than Archaeobatrachian, possibly due to Pyxicephalus adspersus, which undergoes strong rearrangement with gene duplication, resulting in 49 genes [12]. This extreme value elevates the overall distribution profile with score peaks at 50 (qGO) and 46 (qMGR). However, disregarding this extreme value, both taxa exhibited relatively uniform distributions of RS that were generally greater than those observed in Gymnophiona. It is worth noting that although both Caudata and Archaeobatrachia had a minimum RS value of 0, considering that type 1 in Cadauta evolved from ancestral type 2, this score varied when type 2 was changed to the reference. The multimodal distribution observed in both Archaeobatrachia and Neobatrachia indicates heterogeneity, potentially suggesting divergent evolutionary directions. A more intriguing alternative inference concerns the emergence of Neobatrachia, which disrupts continuity and gives rise to fluctuations in Archaeobatrachia. Neobatrachia, as the most recently evolved lineage, stands out for its high RS values and several extreme values. There are ten values above 20, a magnitude that far exceeds that of any other taxon, suggesting that it may have evolved in an aggressive way, avoiding the fatal decrease in evolutionary potential. This is consistent with the evolutionary pattern observed in other vertebrates, where more recently evolved taxa tend to have greater complexity. The extremely high RS may also indicate that the taxa have undergone rapid adaptive radiation or evolutionary innovation, resulting in significant increases in complexity.
3.4. Evolutionary dynamics underlying patterns of diversification
By aligning the RS of each taxon with the time of origin of its corresponding branch and aggregating the RS values over time, a growth curve can be generated (Fig. 3 and Table S5). A comparison between the growth curves for all amphibians and those for each individual taxon revealed a consistent pattern of stepwise growth over time. This suggests that there were distinct periods of rapid expansion followed by relatively stable phases in amphibian evolution.
Interestingly, while the absolute values of RS exhibited considerable variation across taxa, they demonstrated remarkably consistent patterns of increase, except for Gymnophonian, which consistently experienced a step change during the Cretaceous period. This indicates that Gymnophiona underwent unique evolutionary events or adaptations during this specific period. Furthermore, when examining the remaining three branches, it becomes evident that they all display a rapid increase with nearly comparable slopes soon after the beginning of the paleogene. This sudden surge in their RS values is preceded by an extended period of stagnation and a minor, limited magnitude increment occurring around or prior to the mid-Cretaceous epoch. The long periods of silencing in the early clade may indicate an underestimation of potential diversity for various reasons, such as extinctions associated with specific rearrangements.