Genome organization
The complete mitogenomes of the three newly sequenced Sitta species are very similar to each other. All three mitogenomes are closed circular molecules ranging from 16,822 to 16,830 bp in length, consisting of 13 PCGs, 22 tRNAs, 2 rRNAs and a control region (Fig. 1). The gene order of mitogenomes of all three species analyzed are highly conserved (Fig. 1), which is also identical to the gene order found in the mitogenome of Gallus gallus [16]. For the whole mitogenomes of the three species, one PCG (nad6) and eight tRNAs (trnQ, trnA, trnN, trnC, trnY, trnS2(UCN), trnP, trnE) are encoded by the L-strand, while all the other genes are encoded by the H-strand. The comparison of the three Sitta species discovered the longest overlap (10 bp) between atp8 and atp6. The longest intergenic spacer (21 bp) is located between trnV and rrnL in mitogenome of S. himalayensis (Table 1). Similar to other typical vertebrates [17], the mitogenomes of the three Sitta species show a significant bias towards A and T, with the nucleotide composition of A and T ranging from 53.1–55.7%. The AT-skew and the GC-skew of the whole mitogenomes of three Sitta species are 0.10 to 0.13 and -0.39 to -0.35, respectively (Table 2).
Table 1
Annotation of the complete mitogenome of the three Sitta species in this study
Gene | Start codon | Stop codon | Anti-codon | Strand | Intergenic Nucleotides (IGN) |
trnF | | | GAA | H | -1 |
rrnS | | | | H | 0(a)/-1(b,c) |
trnV | | | TAC | H | 21(a)/2(b,c) |
rrnL | | | | H | 2(a)/1(b)/7(c) |
trnL2(UUR) | | | TAA | H | 11(a,c)/12(b) |
nad1 | ATG | TAA/TA(A)/TAG | | H | 7(a,c)/10(b) |
trnI | | | GAT | H | 6 |
trnQ | | | TTG | L | 4(a)/3(b)/2(c) |
trnM | | | CAT | H | 1(a,b)/0(c) |
nad2 | ATG | TAA | | H | 1 |
trnW | | | TCA | H | 1 |
trnA | | | TGC | L | 10 |
trnN | | | GTT | L | 2(a,c)/11(b) |
trnC | | | GCA | L | -1 |
trnY | | | GTA | L | 1 |
cox1 | GTG | AGG | | H | -9 |
trnS2(UCN) | | | TGA | L | 4 |
trnD | | | GTC | H | 10(a)/11(b,c) |
cox2 | ATG | TAA | | H | 1 |
trnK | | | TTT | H | 1 |
atp8 | ATG | TAA | | H | -10 |
atp6 | ATG | TAA | | H | 9(a,b)/11(c) |
cox3 | ATG | TA(A) | | H | -1 |
trnG | | | TCC | H | 0 |
nad3 | ATG | TAA | | H | -1 |
trnR | | | TCG | H | 1 |
nad4L | ATG | TAA | | H | -7 |
nad4 | ATG | T(AA) | | H | 0 |
trnH | | | GTG | H | 0 |
trnS1(AGN) | | | GCT | H | -1 |
trnL1(CUN) | | | TAG | H | 1(a)/0(b,c) |
nad5 | ATG | AGA | | H | 11 |
Cytb | ATG | TAA | | H | 6(a)/3(b,c) |
trnT | | | TGT | H | 7(a)/8(b,c) |
trnP | | | TGG | L | 14(a,b)/6(c) |
nad6 | ATG | TAG | | L | 0 |
trnE | | | TTC | L | 4(a)/5(b,c) |
CR | | | | H | 279(a)/282(b)/266(c) |
Note: H and L refer to the heavy and light strands, CR = control region. ‘/’ indicate type of intergenic nucleotides in Sitta species. S. himalayensis (a), S. nagaensis (b), S. yunnanensis (c). |
Table 2
Nucleotide composition of the mitochondrial genome of the three Sitta species in this study
Species | Whole genome | PCGs | tRNAs | 16S rRNA | 12S rRNA | Control region |
A+T (%) | AT skew | GC skew | A+T (%) | AT skew | GC skew | A+T (%) | AT skew | GC skew | A+T (%) | AT skew | GC skew | A+T (%) | AT skew | GC skew | A+T (%) | AT skew | GC skew |
S. himalayensis | 53.9 | 0.13 | -0.38 | 53.1 | 0.07 | -0.41 | 58.3 | 0.03 | 0.02 | 55.6 | 0.24 | -0.12 | 51.2 | 0.18 | -0.11 | 54.6 | -0.15 | -0.22 |
S. nagaensis | 53.1 | 0.13 | -0.39 | 52.0 | 0.09 | -0.42 | 58.0 | 0.04 | 0.01 | 55.6 | 0.24 | -0.11 | 51.5 | 0.18 | -0.11 | 53.3 | -0.12 | -0.22 |
S. yunnanensis | 55.7 | 0.10 | -0.35 | 55.2 | 0.04 | -0.38 | 58.3 | 0.04 | 0.01 | 56.4 | 0.23 | -0.10 | 52.2 | 0.18 | -0.11 | 55.5 | -0.15 | -0.22 |
Protein-coding genes and codon usage
In the three Sitta species, the scope of A + T content in PCGs is between 52.0% and 55.2% (Table 2). The start codons and stop codons of the 13 PCGs are mostly the same among the three species. cox1 uses GTG as the start codon, while the rest twelve PCGs initiate strictly with the standard start codon ATG (Table 1). There are six kinds of stop codons (TAA, AGG, AGA, TAG, TA*, T**) included in the mitogenome of the three Sitta species. For the incomplete stop codons, the missing nucleotides may be the result of post-transcriptional polyadenylation [18], which is common in animal mitogenomes and could produce functional stop codons by polycistronic transcription cleavages and polyadenylation mechanisms [18, 19]. The PCG nad1 in the mitochondrial genome of S. nagaensis contains the incomplete stop codon TA*, while TAN (N represents A, G) occurs in the other two species. Except for nad1, all of the other PCGs use the same stop codon across the three Sitta species.
The relative synonymous codon usage (RSCU) of 13 PCGs in the three newly sequenced mitogenomes was calculated. As shown in Fig. 2, CGA (Arg) and CUA (Leu1) are most commonly used in all three Sitta species. The highest value of RSCU is 3.86 of CGA in S. yunnanensis and the lowest value of RSCU is 0.03 of UAG in S. himalayensis and S. nagaensis. In addition, analysis of the RSCU values for the 13 PCGs indicates an AT bias. As for PCGs, the AT bias can be attributed to the frequent use of NNA and NNU (N represents A, T, C, G) codons [20]. The A + T content in PCGs of S. yunnanensis is slightly higher than that of the other two species, and the use of NNA and NNU codons is also more common in S. yunnanensis.
The tRNA genes and rRNA genes
The 22 tRNAs of the three Sitta species are typical and include all 20 types of amino acids, ranging from 66 to 75 bp in size. And the total length of the 22 mitogenome tRNAs of S. nagaensis is 1542 bp, which is the same as that of S. yunnanensis and only one-base different from that of S. himalayensis. The A + T content of the total mitogenome tRNAs of S. nagaensis is 58.0%, which is lower than that of S. himalayensis (58.3%) and S. yunnanensis (58.3%) (Table 2). The tRNAs of three Sitta species were all predicted to fold into typical cloverleaf secondary structures. Furthermore, mismatched base pairs were identified in the stems of 22 different tRNAs, most of which were G-U pairs.
In the mitogenome of the three Sitta species, the 16S rRNA is located between trnV and trnL2(UUR), ranging from 1575 to 1592 bp in length, while the 12S rRNA is located between trnF and trnV, ranging from 977 to 980 bp. The longest 16S rRNA was found in S. nagaensis and the shortest in S. himalayensis, while the longest 12S rRNA was discovered in S. yunnanensis and the shortest in S. himalayensis. The A + T contents of 16S rRNA and 12S rRNA range from 55.6–56.4% and from 51.2–52.2% respectively (Table 2).
The control region
The control region of the three species is located between trnE and trnF genes (Fig. 1). The size of control region of S. yunnanensis is 975 bp, which is longer than that of S. himalayensis (945 bp) and S. nagaensis (945 bp). The A + T content of the control region ranges from 53.3–55.5% (Table 2). The AT-skew is -0.15 to -0.12 and the GC-skew is -0.22, and the A + C content is higher than the T + G content. In this study, we analyzed the control region of three Sitta species and the predicted structures are shown in Figure 3. The entire control region contains three structural domains, namely Domain Ⅰ, Domain Ⅱ and Domain Ⅲ. Domain Ⅱ is relatively conservative, while Domain Ⅰ and Domain Ⅲ are heterogeneous across species in terms of nucleotide composition and size [21].
Domain Ⅰ includes extended termination-associated sequences such as ETAS1 and ETAS2 and CSB1-like sequences. Domain Ⅱ is the central conserved domain in the control region, including six conserved sequence blocks (F-box, E-box, D-box, C-box, b-box and B-box). Domain Ⅲ includes CSB1 sequence and light/heavy strand promoter (LSP/HSP), which are located at 911-929 bp in the control region (Figure 3).
Phylogenetic Analyses
Based on the concatenated nucleotide sequences of 13 PCGs, the phylogenetic analyses of 18 Passeriformes mitogenome sequences were performed, with one of the Regulus species as the outgroup (Table 3). BI and ML analyses generated similar tree topologies, so the topology of the BI tree is shown (Fig. 4). The results of this study indicate that Sittidae is closely related to Troglodytidae (1.00 posterior probability and 95% bootstrap value). Muscicapidae and Turdidae were herein corroborated to be sister groups (1.00 posterior probability and 100% bootstrap value). These results are consistent with the work of Barker on the sister groups of Sittidae and Troglodytidae [14].
Table 3
List of 18 species used for the phylogenetic analyses in this study
Species | GenBank NO. | Mitogenome Size (bp) | Total A+T % | References |
Sittidae |
Sitta nagaensis | MK343427 | 16,828 | 53.1 | This study |
Sitta europaea | MN356255 | 16,827 | 53.2 | Unpublished |
Sitta himalayensis | MK343426 | 16,822 | 53.9 | This study |
Sitta villosa | MT444149 | 16,816 | 55.4 | [22] |
Sitta yunnanensis | MN052793 | 16,830 | 55.7 | This study |
Sitta carolinensis | NC_024870 | 16,826 | 54.8 | [23] |
Troglodytidae |
Campylorhynchus zonatus | NC_022840 | 16,780 | 50.6 | [24] |
Campylorhynchus brunneicapillus | NC_029482 | 16,786 | 50.3 | Unpublished |
Henicorhina leucosticta | NC_024673 | 16,727 | 52.9 | [25] |
Muscicapidae | | | | |
Oenanthe isabellina | NC_040290 | 16,812 | 52.7 | [26] |
Ficedula zanthopygia | JN018411 | 16,794 | 53.2 | Unpublished |
Niltava davidi | NC_039538 | 16,770 | 54.2 | Unpublished |
Cyanoptila cyanomelana | HQ896033 | 16,802 | 53.0 | Unpublished |
Copsychus saularis | NC_030603 | 16,827 | 52.7 | [27] |
Turdidae |
Turdus kessleri | NC_041095 | 16,754 | 52.8 | [28] |
Turdus eunomus | NC_028273 | 16,737 | 52.7 | Unpublished |
Geokichla sibirica | MK377247 | 16,766 | 52.3 | [29] |
Regulidae |
Regulus regulus | NC_029837 | 16,847 | 55.5 | Unpublished |
In the genus Sitta, S. nagaensis and S. europaea were found to be the sister to the S. himalayensis (1.00 posterior probability and 100% bootstrap value). S. villosa is the sister to S. yunnanensis (1.00 posterior probability and 100% bootstrap value). All datasets supported a monophyletic clade of S. carolinensis, which was placed at the basal position of the genus Sitta (1.00 posterior probability and 100% bootstrap value). These results are generally identical to the previous study conducted by Pasquet et al. [30]. Currently, published mitochondrial genome data of Sitta species are very limited, so mitochondrial genomes of more Sitta species should be sequenced to better elucidate these phylogenetic relationships.