Phylogeography and ecological niche modelling implicate multiple micro-refugia of Swertia tetraptera during Quaternary glaciations

doi:10.21203/rs.3.rs-1980534/v1

Background

Climate fluctuations during the Pleistocene and mountain uplift are vital driving powers affecting the geographic distribution and population dynamics history of organisms. However, how did an annual plant react to Pleistocene glaciations was little to know.

Methods

In this study, we analyzed the population demographic history of the endemic QTP annual herb plant Swertia tetraptera Maxim (Gentianaceae). Phylogeographic analysis with species distribution modeling were combined to detect the genetic variations in S.tetraptera. In total, 301 individuals from 35 populations of S.tetraptera were analyzed based on two maternally inherited chloroplast fragments (trnL-trnF and trnS-trnG).

Results

The genetic diversity of S.tetraptera was high, which was caused by wide natural range, high proportion of endemic haplotypes and evolutionary history. Fifty-four haplotypes were identified in S.tetraptera. Only a few haplotypes were widespread (H4, H1, H3) which were dispersed throughout the present geographical range of S.tetraptera, while a lot of haplotypes were confined to single populations. The cpDNA dataset showed that the phylogeographic structure was lack across the distribution range of S.tetraptera. Meanwhile, analyses of molecular variance showed that most of genetic variation was found within populations (70.51%). In addition, the relationships of the haplotypes were almost completely not resolved by phylogenetic reconstruction. Both mismatch distribution analysis and neutrality tests showed a recent expansion across the distribution range of S. tetraptera. The MAXENT analysis showed S.tetraptera had a wider distribution range during the last glacial maximum and a narrower distribution range during the current, with predictions into the future showing the distribution range of S.tetraptera shrinking.

Conclusion

Our study implies current geographic and genetic distribution of S.tetraptera is likely to have been shaped by both QTP uplift and Quaternary periods. Multiple micro-refugia of S.tetraptera were existed during Quaternary glaciations. Rapid intraspecific diversification and hybridization and/or introgression may have played a vital role in shaping current distribution patterns of S.tetraptera. The distribution range of S.tetraptera appeared to have experienced expansion during the LGM; in the future, when the global climate becomes warmer with rising carbon dioxide, the distribution of S.tetraptera will expansion and migration to higher altitude.

cpDNA trnS-trnG

Swertia tetraptera

haplotypes

QTP

refugia

phylogeographic structure

The Qinghai-Tibetan Plateau (QTP), which covers about 2.5×10⁶ km² or one-quarter of China, is the world's largest and highest plateau with an average altitude of 4500 m above sea level [1]. In the early Tertiary, the uplift of the QTP was caused by the colliding of the Indian Plate with the Eurasian Plate [2]. Huge changes have taken place in the biota of the QTP and its neighbouring mountains due to the alteration of topography and local climate. Numerous species became extinct [3–4]. However, a biodiversity hotspot has become in the southern edge of QTP and the Hengduan Mountains Region [5]. In addition, a new and young biota has developed on the plateau [6–7]. Of alpine seed plant species, 34% are endemic to the QTP [8] whereas only 2.9% of the genera are endemic [6]. Where this young alpine flora came from and how the uplift of the plateau as well as the related climate changes have influenced their differentiation, evolution, and dispersal are the subject of continuing debate.

Using the phylogeographic method [9], which has been confirmed to be very efficacious in reconstructing the history of the distribution of species [10–14], the expansion and differentiation of some species on the QTP and its adjacent areas have been recently surveyed (e.g. [15–22]. However, these species are trees or perennial herbs, only few have concentrated on annual taxa [23–24]. Nevertheless, as stressed by Hamrick & Godt (1996) [25], the genetic diversity of plant species was affected by life-history traits. Compared with long-lived trees, annuals, with their specific life-history characteristics, may show different levels and structuring of genetic variation. The high selfing rate [26], a low mutation rate [27] and a short generation time are main characteristics of annuals that could influence level and structure of diversity. Furthermore, the failure to reproduce due to adverse conditions in a particular year may have a strong impact on the demography of a population of annuals, and as a result, annual species may be expected more bottlenecks. Researches of such species are therefore necessary to increase our understanding on how the Quaternary climate changes have affected range distributions and intraspecific divergence of alpine plants on the QTP.

Swertia tetraptera, belonging to the genus Swertia in the Gentiaceae, is an annual herb plant endemic to QTP. The mainly distribution of S.tertraptera is in Qinghai, Gansu, and Sichuan Provinces, occurring primarily in moist hillsides and shrub locations with an elevation of 2,000–5,000 m. The main characteristic that distinguish S. tetraptera from other Swertia plants is its heteromorphic flowers, that is, it includes two kinds of flowers in every plant: normal 'open pollinated' flowers and 'closed or cleistogamous' flowers. As an endemic species of the QTP, it formed within the strong uplift of the Qinghai-Tibet Plateau (about 3.97 Ma) [28]. Therefore, S.tetraptera is an indispensable part of the study of the influence of the uplift of the QTP on the distribution pattern of modern plants. By now, only few studies of S.tetraptera based on molecular biology have been reported [29–30]. Yang et al. (2011) [29] clarified the phylogeography of S.tetraptera based on only one chloroplast DNA (cpDNA) fragments. However, this previous study did not concentrate on sides on phylogeography structure and deep evolutionary history.

In the current study, 301 individuals from 35 populations of S. tetraptera were collected from the entire geographic distributions in the high-altitude QTP and adjacent areas. To characterize the population histories of this species, two chloroplast DNA (cpDNA) markers were used to detect the genetic variations. The aims were: (1) to explore the genetic structure and diversity level of S.tetraptera in Qinghai-Tibetan Plateau; (2) to identify evolutionary history of S.tetraptera; (3) to speculate on the reasons for the existing geographic distribution patterns of S.tetraptera.

Population sampling

During the summers of 2008 through 2009, we collected the samples throughout the range of S.tetraptera.

Fresh leaves were collected from 35 populations and, with few exceptions, 4–12 individuals that were at least 50 m apart from each other were sampled from each population (Table 1, Fig. 1). We measured the longitude, latitude and altitude of each collection location using an Etrex Global Positioning System (Garmin). In total, 301 individuals were sampled and leaves were dried with silica gel. All samples were identified by Professor Guoying Zhou. Voucher specimens of all populations were deposited at the herbarium of the QTPMB (Qinghai-Tibetan Plateau Museum of biology), Xining, Qinghai Province, China.

Table 1

Geographic distributions, gene diversity, nucleotide diversity, and haplotype frequencies of cpDNA sequences for *Swertia tetraptera*
	Population code	Longitude	latitude	Sample number	Hd	Pi	Haplotype (no. of individuals)
Qilian, Qinghai	QL	100.27ºE	38.16ºN	8	0.67857	0.00068	H1(4), H4(3), H6(1)
Mengyuanqingshizui, Qinghai	MY1	101.41ºE	37.46ºN	7	0.80952	0.00134	H3(2), H4(3), H7(1), H10(1)
Mengyuanxianmi, Qinghai	MY2	102.01ºE	37.28ºN	4	0.50000	0.00083	H24(1), H54(3)
Mengyuanxianmi, Qinghai	MY3	102.02ºE	37.38ºN	11	0.80000	0.00321	H7(2), H16(3), H53(4), H54(2)
Huzhu, Qingahi	HZ	102.85 ºE	37.03ºN	11	0.18182	0.00015	H3(1), H4(10)
Gangcha, Qinghai	GC	101.18ºE	37.25ºN	7	0.71429	0.00291	H4(4), H16(1), H32(1), H42(1)
Datong, Qinghai	DT	101.53ºE	37.22ºN	9	0.80556	0.00101	H1(2), H2(1), H3 (1), H4 (4), H5(1)
Huangyuan, Qinghai	HY	101.37ºE	36.73ºN	10	0.86667	0.00106	H1(1), H3 (1), H4 (3), H6(3), H7(1), H8(1)
Huangzhong, Qinghai	HZH	101.70ºE	36.30ºN	7	0.66667	0.00087	H1(4), H4 (1), H6(2)
Pingan, Qinghai	PA	101.91ºE	36.32ºN	12	0.86364	0.00170	H1(1),H2 (1),H3 (4),H4(3),H7(1),H9(1),H10(1)
Hualong, Qinghai	HL	101.97ºE	36.27ºN	11	0.87273	0.00183	H1(4),H2(2),H6(1),H7(1),H11(1),H12(1),H13(1)
Ledu, Qinghai	LD	102.39ºE	36.66ºN	8	0.25000	0.00083	H1(7),H14(1)
Minghe, Qinghai	MH	102.69ºE	36.09ºN	9	0.94444	0.00381	H1(1),H4(4),H8(1),H13(2),H15(1),H16(2),H17(1)
Xunhua, Qinghai	XH	102.31ºE	35.74ºN	10	0.77778	0.00184	H1(1),H15(5),H16(1),H18(1),H19(1),H20(1)
Tongre, Qinghai	TR	101.94ºE	35.31ºN	4	0.50000	0.00041	H4(1),H6(3)
Zeku, Qinghai	ZK	101.87ºE	35.25ºN	9	0.69444	0.00073	H3(1),H4(1),H6(2),H7(5)
Henan, Qinghai	HN	101.55ºE	34.60ºN	6	0.80000	0.00138	H3(1),H4(3),H6(1),H21(1)
Maqing, Qinghai	MQ	100.21ºE	34.47ºN	9	0.83333	0.00128	H1(1),H3(1),H4(4),H6(1),H7(1),H9(1)
Jiuzhi, Qinghai	JZ	101.37ºE	33.39ºN	10	0.73333	0.00202	H4(5),H5(2),H22(2),H23(1)
Dari, Qinghai	DR	99.60ºE	33.75ºN	10	0.66667	0.00066	H1(1),H4(6),H6(1),H24(1),H25(1)
Banma, Qinghai	BM	100.71ºE	33.05ºN	10	0.88889	0.00229	H1(2),H3(3),H4(2),H24(1),H26(1),H27(1)
Yushu, Qinghai	YS	96.73ºE	33.13ºN	7	0.90476	0.00173	H3(2),H7(1),H28(1),H29(2),H30(1)
Chenduo, Qinghai	CD	97.44ºE	33.11ºN	8	0.96429	0.00357	H7(1),H12(1),H15(1),H19(1),H22(1),H31(1),H32(2)
Nangqian, Qinghai	NQ	96.14ºE	32.44ºN	11	0.98182	0.00316	H3(1),H6(1),H9(1),H12(1),H13(1),H32(1),H33(1), H34(2),H35(1),H36(1)
Tianzhu, Gansu	TZ	102.60ºE	36.94ºN	8	0.92857	0.00177	H3(2),H7(1),H30(1),H37(1),H38(2),H39(1)
Xiahe, Gansu	XHE	102.47ºE	35.15ºN	11	0.96364	0.00201	H1(2),H3(1),H4(2),H7(1),H13(1),H40(1),H41(1), H42(1),H43(1)
Hezuo, Gansu	HZU	102.91ºE	35.06ºN	9	0.91667	0.00142	H1(2),H2(1),H3 (2),H4(2),H7(1),H44(1)
Maqu, Gansu	MQU	102.64ºE	34.10ºN	10	0.95556	0.00209	H4(2),H5(1),H24(1),H30(1),H35(1),H45(1),H46(2), H47(1)
Diebu, Gansu	DB	103.89ºE	34.23ºN	9	0.97222	0.00362	H1(1),H4(1),H7(1),H14(1),H16(2),H26(1),H31(1), H48(1)
Shiqu, Sichuan	SQ	97.52ºE	33.15ºN	9	0.69444	0.00092	H1(2),H4(5),H24(1),H49(1)
Ruoergaijiangzha, Sichuan	REG1	103.18ºE	33.60ºN	10	0.86667	0.00198	H3(3),H4(3),H11(1),H21(1),H34(1),H50(1)
Ruoergaibasi, Sichuan	REG2	102.78ºE	34.14ºN	9	0.91667	0.00229	H1(1),H4(3),H7(1),H36(1),H42(1),H51(1),H52(1)
Ruoergaibanyou, Sichuan	REG3	103.10ºE	33.59ºN	9	0.86111	0.00335	H1(2),H2(2),H16(3),H44(1),H46(1)
Aba, Sichuan	AB	101.85ºE	32.92ºN	5	0.70000	0.00149	H14(1),H16(1),H53(3)
Hongyuan, Sichuan	HOY	102.33ºE	32.65ºN	4	0.50000	0.00083	H24(1),H54(3)
Total				301	0.90800	0.00257

DNA extraction, amplification and sequencing

Total genomic DNA was extracted from silica gel-dried leaves using the modified CTAB method [31] and used as template in the polymerase chain reaction (PCR). A preliminary universal primer scans of chloroplast DNA genomes were performed on 10 individuals from 10 different populations using 5 pairs of primers. Primers a and f of Taberlet et al. (1991) [32] were used to amplify trnT-trnF region and sequenced with primers a, c and f. The other four regions psbB-psbH, rpl20–5′rps12, trnS-trnG and psbA-trnH were amplified and sequenced using primers described in Hamilton (1999) [33]. Primer used to amplify trnS-trnG, showed different sequences within the 10 individuals tested, and were used thereafter for the large-scale survey of haplotype variation within S.tetraptera. Polymerase chain reaction (PCR) was performed in a 25µLvolume, containing 10–40 ng plant DNA, 50 mMTris-HCl, 1.5 mM MgCl₂, 250µg/mL BSA, 0.5 mM dNTPs, 2µMof each primer, and 1 unit of Taq polymerase. Initial template denaturation was programmed at 95°C for 5 min, followed by 35 cycles of 94°C for 1 min, 52°C for 1 min, and 72°C for 1 min plus a final extension of 72°C for 7 min. The BioDev Gel Extraction System B Kit (BioDev-Tech) was used to purify all successfully amplified DNA fragments. The PCR primers trnS and trnG were adopted to perform sequencing reactions using ABI Prism Bigdye™ Terminator Cycle Sequencing Ready Reaction Kit. The all trnL-trnF sequences of S.tetraptera used in this study were quoted from Yang et al.,(2011)[30].

Proofreading and Alignment of DNA, and Data Analysis

We used BioEdit v 7.0.9.0 [34] software for manual proofreading and examining the variable sites. And then we used MEGA v 7.0 [35] software to get rid of low quality sequences and we only used high quality sequences for subsequent analysis. The sequencematrix-1.8 was used to combine the trnL-trnF and trnS-trnG gene segments, and the combined cpDNA gene segment was used to carry out the subsequent analysis.

Genetic Variation and Genetic Structure Analysis

We used the program ARLEQUIN version 3.11 [36] to calculate the indices of unbiased genetic diversity (Hd) and nucleotide diversity (π) for each population. Differentiations among populations and within populations were calculated by analyses of molecular variance (AMOVA) using ARLEQUIN version 3.11.

The program PERMUT was used to calculate total gene diversity (H_T), average gene diversity within populations (H_S), G_ST (coefficient of genetic variation over all populations) and N_ST (equivalent coefficient taking into account sequences similarities between haplotypes) [37]. A comparison was made between G_ST and N_ST using a permutation test with 1000 permutations. The phylogeographic structure appear when the N_ST value is significantly larger than G_ST value.

Phylogenetic Analysis

We used DnaSP v5.10 to identify haplotypes of the cpDNA genes. Haplotype distribution was visualized by GenGIS [38]. The program NETWORK 4.5.1.0 (available at http:⁄ ⁄www.fluxus-engineering.com) was used to construct the haplotype network based on Median Joining (MJ) method [39] and MP calculation [40].

In this study, phylogenetic tree was constructed based on Bayesian inference (BI) [41] and maximum parsimony (MP). Mafft 7.205 software was used to compare the cpDNA sequences and remove the irregular sequences at both ends [42]. Before building BI tree, PAUP and MrModeltest are jointly run through MrMTgui. AkaikeInformation Criterion (AIC) results showed that the best model for BI analysis is GTR + I + G, with random tree as starting tree. Start with four Markov chains, namely three hot chains and one cold chain, save one tree every 100 generations, calculate 9 000 000 generations in total, discard the first 25% preheated (Burin-in) tree, and use the remaining tree to calculate the Bayesian posterior probability of the consistent tree and each branch (PP, Posteriorpssibility). PAUP 4.0b10 software was used to construct MP tree [43].

Demographic history

Mismatch distribution and neutral test for the existing populations of S.tetraptera were conducted with DnaSP Ver.5.10 [44] software. If the result of mismatch distribution is unimodal, it indicates that the population may have experienced recent expansion. If it is multiple peaks, it means that the population size is relatively stable and in individual equilibrium in a long time [45]. For the neutral test, two infinite mutation-site model indices such as Tajima’s D, Fu's Fs [46–48] were selected to predict the nature of sequence evolution and possible population history dynamics. Negative values of two indices indicate that the population may have undergone recent expansion or selective sweep. Positive values of two indices indicate that populations may have been geographically isolated for a long time and mutation differences between populations were accumulated or controlled by balanced selection [45, 49]. Arlequin Ver. 3.5.2 [36] was used to test the results of mismatch distribution analysis, among which SSD (sum of square deviation) was used to test whether to accept the hypothesis of population rapid expansion recently.

Species Distribution Modeling

A total of 97 sites were collected in this study, covering the known distribution areas of S.tetraptera. The geographic distribution data of S.tetraptera were obtained mainly by the following methods: (1) field investigation (for detailed information, see Table 1); (2) the network data, including the China digital plant herbarium (http://www.cvh.org.cn/), China plants subject database (http://www.plant.csdb.cn/) and the Chinese image library (http://www.plantphoto.cn/). (3) Literature review, including Chinese and English journals, flora of China, flora of local areas, investigation reports of nature reserves, etc. After removing duplicate records using ENMTools program from the same locality which can reduce the influences of autocorrelation, 82 sites of S.tetraptera were applied for the ecological niche modeling (ENM).

Potential distributions of the Last Glacial Maximum (LGM, 0.021–0.018 Ma), current and future (year 2050) of S.tetraptera was predicted using MaxEnt v3.4.1 [50–51]. Three general circulation models (BCC-CSM2-MR, CNRM-CM6-1 & MIROC-ES2L) and four shared socio-economic pathways (SSP1-2.6, SSP2-4.5, SSP3-7.0 & SSP5-8.5) were selected for 12 sets of climate simulation data for the future decade. As for as LGM period, paleo climatic layers simulated by the New Earth System Model of the Max Planck Institute for Meteorology, the Community Climate System Model Version Version 4 (CCSM4; [52]), and the Model for Interdisciplinary Research on Climate Earth System Model (MIROC-ESM; [53]). We downloaded layers for 19 bioclimatic variables (Table S2) of these models plus for the current time (1970–2000) at 2.5 arc-min resolution from the WorldClim website (www.worldclim.org) for the study area. There is a certain correlation between 19 bioclimatic variables (Peterson et al., 2011). Pearson correlation coefficient matrix among 19 contemporary bioclimatic variables was calculated using ENMTools version 1.4.4 [53], and 0.80 was used as the threshold to determine whether the correlation was significant. MaxEnt software version 3.4.1was used for pre-modeling to obtain the percentage contribution of each factor to the model and the analysis result of the Jackknife. By retaining the relatively important factor among the significant correlation factors, the combination of climate factors for model construction was determined [54]. The eventual chose variables that were applied to find changes in the distribution ranges of S.tetraptera were bio4, bio10, bio11, bio14, and bio15.

Feature class and regularization multiplier optimization realized using R program and Kuenm package [55]. Select the minimum value of AICc as the optimal setting and establish the model [54, 56]. LQ and 0.1 were selected as the optimal setting for feature class and regularization multiplier.

The distribution data of S.tetraptera and corresponding bioclimatic variables in each period were imported into Maxent V3.4.1. In parameter setting, the Maximum iterations is set to 5000, and the Subsample method is adopted to run 10 times repeatedly. The default parameters of other parameters were used for MaxEnt, and the final distribution model is obtained by taking average values. The output format of model analysis results is ASCII grid layer, and the value of fitness index is between 0 and 1. Receiver Operating characteristic Curve (ROC) was used to analyze the prediction accuracy. The greater the Area under the ROC curve (AUC), the higher the prediction accuracy of the model [57]. The calculation results of MaxEnt V3.4.1 were imported into DIVA-GIS V7.5, and the Mongolian map layer made by Chinese map was used to limit the analysis scope to China. As threshold rule, we selected the maximum test sensitivity plus specificity logistic threshold, which is very robust with all types of data [58].

cpDNA variations and Haplotype Distributions

Two chloroplast gene fragments (trnL-trnF and trnS-trnG) were applied to analyze 301 individuals from 35 populations of S.tetraptera. The total length of the fragments was 1215 bp, and the lengths of the trnL-trnF, and trnS-trnG regions were 761 and 454 bp, respectively, which included 22 mutation sites (Supplementary Table S1). Due to the uniparental inherited of cpDNA, the two chloroplast gene fragments were combined in the subsequent population genetics analysis.

The distribution of the observed haplotypes in each population was indicated in Table 1 and Fig. 1. In total, 54 haplotypes were detected in the S.tetraptera. The main feature of the 54 haplotype distribution was the absence of a clear geographic structuring. The most common haplotype, H4, was found in 24 of the 35 populations and made up 23.92% of the total sample. Haplotypes H1, H3, H6, H7, H13 and H53 were also common and they were present in 12.96%, 8.64%,5.32%, 6.31%, 4.32% and 3.32% of the individuals, respectively (Fig. 1, Table 1). The rest haplotypes were divided into two classes: (i) rare but widely distributed ones (H5, H8, H9, H10, H11, H12, H13, H14, H15, H19, H21, H22, H24, H26, H29, H30, H31, H32, H34, H35, H36, H38, H42, H44 and H54); (ii) haplotypes belonged to population-specific haplotypes (H17, H18, H20, H23, H25, H27, H28, H33, H37, H39, H40, H41, H43, H45, H47, H48, H49, H50, H51 and H52).

Genetic Diversity and Structure

Unbiased haplotype diversity (Hd) within the 35 populations ranges from 0.18182 to 0.98182, and nucleotide diversity (π) from 0.00015 to 0.00362 (Table 1). NQ population had the highest haplotype diversity (0.98182) and DB population had the highest nucleotide diversity (0.00362). Total gene diversity (H_T) was estimated to be 0.912, and average gene diversity within populations (H_S) was 0.768. The values of N_ST and G_ST were 0.158 and 0.315, respectively. No phylogeographic signal in the haplotype distribution was detected by means of a standard phylogeographic analysis, because N_ST was insignificantly larger than G_ST. Analysis of molecular variance (AMOVA) suggested that most of genetic variation was found within populations (70.51%) as opposed to between populations (29.49%) (Table 2).

Table 2

Results of analysis of molecular variance (AMOVA) of cpDNA sequence data from populations of *S.tetraptera*.
Source of variance	d.f.	Sum of squares	Variance components	Percentage total (%)	Fixation index
Among populations	34	172.445	0.46208	29.49	F_ST=0.29491^★
Within populations	266	293.868	1.10477	70.51
Total	300	466.312	3.80569
Note: Vaule marked by an asterisk(^★) wsa statistically significant at the P < 0.05 level.

Phylogenetic Relationships

We used BI and MP to reconstruct the phylogenetic relationship for cpDNA haplotypes in S.tetraptera. Both the BI and MP phylogenetic trees showed that haplotypes in different populations mixed up in phylogenetic trees and did not cluster separated branch according to population. That is, haplotypes lack a distinct geographic distribution structure and disperse into different clades (Fig. 2 and Fig. 3). And then these shallow divergent cpDNA haplotypes were subjected to a median-joining network. In cpDNA network, haplotypes with high distribution frequency (e.g., H4, H1, and H3) were located in the central positions of individual networks, while population-specific haplotyes with low frequency generally occupied network tips (Figs. 4). Divergence between adjacent cpDNA haplotypes was even shallower, and was usually distinguished by one mutation step (Fig. 4).

Population Dynamics History and Divergence Time

DnaSP software was used to test the neutrality of all populations based on the chloroplast data of S.tetraptera. The results showed that Tajima'D value was − 0.12975 (P < 0.001), Fu 'Fs was − 1.28549 (P < 0.001), that was, all were significant negative values, and the observed mismatch distribution analysis results were single-peak curves (Fig. 5). The results of mismatch distribution analysis were verified by Arlequin software, and the SSD and HRI values were positive and insignificant (SSD = 0.04703, P > 0.05; HRI = 0.004, P > 0.05). The results indicated that the distribution area or quantity of S.tetraptera in its current distribution area had experienced a recent expansion.

Species Distribution Modeling

The average test AUC (area under curve) value was high (0.979 ± 0.007), demonstrating good predictive model performance. According to ENM results, the predicted distribution under present conditions was broadly similar to its observed distribution across the QTP, with the Qinghai and Gansu as the major distribution areas, and dispersed distribution in Sichuan and Xizang (Fig. 6). At the LGM, the predicted distribution range of S.tetraptera would have expanded and moved eastward, occupying most of northwest Sichuan, eastern Qinghai, southern Gansu, southern Shanxi, and eastern Tibetan (Fig. 6). The predicted distribution of S.tetraptera in the future (2050) is extended compared with the present (at most, there is a slight migration to higher altitude) (Fig. 6).

Genetic Diversity of S.tetraptera

In this study, based on the combined chloroplast fragment, we found that at the species level, the Hd (haplotype diversity) and π (nucleotide diversity) of S.tetraptera were 0.908 and 0.0257, respectively. According to statistics, these values were higher than those of other herbaceous plants in Qinghai-Tibetan plateau, such as Rhodiola chrysanthemifolia (Hd = 0.411, π = 0.0025) [59], Notopterygium incisum (Hd = 0.75, π = 0.00086) [22], Meconopsis integrifolia (Hd = 0.8064, π = 0.000144) [60]. Although different studies used different molecular markers, and the environment, biological characteristics and evolutionary history of different species were not completely the same, the biological significance of such indirect comparison still needs further study, but such comparison can intuitively indicate the degree of genetic diversity of S.tetraptera. This indicated that the genetic diversity of S.tetraptera was high. The high level of genetic diversity in S.tetraptera may be caused by the following reasons: First, there is a strong correlation between plant genetic diversity and geographical distribution of species [61–63]. Species with a wide natural range usually contain more genetic diversity. Although S.tetraptera is an annual herb, it has a wide distribution range, which is widely distributed in Qinghai, Gansu, Sichuan and Tibetan of China. It can be predicted that S.tetraptera should have a high level of genetic diversity. This is consistent with the genetic diversity parameters obtained from our analysis of 35 populations. Second, the proportion of endemic haplotypes was high in the populations of S.tetraptera, which may have correspondingly increased the level of diversity within populations. Third, the high genetic diversity of S.tetraptera may be related to its evolutionary history. According to previous study, Swetia originated in the early Miocene of the Tertiary (29.60 Ma), and the differentiation of S.tetraptera was completed in the late Pliocene of Tertiary (3.97 Ma) (Data were presented in supplementary materials). Therefore, prior to the Quaternary ice age, S.tetraptera may have been widely distributed in the Qinghai-Tibet Plateau, which may be one of the reasons for its high genetic diversity. In addition, the uplift of the Qinghai-Tibetan Plateau, Himalayan, Hengduan, Qinling mountains, and the inland dry during the mid-Tertiary period led to the differentiation of plants along the direction of adaptation to alpine dry. During the evolution process, S.tetraptera experienced different environmental climate changes and accumulated more genetic diversity under different environmental conditions for the survival of its own species, so as to adapt to various possible environmental pressures.

Genetic Structure of S.tetraptera

In general, chloroplast data (G_ST=0.315, F_ST=0.2949) indicated a low degree of differentiation among populations of S.tetraptera. This may be caused by sufficiently strong gene flow between populations [64–65]. However, a large number of endemic haplotypes were detected in S.tetraptera and were endemic to most populations, suggesting that gene flow between populations was limited. Although the mechanisms of pollen and seed dispersal in this species are not well understood, previous study has shown that outcrossing and self-crossing existed simultaneously in S.tetraptera [29]. However, for populations that grow in extreme environments, where pollen is scarce, self-pollination is especially important in order to ensure reproduction, which can be confirmed by the special flowers (normal 'open pollinated' flowers and 'closed or cleistogamous' flowers). Therefore, effective gene flow was not responsible for the low level of differentiation between populations in the S.tetraptera. Thus, the rapid intraspecific diversification was one of the reasons for the low degree of differentiation among populations of S.tetraptera. Of course, it is possible that other factors, such as hybridization and/or introgression, contribute to the low level of differentiation between populations in the S.tetraptera, but we do not currently know whether this phenomenon exists in this species. According to our long-term observation in the field, it was found that the distribution of S.tetraptera and Halenia elliptica was sympatric. When plant spacing between sympatric species is less than 10cm, geographical opportunities for hybridization and/or gene introgression between them can be provided. Therefore, considering the genetic structure of the population and the results of close relationship between S.tetraptera and H.elliptica as indicated in previous studies [66–72], it is speculated that there may be hybridization and/or gene introgression between the sympatric species of H.elliptica.

A large number of private haplotypes existed in the current populations of this species, and a small number of widely distributed haplotypes with high frequency occurred among populations, which might reduce the degree of differentiation between populations to some extent. In addition, limited mutation sites were detected in many haplotypes, and adjacent haplotypes were separated by a limited number of mutation steps. Due to the limited number of mutation sites of haplotype generated by cpDNA fragments, obvious results were not obtained when constructing phylogenetic relationship trees of MP and BI. The low mean nucleotide diversity among populations further confirmed that the differentiation of the chloroplast haplotypes of this species were shallower. Therefore, the lack of phylogeographic structure in S.tetraptera might be the consequence of the low occurrence frequency, scattered distribution and shallow differentiation of the endemic haplotypes, which was similar to the Saxifraga sinomontana [60], Rhodiola chrysanthemifolia [59], Rhodiola alsia [16], Stellera chamaejasme[73], Potentilla glabra [21].

Glacial refugia of S.tetraptera

A summarization of the QTP alpine species investigated to date shows that ‘‘contraction/recolonization’’ hypothesis, ‘‘platform refugia/local expansion’’ hypothesis and ‘‘microrefugia’’ hypothesis are three main phylogeographic patterns of plant species on the QTP during the Quaternary glaciations (Gao et al., 2016; Qiu et al., 2011). In this study, a lot of private haplotypes are dispersed across the distribution range of S. tetraptera, and populations with high genetic diversity demonstrate an even distribution. Meanwhile, ancestral (H4) and unique haplotypes are appeared in the same population (Table 1, Fig. 1). All these results intensively suggest the existence of multiple micro-refugia of S.tetraptera in QTP. This result was also supported by the ENM. At the LGM, the range of S.tetraptera showed a dispersed distribution in the QTP (Fig. 5). As a matter of fact, taking into consideration of the topological heterogeneity in QTP and rejecting the claim that there was no unified ice sheet in QTP during the ice age (Seong et al. 2008), it was likely that suitable microenvironments existed for cold-tolerant herbs (Gao et al.,2016), to survive glaciations in situ. However, both mismatch distribution analysis and neutrality tests showed a recent expansion across the distribution range of S. tetraptera. As mentioned above, combined the present genetic structure of S.tetraptera with the fact of low dispersal ability of seeds, provided evidence object to extensive horizontal range expansion across the distribution range of S.tetraptera. Therefore, the discovered expansion signal possibly represented demographic expansion or altitudinal migration in response to repeated glacier developments and retreats, which was also detected in Saxifraga sinomontana (Li et al., 2018), Rhodiola chrysanthemifolia (Gao et al., 2016), Potentilla fruticose (Shimono et al., 2010) and P. glabra (Wang et al., 2009). As a result, we have speculated that S.tetraptera had a continuous distribution in the QTP region before the Quaternary glaciations, and had some widespread haplotypes (such as H4, H3) across its distribution range. After the repeated development and withdrawal of the glaciers during the Quaternary, the distribution range of S.tetraptera may have fragmented into isolated patches, finally facilitating in situ allopatric divergence. Since then, owing to the bottleneck effects and genetic drifts during the Quaternary glaciations, S.tetraptera has lost some of the ancient genetic structure in some degree, producing a large number of unique haplotypes, which eventually forms the existing genetic structure.

Our study implies current geographic and genetic distribution of S.tetraptera is likely to have been shaped by both QTP uplift and Quaternary periods. Multiple micro-refugia of S.tetraptera were existed during Quaternary glaciations. Rapid intraspecific diversification and hybridization and/or introgression may have played a vital role in shaping current distribution patterns of S.tetraptera. The distribution range of S.tetraptera appeared to have experienced expansion during the LGM; in the future, when the global climate becomes warmer with rising carbon dioxide, the distribution of S.tetraptera will expansion and migration to higher altitude.

Acknowledgements

We thank Hechun Liu, Xueli Zhou for material collection and experiment assistance.

Authors’ contributions

YL conceived and designed the research, performed the experiments, data analyses and wrote the article. ZG revised the article. All authors read and approved the final article

Funding

This research was funded by the Second Tibetan Plateau Scientific Expedition and Research Program (No.2019QZKK1003) and Key deployment project of Chinese Academy of Sciences (No. ZDRW-ZS-2020).

Availability of data and materials

All data generated or analysed during this study are included in this published article and its supplementary information files. The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Ethics and consent to participate

The authors declared that experimental research works on the plants described in this paper comply with institutional, national and international guidelines. Field studies were conducted in accordance with local legislation and get permissions from provincial department of forest and grass of Gansu, Qinghai and Sichuan province. Voucher specimens of all populations were deposited at the herbarium of the QTPMB (Qinghai-Tibetan Plateau Museum of biology), Xining, Qinghai Province, China.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Zhang YL, Li B, and Zheng D. A discussion on the boundary and area of the Tibetan Plateau in China. Geogr Res. 2002; 21:1–10.
Zheng D, Yao TD. Progress in research on formation and evolution of Tibetan Plateau with its envir onment and resource effects. Chin Basic Sci. 2004; 2:15–21.
Huang W, Ji H. Discovery of Hipparion of fauna in Xizang. Chinese Sci Bull. 1979; 24:885–885.
Spicer RA, Harris NBW, Widdowson M, et al. Constant elevation of southern Tibet over the past 15 million years. Nature. 2003; 421:622–624.
Myers N, Mittermeier RA, Mittermeier CG, da Fonseca GAB& Kent J. Biodiversity hotspots for conservation priorities. Nature. 2000; 403:853–858.
Wu SG, Yang YP, Fei Y. On the flora of the alpine region in the Qinghai-Xizang Plateau. Acta Bot Yunnan. 1995; 17:233–25.
Qiu ZD, Li CK. Evolution of Chinese mammalian faunal regions and elevation of the Qinghai-Xizang (Tibet) Plateau. Sci China Ser D. 2005; 48: 1 246-1 258.
Fan ZX, Liu SY, Liu Y, Zhang XY, Yue BS. How Quaternary geologic and climatic events in the southeastern margin of the Tibetan Plateau influence the genetic structure of small mammals: inferences from phylogeography of two rodents, Neodon irene and Apodemus latronum. Genetica. 2011; 139:339–351.
Avise JC, Nelson WS, Bowen B. W., Walker D. Phylogeography of colonially nesting seabirds, with special reference to global matrilineal patterns in the sooty tern (Sterna fuscata). Mol Ecol. 2000; 9:1783–1792.
Petit RJ, Pineau E, Demesure B, Bacilieri R, Ducousso A, and Kremer A. Chloroplast DNA footprints of postglacial recolonization by oaks. Proc Natl Acad Sci. 1997; 94:9996–10001.
Abbott RJ, Brochmann C. 2003. History and evolution of the arctic flora: in the footsteps of Eric Hulten. Mol Ecol. 2003; 12:299–313.
Hewitt GM. Genetic consequences of climatic oscillations in the Quaternary. Philos T R Soc B. 2004; 359:183–195.
Petit RJ, Hampe A, and Cheddadi R. Climate changes and tree phylogeography in the Mediterranean. Taxon. 2005; 54:877–885.
Soltis DE, Morris AB, McLachlan JS, Manos PS, Soltis PS. Comparative phylogeography of unglaciated eastern North America. Mol Ecol. 2006; 15:4261–4293.
Zhang Q, Chiang TY, George M, Liu JQ, Abbott R J. Phylogeography of the Qinghai-Tibetan Plateau endemic Juniperus przewalskii (Cupressaceae) inferred from chloroplast DNA sequence variation. Mol Ecol. 2005; 14:3513–3524.
Gao QB, Zhang DJ, Duan YZ, Zhang FQ, Li YH, Fu, PC, Chen SL. Intraspecific divergences of Rhodiola alsia (Crassulaceae) based on plastid DNA and internal transcribed spacer fragments. Bot J Linn Soc. 2012; 168: 204–215.
Meng LH, Yang R, Abbott RJ, Miehe G, Hu TH, Liu JQ. Mitochondrial and chloroplast phylogeography of Picea crassifolia Kom. (Pinaceae) in the Qinghai-Tibetan Plateau and adjacent highlands. Mol Ecol. 2007; 16:4128–4137.
Yuan QJ, Zhang ZY, Peng H, Ge S. Chloroplast phylogeography of Dipentodon (Dipentodontaceae) in southwest China and northern Vietnam. Mol Ecol. 2008; 17:1054–1065.
Zheng W, Wang LY, Meng LH, Liu JQ. Genetic variation in the endangered Anisodus tanguticus (Solanaceae), an alpine perennial endemic to the Qinghai-Tibetan Plateau. Genetica. 2008; 132:123–129.
Wang L, Abbott R J, Zheng W, Chen P, Wang YJ, Liu JQ. History and evolution of alpine plants endemic to the Qinghai-Tibetan Plateau: Aconitum gymnandrum (Ranunculaceae). Mol Ecol. 2009a; 18:709–721.
Wang LY, Ikeda H, Liu TL, Wang YJ, Liu JQ. Repeated Range Expansion and Glacial Endurance of Potentilla glabra (Rosaceae) in the Qinghai-Tibetan Plateau. J Integr Plant Biol. 2009b; 51:698–706.
Shahzad K, Jia Y, Chen F-L, Zeb U, Li Z-H. Effects of Mountain Uplift and Climatic Oscillations on Phylogeography and Species Divergence in Four Endangered Notopterygium Herbs. Front Plant Sci. 2017; 8; e01929.
Chen SY, Wu GL, Zhang DJ, Gao QB. Molecular phylogeography of alpine plant Metagentiana striata (Gentianaceae). J Syst Evol. 2008a; 46:573–585.
Chen SY, Wu GL, Zhang DJ., Gao QB. Potential refugium on the Qinghai-Tibet Plateau revealed by the chloroplast DNA phylogeography of the alpine species Metagentiana striata (Gentianaceae). Bot J Linn Soc. 2008b; 157:125–140.
Hamrick JL, Godt MJW. Effects of life history traits on genetic diversity in plant species. Philos Trans R Soc Lond B Biol Sci. 1996; 351:1291–1298.
Barrett SCH, Harder LD, Worley AC. The comparative biology of pollination and mating in flowering plants. Philos T Roy Soc B. 1996; 351:1271–1280.
Linhart YB. Variation in woody plants: molecular markers, evolutionary processes and conservation biology. Pp. 341–374 in Jain, S. M., and Minocha, S. C., eds. Molecular Biology of Woody Plants. Kluwer Academic, Dordrecht. 2000.
Yang LC, Li JJ, Zhou GY. Comparative chloroplast genome analyses of 23 species in Swertia L. (Gentianaceae) with implications for its phylogeny. Front Genet. 2022; 13:895146.
Yang LC, Zhou GY, Chen GC. Genetic diversity and population structure of Swertia tetraptera (Gentianaceae), an endemic species of Qinghai-Tibetan Plateau. Biochem Syst Ecol. 2011; 39(4–6):302–308.
Yang LC, Zhou GY, Li CL, Song WZ, Chen GC. Potential refugia in Qinghai-Tibetan Plateau revealed by the phylogeography study of Swertia tetraptera (Gentianaceae). Pol J Ecol. 2011; 59(4):753–764.
Murray MG, and Thompson WF. Rapid isolation of high molecular weight plant DNA. 1980; Pp.4320–4325. Nucleic Acids Research.
Taberlet P, Gielly L, Pautou G, Bouvet J. Universal Primers for Amplification of 3 Noncoding Regions of Chloroplast DNA. Plant Mol Biol. 1991;17:1105–1109.
Hamilton MB. Four primer pairs for the amplification of chloroplast intergenic regions with intraspecific variation. Mol Ecol. 1999; 8:521–523.
Hall TA. BioEdit: A User-Friendly Biological Sequence Alignment Editor and Analysis Program for Windows 95/98/NT. Nucleic acids symp ser. 1999; 41: 95–98.
Kumar S, Nei M, Dudley J, Tamura K. MEGA: A biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief Bioinform. 2008; 9(4):299–306.
Excoffier L, Lischer HEL. Arlequin Suite ver 3.5, a New Series of Programs to Perform Population Genetics Analyses under Linux and Windows. Mol Ecol Resour. 2010; 10: 564–567.
Pons O, and Petit RJ. Measuring and testing genetic differentiation with ordered versus unordered alleles. Genetics. 1996; 144:1237–1245.
Parks DH, Mankowski T, Zangooei S, Porter MS, Armanini DG, Baird DJ, Langille MGI, Beiko RG. GenGIS 2: Geospatial analysis of traditional and genetic biodiversity, with new gradient algorithms and an extensible plugin framework. PLoS One. 2013; 8: 7, e69885.
Bandelt HJ, Forster P, Rohl A. Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999; 16:37–48.
Polzin T, and Daneshmand SV. On Steiner trees and minimum spanning trees in hypergraphs. Oper Res. 2003; 31:12–20.
Ronquist F, Huelsenbeck JP. MrBayes 3: bayesian phylogenetic inference under mixed models. Bioinformatics. 2003; 19:1572–1574.
Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002; 30:3059–3066.
Swofford DL. PAUP*: phylogenetic analysis using parsimony (and other methods), version 4.0 Beta. Sinauer, Sunderland. 2002.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009; 25:1451–1452.
Rogers AR, Harpending H. Population growth makes waves in the distribution of pairwise genetic differences. Molec Biol Evol. 1992; 9:552–569.
Simonsen KL, Churchill GA, Aquadro CF. Properties of statistical tests of neutrality for DNA polymorphism data. Genetics. 1995; 141: 413–429.
Fu YX, Li WH. Statistical tests of neutrality of mutations. Genetics. 1993; 133;693–709.
Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989; 123:585–595.
Slatkin M, Hudson RR. Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics. 1991; 129:555–562.
Phillips SJ, Anderson RP, and Schapire RE. Maximum entropy modeling of species geographic distributions. Ecol Model. 2006; 190, 231–259.
Phillips SJ, and Dudík M. Modeling of species distributions withMaxent: new extensions and a comprehensive evaluation. Ecography. 2008; 31, 161–175.
Gent PR, Danabasoglu G, Donner LJ, Holland MM, Hunke EC, Jayne SR, Lawrence DM, Neale R, Rasch PJ, Vertenstein M, Worley P, Yang ZL, Zhang MH. Community Climate System Model Version 4. J Clim. 2011; 24:4973–91.
Warren DL, Glor RE, Turelli M. Environmental niche equivalency versus conservatism: quantitative approaches to niche evolution. Evolution. 2008; 62:2868–83.
Morales NS, Fernández IC, Baca-González V. MaxEnt's parameter configuration and small samples: are we paying attention to recommendations? A systematic review. PeerJ. 2017; 14;5:e3093.
Cobos ME, Peterson A T, Barve N, Osorio-Olvera L. Kuenm: an R package for detailed development of ecological niche models using Maxent. PeerJ. 2019;7:e6281.
Phillips SJ, Anderson RP, Dudík M, Schapire RE, Blair ME. Opening the black box: an open-source release of Maxent. Ecography. 2017; 40: 887–893.
Hewitt G. The genetic legacy of the Quaternary ice ages. Nature. 2000; 405:907–913.
Poirazidis K, Bontzorlos V, Xofis P, Zakkak S, Xirouchakis, S, Grigoriadou E, Kechagioglou S, Gasteratos I, Alivizatos H, Panagiotopouloui M. Bioclimatic and environmental suitability models for capercaillie (Tetrao urogallus) conservation: Identification of optimal and marginal areas in Rodopi Mountain-Range National Park (Northern Greece). Glob Ecol Conserv. 2017; 17: e00526.
Gao QB, Zhang FQ, Xing R,Gornall, RJ, Fu, PC, Li Y, Gengji ZM, Chen, SL. Phylogeographic study revealed microrefugia for an endemic species on the Qinghai-Tibetan Plateau: Rhodiola chrysanthemifolia (Crassulaceae). Plant Syst Evol. 2016; 302(9): 1179–1193.
Li Y, Gao, QB, Gengji ZM, Jia LK, Wang, ZH, Chen, SL. Rapid Intraspecific Diversification of the Alpine Species Saxifraga sinomontana (Saxifragaceae) in the Qinghai-Tibetan Plateau and Himalayas. Front Genet. 2018; 9: 381.
Hamrick JL, Godt MJW, Murawski DA, Loveless MD. Correlations between species traits and allozyme diversity: implications for conservation biology. In: Genetics and Conservation of Rare Plants (eds Falk DA, Holsinger KE), Oxford University Press, New York; 1991.pp. 75–86.
Nybom H. Comparison of different nuclear DNA markers for estimating intraspecific genetic diversity in plants. Mol Ecol. 2004; 13: 1143–1155.
Aguinagalde I, Hampe A, Martin JP, Duminil J, Petit RJ. Effects of life history traits and species distribution on genetic structure at maternally inherited markers in European trees and shrubs. J Biogeogr. 2005; 32: 329–339.
Oliver C, Hollingsworth PM, Gornall RJ. Chloroplast DNA phylogeography of the arctic-montane species Saxifraga hirculus (Saxifragaceae). Heredity. 2006; 96: 222–231.
Brochmann C, Gabrielsen T M, Nordal I, Landvik JY, Elven R. Glacial survival or tabula rasa? The history of the North Atlantic biota revisited. Taxon. 2003; 52: 417–450.
Shi GR. Cluster analysis for embryological characters of 12 species in Gentianaceae. J. Huaibei Coal Indus. Teach. Colle. 2004; 25 (2);51–55.
Struwe L, Albert VA. Gentianaceae: Systematics and natural history. New York: Cambridge University Press. 2002; 242.
Von Hagen, KB, Kadereit JW. Phylogeny and flower evolution of the Swertiinae (Gentianaceae-Gentianeae): Homoplasy and the principle of variable proportions. Syst Bot. 2002; 27:548–572.
Chassot P, Nemomissa S, Yuan YM, Kupfer P. High paraphyly of Swertia L. (Gentianaceae) in the Gentianella-lineage as revealed by nuclear and chloroplast DNA sequence variation. Plant Syst Evol. 2001; 229 (1–2): 1–21.
Favre A, Yuan Y.M, Küpfer P, Alvarez N. Phylogeny of subtribe Gentianinae (Gentianaceae): Biogeographic inferences despite limitations in temporal calibration points. Taxon. 2010; 59 (6): 1701–1711.
Xi HC, Sun Y, Xue CY. Molecular phylogeny of Swertiinae (Gentianaceae-Gentianeae) based on sequence data of ITS and matK. Plant Divers Res. 2014; 36 (2):145–156.
Cao Q, Xu LH, Wang JL, Zhang FQ, Chen SL. Molecular phylogeny of subtribe swertiinae. Bull Bot Res. 2021; 41 (3): 408–418.
Zhang YH, Volis S, Sun H. Chloroplast phylogeny and phylogeography of Stellera chamaejasme on the Qinghai-Tibet Plateau and in adjacent regions. Mol Phylogenet Evol. 2010; 57(3): 1162–1172

No competing interests reported.

Phylogeography and ecological niche modelling implicate multiple micro-refugia of Swertia tetraptera during Quaternary glaciations

Status:

Journal Publication

Version 1

Abstract

Background

Methods

Results

Conclusion

Figures

Introduction

Materials And Methods

Results

Discussion

Conclusion

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1