Counterbalancing the time-dependence effect on the Human Mitochondrial DNA Molecular Clock

doi:10.21203/rs.2.17533/v2

Methodology article

This work is licensed under a CC BY 4.0 License

Journal Publication

published 29 Jun, 2020

Read the published version in BMC Evolutionary Biology →

Version 2

Background: The molecular clock is an important genetic tool to estimate evolutionary timescales. However, the detection of a time dependency effect on the substitution rate estimates is complicating its application. It has been suggested that demographic processes could be the main cause of this confounding effect. In the present study I propose a new algorithm to estimate the coalescent age of phylogenetically related sequences, taking into account the observed time dependency effect on the molecular rate detected by others.

Results: Applying this method to real human mitochondrial DNA trees with shallow and deep topologies, I obtained significantly older molecular ages for the main events of human evolution than previous estimates. These ages agree closely with the most recent archaeological and paleontological records, which favor an emergence of early anatomically modern humans in Africa 315 ± 34 thousand years ago and the presence of recent modern humans out of Africa as early as 174 ± 48 thousand years ago. Furthermore, in the implementation process, I demonstrated that in a population of fluctuating size the probability of fixation of a new neutral mutant depends on the effective population size, which is more in accordance with the fact that, under the neutral theory of molecular evolution, the fate of a molecular mutation is mainly determined by random drift.

Conclusions: I suggest that the demographic history of populations has a more decisive effect than purifying selection and/or mutational saturation on the time dependence effect observed for the substitution rate and propose a new method that corrects for this effect.

Evolutionary Biology

Human evolution

molecular clock

mutation rate

During the last three decades, mitochondrial DNA (mtDNA) variation has played a dominant role in studies of human evolution. Recently, this small molecule has been giving way to the analysis of whole genomes. However, before turning the page, it would be useful to resolve the evident contradictions between the time estimates of the mtDNA molecular clock and those recently proposed from paleontological and archaeological data. A key achievement of early mtDNA analyses was the dating and location of the most recent common female ancestor of all living women around 200 kya in Africa [1]. However, hominin fossils and associated Middle Stone Age artifacts from Jebel Irhoud in Morocco have recently been dated to 315 ± 34 kya [2]. These older dates were genetically confirmed in a study of ancient African genomes that estimated modern human divergence at 350 to 260 kya [3]. Another controversial milestone of the mtDNA molecular clock was the dating, based on the coalescence age of macrohaplogroup L3, of the dispersal of modern humans out of Africa to 50 to 70 kya [4]. This conflicts with the presence of early modern human remains in the Levant at the Skhul and Qafzeh caves dated at about 80-120 kya [5], the presence of Middle Stone Age industries in the Arabian Peninsula with similar dates around 80-130 kya [6–8], the recent discovery in southern China of unequivocally modern human teeth dated to 80-120 kya [9], and the recently reported detection of ancient gene flow from early modern humans into the ancestors of eastern Neanderthals more than 100 kya [10]. Furthermore, the most recent finding of a Homo sapiens maxilla at Misliya Cave, Israel, dated to 177-194 kya [11], could significantly push back the exit of Homo sapiens from Africa. Curiously, these dates, which fall within the last interstadial of marine isotope stage 7 (MIS-7), agree with the age proposed for an ancient African hominin introgression into European Neanderthals [12].
In addition, optically stimulated luminescence (OSL) dating of stratigraphically undisturbed basal stone tool assemblages at Madjedbebe [13] placed the human colonization of Australia around 65 kya, with minor age uncertainties of only ± 3-4 kyr. This date is significantly older than the 43-47 kya coalescence age recently estimated from Australian Aboriginal mitogenomes [14]. Evidently, under the current molecular rate estimates given for different sequences of the human genome, and the accepted constancy of the molecular clock over time, all these older archaeological and fossil dates conflict with their molecular counterparts. At first, molecular coalescence ages were calculated by means of simple statistics such as the still popular rho [15], which uses the average number of polymorphisms observed in a set of related sequences. Subsequently, more sophisticated Bayesian methods using relaxed-clock phylogenies were implemented [16]. However, applying these simple and complex methods to several outstanding events of human history has yielded similar age frames [17]. Thus, either all the above-mentioned old human dispersals detected in the archaeological record represent failed dispersals that did not contribute to the present-day gene pool of modern humans, or the molecular clock approach needs some additional adjustment. In this paper I try to demonstrate that, under the conditions of the neutral theory of molecular evolution, using an overall mitogenome germ-line mutation rate and taking into account past fluctuations in effective population size deduced from any tree topology, it is possible to obtain slower molecular substitution rates that are more in line with these fossil-based calibrations.

Application of the new Rho Statistic to the main Human Evolution Events. To apply this new rho statistic to real human mtDNA data, a rooted tree relating the sampled sequences is needed. Coalescent methodology could provide a probabilistic tree. However, for human mtDNA there is a well-established phylogenetic tree [48], constructed using the Network program [49]. In it, mutations are placed hierarchically from the tips to the root, and multiple hits, identified by network reticulations, have been resolved according to the relative mutation rate of the positions involved [17]. Following this standard, I constructed an African mtDNA genome-based phylogenetic tree from 86 previously published complete mtDNA sequences, in which all the main African haplogroups are represented (Figure S1). Likewise, using 142 previously published complete mtDNA sequences, I constructed a second phylogenetic tree for the Australasian-specific haplogroup P (Figure S2). Finally, to test a more recent human colonization, I constructed a third tree including 48 previously published complete mtDNA sequences belonging to the Americas-specific haplogroup B2 (Figure S3). Using these trees, I applied the proposed time-dependent estimator to calculate the coalescence ages of several essential nodes of human history (Tables S1 to S11). I found a TMRCA for all extant human African mtDNAs of 315 801 ± 17 827 years (Tables 2 and S2), which is highly compatible with the recent archaeological and paleontological estimates of a modern human origin around 315 000 ya [2, 50]. An early exit from Africa of modern humans carrying haplogroup L3 precursor lineages, in a favorable time window around 125 000 ya, has recently been proposed [51]; this is in harmony with the age calculated here for the L3'4 split in Africa, 165 610 ± 12 869 ya, taken as a lower bound (Tables 2 and S3).
Furthermore, this age frame is compatible with the presence of modern humans in the Levant [5] and in China [52] around 100 000 ya. The same paper [51] also suggested a return to Africa of basal L3 lineages over 75 kya. Again, the age for the L3 African expansion calculated with the method reported here, 112 829 ± 10 622 ya, makes this suggestion feasible (Tables 2 and S4). Moreover, within this new temporal window, the great morphological variability of the Skhul/Qafzeh remains and their correspondingly wide range of ages (120-80 kya) could easily fit into the whole molecular period proposed elsewhere [51], beginning with the exit from Africa of early modern humans and ending with their early return to the same continent carrying basal L3 lineages (125-75 kya). Note, however, that a return to Africa from the Arabian Peninsula would also be supported by the dates estimated from the archaeological record of the region. On the other hand, the TMRCA obtained here for the Australasian haplogroup P (103 267 ± 10 332 ya) also agrees with an early presence of modern humans in Asia (Tables 2 and S6 to S9). Thus, it represents a lower bound for the colonization of the Philippines [53], Sumatra [54] and Australia [13] around 65 000 to 73 000 ya. It should be mentioned that the genome sequence of an Aboriginal Australian [55] indicated that Aboriginal Australians descend from a human dispersal into eastern Asia that occurred about 62-75 kya. Finally, the age of human expansion into the American continent, deduced from the haplogroup B2 phylogeny, is around 37,000 ya (Tables 2 and S11). This age supports a pre-Clovis occupation of the New World, well before the last glacial maximum. As the calculated ages have wide statistical confidence intervals (Table 2), different models could be fitted within their ranges.
For example, we might suppose an earlier exit from Africa matching the Misliya maxilla dated to 177-194 kya; then the Skhul and Qafzeh remains, dated around 80-130 kya, might signal the return to Africa of the carriers of the basal mtDNA haplogroup L3 lineages instead of the exit from Africa of early anatomically modern humans proposed here. In the same way, the controversial presence of H. sapiens in Java [56] and Sulawesi [57] as early as 120 kya would fit into the age window of the Australasian mtDNA haplogroup P (Table 2). Surely, future archaeological discoveries and more precise fossil dating will outline the most appropriate model.

Finally, one must be aware that, in addition to its wide uncertainty range, the method proposed here depends on the accuracy of several external assumptions. One of them is the overall germline mtDNA mutation rate estimated for the species studied. For example, after this article was written, a new paper on germline mtDNA variability in humans, with an experimental design as rigorous as the one used by Rebolledo-Jaramillo et al. [19], was published [58]. The authors estimated the overall germline mtDNA mutation rate per site per generation as 4.72 x 10^-7 (95% bootstrap CI: 3.93 – 5.52 x 10^-7), which is about 75% higher than the one used here. Thus, as more empirical estimates accumulate, it would be desirable to reach a consensus average rate. The same applies to the average human generation interval, which is needed to convert generations into years.

This algorithm also depends strongly on the degree to which the tree reflects the demographic history of the population analyzed and, therefore, on the demographic parameters assumed in the construction of the tree by either phylogenetic [59] or coalescent methods [60].

To a lesser degree, this algorithm is also sensitive to the sample size used to construct the tree, because the sample must be large enough to identify nodes that include less frequent lineages still present in the extant population. Fortunately, human mtDNA diversity has been exhaustively sampled in recent decades at both continental and population levels, so the risk of missing rare lineages is very low.

Taking into account the time dependence of the mtDNA evolutionary rate in humans as proposed here, and choosing a conservative mtDNA germ-line mutation rate obtained experimentally by others [19], has significantly slowed the mtDNA molecular clock in humans, in such a way that all the main events of human history dated by paleontological and archaeological methods fit into this new mtDNA time scale without the need for external node calibrations. Finally, it should be stressed that the applicability of this method to other demographic scenarios, and to other markers that can be represented as phylogenetic trees, such as the Y-chromosome and the non-recombining portions of the autosomes, deserves, in my opinion, further investigation.

The Human Mitochondrial mutation rate used. The efficiency of the molecular clock [18] rests on the reliability of several implicit assumptions: a) the correctness of the point estimate of the mutation rate (µ) of the gene under study; b) the constancy over time of the rate of molecular substitution (Ɵ); and c) rate homogeneity among the different lineages involved in the phylogeny. To address the first point, in this study I have used the full-length mtDNA germ-line mutation rate of 1.3 x 10^-8 (interquartile range from 4.2 x 10^-9 to 4.1 x 10^-8) mutations per site per year (assuming a generation time of 20 years) and its derived rate scalar of one mutation every 4651 years, estimated by others [19]. This mtDNA mutation rate is about ten times lower than the estimates of most pedigree studies, which the authors attribute to the fact that, by analyzing two tissues, they could discard somatic heteroplasmies. In this respect, it should be mentioned that second-generation massive sequencing has made possible the direct calculation of the human germ-line genomic mutation rate, which turned out to be half the phylogenetic mutation rate, thus doubling the estimated divergence dates of Africans and suggesting that crucial events in human evolution occurred earlier than previously thought [20].
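The relation between the per-site rate and the rate scalar quoted above can be checked with a short sketch. The genome length of 16,569 bp (the length of the revised Cambridge Reference Sequence) is an assumption that the text does not state, and the function names are illustrative only. Under that assumption the derived scalar falls within a few years of the quoted 4651 years per mutation.

```python
# Figures quoted in the text (germ-line rate from ref. [19]).
RATE_PER_SITE_PER_YEAR = 1.3e-8
YEARS_PER_MUTATION_QUOTED = 4651

# Assumption, not stated in the text: mitogenome length of 16,569 bp.
GENOME_LENGTH_BP = 16_569


def years_per_mutation(rate_per_site_per_year, genome_length_bp):
    """Expected waiting time, in years, for one mutation anywhere in the genome."""
    return 1.0 / (rate_per_site_per_year * genome_length_bp)


def mutations_to_years(mean_mutations, scalar=YEARS_PER_MUTATION_QUOTED):
    """Convert an average number of mutations per lineage into an age in years."""
    return mean_mutations * scalar


derived_scalar = years_per_mutation(RATE_PER_SITE_PER_YEAR, GENOME_LENGTH_BP)
example_age = mutations_to_years(10)  # hypothetical lineage average of 10 mutations

print(f"derived scalar: {derived_scalar:.0f} years per mutation")
print(f"age for 10 mutations per lineage: {example_age:.0f} years")
```

The small gap between the derived and the quoted scalar presumably reflects rounding of the published rate or a slightly different sequence length.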

Accounting for the time-dependence effect on the Rate of Molecular Evolution. It is well known that rates of molecular evolution are constant at neither the interspecific nor the intraspecific level [21]. In general, they decline with increasing divergence time, but the rate of decay differs among taxa. This time-dependent pattern has also been observed for human mtDNA in coding as well as non-coding regions [22, 23]. Purifying selection on deleterious mutations and mutational saturation have been suggested as the main forces responsible for this rate decay over time [23]. However, the unrealistically large effective population sizes required to explain the long-term persistence of significantly deleterious mutations cast doubt on whether purifying selection alone can explain the observed rate acceleration toward the present [24]. It has also been found unlikely that the apparent decline in rates over time is due to mutational saturation [23]. Accordingly, correcting for the effects of purifying selection and saturation has only slightly modified the mtDNA evolutionary rate, yielding molecular times still in apparent contradiction with archaeological and paleontological ages [17, 23]. Demographic processes such as serial bottlenecks and expansions have also been proposed to explain the differences in rate estimates over time [25]. It seems evident that some adjustment is needed to correct for the time dependency of the molecular clock. In this paper, I propose a practical approach to counteract the time dependency effect on molecular rate estimates by taking tree topologies into account.

Approaching Lack of Mutation Rate homogeneity between Lineages. Early molecular analyses already showed that rates of homologous nuclear DNA sequence evolution differ between taxonomic groups [26], an observation later extended to mtDNA [27]. Subsequently, significant differences in rates of molecular evolution between human mtDNA lineages were also detected at the haplogroup level [28–32]. Different relaxed molecular-clock methods have been implemented to incorporate rate variation among lineages [33, 34]. However, the application of these methods to human mtDNA has yielded age estimates for the main milestones of human evolution in agreement with previous molecular estimates [35, 36]. In this paper, when distributing the mutations of lineages with significant rate differences among coalescent periods, I have used a simple proportionality criterion. I allowed a window of 0 to 5 mutations between sequences within coalescent periods, as it has been demonstrated that, under a Poisson distribution, even over an extended period of 10,000 years there could be lineages still carrying the same mutations as their common ancestor and lineages that have accumulated five new mutations, with probabilities higher than 0.05 percent [37]. Another technical problem is how to distribute among coalescent periods the mutations of isolated sequences that radiate directly from ancestral nodes. To resolve this issue, I have used a weighted distribution: the number of mutations in the isolate is multiplied by the number of mutations between internodes in each coalescent period, and the result is divided by the total number of mutations in all the internodes.
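The two criteria described in this section can be sketched as follows. The Poisson check assumes the rate scalar of one mutation every 4651 years quoted earlier, which may differ from the rate used in ref. [37]. The weighted distribution follows the verbal description given here. Function names and the toy numbers are hypothetical.

```python
import math

YEARS_PER_MUTATION = 4651  # rate scalar quoted in the text


def poisson_pmf(k, lam):
    """Probability of exactly k mutations when lam mutations are expected."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def allocate_isolate_mutations(isolate_mutations, internode_mutations_by_period):
    """Split the mutations of an isolated sequence among coalescent periods,
    in proportion to the internode mutations observed in each period."""
    total = sum(internode_mutations_by_period)
    if total == 0:
        raise ValueError("no internode mutations to weight by")
    return [isolate_mutations * m / total for m in internode_mutations_by_period]


# Expected mutations per lineage over 10,000 years at the quoted rate.
expected = 10_000 / YEARS_PER_MUTATION
p_no_mutation = poisson_pmf(0, expected)
p_five_or_more = 1.0 - sum(poisson_pmf(k, expected) for k in range(5))

# Hypothetical isolate with 6 mutations; internodes carry 2, 3 and 5 mutations
# in three successive coalescent periods.
shares = allocate_isolate_mutations(6, [2, 3, 5])

print(f"P(0 mutations) = {p_no_mutation:.3f}")
print(f"P(5 or more mutations) = {p_five_or_more:.3f}")
print("isolate mutations per period:", shares)
```

The shares always add up to the number of mutations in the isolate, so no mutation is lost or counted twice when it is spread over periods.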

Fluctuating Population Size effect on Mutation Substitution Rate.

In a population, the fixation time of a mutation (forward) or the coalescence time to the most recent common ancestor (backward) is usually calculated with the estimator Ɵ = 4N_eµ (where N_e is the effective population size). For the haploid mtDNA genome, Ɵ equals 2N_efµ (where N_ef is the female effective population size). Kimura demonstrated that, under strictly neutral conditions, the rate of substitution equals the mutation rate [38]. However, the same author warned that a clear distinction exists between the mutation rate (µ) and the rate of mutant substitution (Ɵ): the former refers to the change of genetic material at the individual level, and the latter to that at the population level [38]. Thus, Ɵ equals µ only when N_e is constant across generations, whether small or large. This holds because, with a large constant size, the number of new mutations entering the population (N_eµ) increases but, at the same time, the probability of fixation (1/N_e) decreases. Conversely, with a small constant size, the number of new mutations decreases but the probability of fixation of any of them increases to the same extent. It is widely accepted that N fluctuated greatly during human history and that global exponential population growth has been under way in recent times [25]. In this paper, I have taken into account the changes in population size that occurred backward in time and their influence on the rate of gene substitution. When the population size fluctuates across generations, the probability of fixation of a neutral mtDNA variant (q) is no longer the constant 1/N. It depends on the size of the population in the next generation (N₁) relative to the initial size (N₀):

[Please see the supplementary files section to view the equation.]

For example, if N₁ is twice the size of N₀, q equals 2/N₀ and, conversely, if N₁ is half the size of N₀, q equals 1/(2N₀). As a consequence, the rate of substitution for neutral mutations in a population with fluctuating size depends on the change in size between generations:

[Please see the supplementary files section to view the equation.]
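Since the equations themselves are only available in the supplementary files, the following sketch is an assumed reading rather than the published formulas. It takes q = (1/N₀)(N₁/N₀), which reproduces both worked examples in the text, and it takes the substitution rate as the number of new mutations per generation (N₀µ) times q. The value of µ and the function names are hypothetical.

```python
def fixation_probability(n0, n1):
    """Assumed form q = (1/N0) * (N1/N0): the constant-size value 1/N0
    scaled by the relative change in size to the next generation."""
    return (1.0 / n0) * (n1 / n0)


def substitution_rate(mu, n0, n1):
    """Assumed form: N0 * mu new mutations per generation, each fixed with
    probability q, which simplifies to mu * N1 / N0."""
    return n0 * mu * fixation_probability(n0, n1)


mu = 1.0e-6  # hypothetical per-generation mutation rate, for illustration only

q_doubling = fixation_probability(1000, 2000)   # text: q equals 2/N0
q_halving = fixation_probability(1000, 500)     # text: q equals 1/(2*N0)
rate_constant = substitution_rate(mu, 1000, 1000)
rate_growing = substitution_rate(mu, 1000, 2000)

print(q_doubling, q_halving)
print(rate_constant, rate_growing)
```

With a constant size the sketch returns the mutation rate itself, in line with Kimura's result cited above, while a growing population yields a higher rate and a shrinking one a lower rate.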

Using a different approach, the dependence on population size of the substitution rate at neutral genes was already demonstrated for populations with fluctuating sizes and overlapping generations[39].

As human populations have been growing exponentially for several centuries, this effect should be counterbalanced, going backward in time from the present-day generation (N_n), by inverting the fraction between consecutive generations (N_{n-1}/N_n). Note that this dependence might explain the differences in rate estimates over time observed empirically [22]. I take this important relationship into consideration in the calculation of Ɵ.

A new Rho Statistic to estimate Coalescent Ages. Several statistics based on DNA polymorphism exist to estimate the parameter Ɵ. The first, S, the number of segregating sites per nucleotide in a sample of sequences [40], is strongly dependent on the sample size. A second, π, is defined as the average number of nucleotide differences per site between two sequences in a sample [41]. These two estimators were used to implement a statistical method for testing the neutral mutation hypothesis [42]. A third statistic, rho (ρ), is the mean number of nucleotide differences between the sequences of a sample and their common ancestral type [15]. This last statistic is calculated from a rooted phylogenetic tree relating the sampled sequences. The accuracy of molecular dating with the rho statistic has been questioned by some, because it shows downward-biased date estimates, large asymmetric variances and a strong dependency on demographic factors [43], but it has been defended by others [44]. In any case, it is still a commonly used method to date intraspecific mtDNA divergence events in humans. Although the distribution of pairwise nucleotide site differences between individuals has been used to detect episodes of population growth and decline [45], none of the above-mentioned statistics considers the past demography of the sample in its age estimates. In this paper, I propose the use of a modified rho (ρ_m) that, by taking into account the coalescent genealogical structure, significantly improves the molecular dating of key events in human history based on mtDNA genome data. To make the modifications to the classical rho explicit, I have depicted in Figure 1 a real genealogy constructed from five lineages (a) and an idealized star-like phylogeny of the same five lineages (b), assuming exponential population growth shortly after a severe bottleneck [46].
The number of lineages sampled is represented by n_i; t_i are the time periods defined by progressive coalescent events from the tips to the most recent common ancestor (MRCA) root; i represents the number of independent lineages left after successive coalescences; ϒ_i is the number of mutations accumulated during each coalescent period; and m_i is the number of mutations accumulated along each lineage. Mutations in the star-like phylogeny are distributed into periods following the pattern found in the real phylogeny. As the accumulation of mutations along each lineage is an individual process driven by the mutation rate, µ, and the lineages behave as independent Poisson processes, ρ, the average number of mutations per lineage, has the same value irrespective of the topology. However, because the lineages in the real tree are not independent, owing to their shared genealogy, the variance decays much more slowly (1/log n) than in the independent star-like tree (1/n) [47]. Consequently, in the rho calculation, mutations within lineages in the star-like phylogeny are counted only once. In contrast, in the real phylogeny only mutations occurring at the tips are counted once, whereas mutations in successive coalescent periods going backward to the MRCA node are counted as many times as the number of the period to which they belong. For this reason, mutations in the older periods are overrepresented in the rho calculation. Because of lineage independence, star-like phylogenies are statistically optimal for calculating the ρ and π estimators. Under this topology, as mutations along lineages are counted from the root to the tips in ρ and from tip to tip in the pairwise comparisons of π, the value of π is twice that of ρ. However, as rho ignores the dependence among lineages that exists in the majority of trees, it is necessary to correct for this dependence.
For this reason, I propose a modified rho statistic (ρ_m) that is the summation of classical rho statistics calculated for each coalescent period in the tree:

[Please see the supplementary files section to view the equation.]
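As the equation is only available in the supplementary files, the sketch below is one possible reading of the verbal definition, not the published formula: the classical rho of each coalescent period is taken as ϒ_i divided by the number of lineages i present in that period, and ρ_m is the sum over periods. It also checks the statement that π is twice ρ in a star-like phylogeny, with π expressed here as a count of differences per pair rather than per site. All data and names are hypothetical.

```python
from itertools import combinations


def modified_rho(periods):
    """Sum of per-period rho values.

    periods: list of (lineages, mutations) pairs, one per coalescent period,
    where `mutations` is the total accumulated in that period (assumed reading).
    """
    return sum(mutations / lineages for lineages, mutations in periods)


def star_rho_and_pi(mutations_per_lineage):
    """rho and pairwise pi (as mutation counts) for a star-like phylogeny,
    where every lineage descends independently from the root."""
    n = len(mutations_per_lineage)
    rho = sum(mutations_per_lineage) / n
    pairs = list(combinations(mutations_per_lineage, 2))
    # Two tips of a star differ by all mutations on both of their branches.
    pi = sum(a + b for a, b in pairs) / len(pairs)
    return rho, pi


# Hypothetical five-lineage genealogy, ordered from the tips toward the root.
toy_periods = [(5, 10), (4, 4), (3, 3), (2, 2)]
rho_m = modified_rho(toy_periods)

# Hypothetical star-like phylogeny with five lineages.
rho_star, pi_star = star_rho_and_pi([3, 5, 4, 6, 2])

print("modified rho:", rho_m)
print("star-like rho and pi:", rho_star, pi_star)
```

In this reading each mutation contributes only to the period in which it arose, so older periods are not weighted by the number of tips that inherit them.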

This sum of Poisson-distributed terms is also Poisson distributed; therefore, its mean and variance are equal and the standard deviation is the square root of the variance. Thus, the uncertainty in the estimates was calculated using the Poisson confidence interval. Another important difference between the real and star-like genealogies is that in the former the number of lineages decreases as a stepwise function across coalescent periods, whereas in the latter the number of lineages is constant until the root is reached. Taking the number of lineages in the sample as an approximation to the effective population size, this real backward decrease in N_e should be taken into account to improve the estimation of the MRCA age. In an ideal coalescent model there would be i-1 successively decreasing population sizes, but real phylogenies contain, in addition to bifurcations, multifurcations and lineages with long internal segments without any branching event. Even so, I applied to the rho of each consecutive period going backward the inverse proportion used to counteract the time dependence effect on the evolutionary rate. That is, in each i-1 period the mutation rate µ is multiplied by (i-1)/i, while the µ rate calculated from germ-line estimates is left unchanged for the most recent period, which comprises the tips of all the lineages sampled. With this method, I obtained a time-dependent scaled mutation rate, Ɵ, which gave human mtDNA intraspecific ages congruent with the archaeologically and paleontologically calibrated nodes representing key events in human history.

[Please see the supplementary files section to view the equation.]
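The scaling described above can be illustrated with a minimal sketch, again under stated assumptions because the equation itself is only in the supplementary files. Here the per-period rho is ϒ_i divided by the lineages in the period, the tip period keeps the germ-line rate, and each older period has its rate multiplied by the ratio of its lineage count to that of the next more recent period, which is (i-1)/i for a bifurcation. The scaling is applied per period and not cumulatively, which is an interpretation. The genealogy is a hypothetical toy example and does not reproduce Table 1.

```python
# Rate scalar quoted in the text: one mutation every 4651 years.
YEARS_PER_MUTATION = 4651

# Hypothetical toy genealogy, ordered from the tips toward the root.
# Each entry is (lineages in the period, mutations accumulated in the period).
TOY_PERIODS = [(5, 10), (4, 4), (3, 3), (2, 2)]


def constant_rate_age(periods, years_per_mutation=YEARS_PER_MUTATION):
    """Age from the summed per-period rho with an unscaled mutation rate."""
    return sum(m / i for i, m in periods) * years_per_mutation


def time_dependent_age(periods, years_per_mutation=YEARS_PER_MUTATION):
    """Age with the rate of each older period slowed by the backward
    decrease in the number of lineages (assumed non-cumulative scaling)."""
    age = 0.0
    previous_lineages = None
    for lineages, mutations in periods:
        rho_i = mutations / lineages
        # Tip period: unscaled rate. Older periods: rate times lineages/previous.
        scale = 1.0 if previous_lineages is None else lineages / previous_lineages
        # A slower rate means more years per mutation, hence the division.
        age += rho_i * years_per_mutation / scale
        previous_lineages = lineages
    return age


constant_age = constant_rate_age(TOY_PERIODS)
scaled_age = time_dependent_age(TOY_PERIODS)

print(f"constant-rate age: {constant_age:.0f} years")
print(f"time-dependent age: {scaled_age:.0f} years")
print(f"ratio: {scaled_age / constant_age:.2f}")
```

Because every scale factor for an older period is below one, the time-dependent age can never be younger than the constant-rate age for the same mutation counts.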

Performing the calculations on the empirical genealogy (Figure 1a), I obtained an age of 22 277 ± 4 720 years using the standard ρ and an age of 30 396 ± 5 513 years, 1.36 times greater, using the time-dependent Ɵ estimator proposed here (Table 1).

CI: Confidence Interval

Kya: Thousand Years Ago

MIS: Marine Isotope Stage

OSL: Optically Stimulated Luminescence

TMRCA: Time to the Most Recent Common Ancestor

Ethics approval and consent to participate: Not applicable

Consent for publication: Not applicable

Availability of data and materials: All data analyzed during this study are included in this article and its Supplementary Information files.

Competing interests: The author declares that he has no competing interests.

Funding: Not applicable.

Author’s contributions: VMC is the sole author of this article.

Acknowledgements: I am grateful to my colleagues Ana M. González and José M. Larruga for their ideas brought to this work.

Author’s information: The author, VMC, is currently retired.

Cann RL, Stoneking M, Wilson AC: Mitochondrial DNA and human evolution. Nature 1987, 325:31–36.
Hublin J-J, Ben-Ncer A, Bailey SE, Freidline SE, Neubauer S, Skinner MM, Bergmann I, Le Cabec A, Benazzi S, Harvati K, Gunz P: New fossils from Jebel Irhoud, Morocco and the pan-African origin of Homo sapiens. Nature 2017, 546:289–292.
Schlebusch CM, Malmström H, Günther T, Sjödin P, Coutinho A, Edlund H, Munters AR, Vicente M, Steyn M, Soodyall H, Lombard M, Jakobsson M: Southern African ancient genomes estimate modern human divergence to 350,000 to 260,000 years ago. Science 2017, 358:652–655.
Soares P, Alshamali F, Pereira JB, Fernandes V, Silva NM, Afonso C, Costa MD, Musilová E, Macaulay V, Richards MB, Cerny V, Pereira L: The Expansion of mtDNA Haplogroup L3 within and out of Africa. Mol. Biol. Evol. 2012, 29:915–27.
Grün R, Stringer CB: Electron spin resonance dating and the evolution of modern humans. Archaeometry 1991, 33:153–199.
Armitage SJ, Jasim SA, Marks AE, Parker AG, Usik VI, Uerpmann H-P: The southern route “out of Africa”: evidence for an early expansion of modern humans into Arabia. Science 2011, 331:453–6.
Rose JI, Usik VI, Marks AE, Hilbert YH, Galletti CS, Parton A, Geiling JM, Černý V, Morley MW, Roberts RG: The Nubian Complex of Dhofar, Oman: An African Middle Stone Age Industry in Southern Arabia. PLoS ONE 2011, 6:e28239.
Groucutt HS, Scerri EM, Lewis L, Clark-Balzan L, Blinkhorn J, Jennings RP, Parton A, Petraglia MD: Stone tool assemblages and models for the dispersal of Homo sapiens out of Africa. Quaternary International 2015, 382:8–30.
Liu W, Martinón-Torres M, Cai Y, Xing S, Tong H, Pei S, Sier MJ, Wu X, Edwards RL, Cheng H, others: The earliest unequivocally modern humans in southern China. Nature 2015, 526:696–699.
Kuhlwilm M, Gronau I, Hubisz MJ, de Filippo C, Prado-Martinez J, Kircher M, Fu Q, Burbano HA, Lalueza-Fox C, de la Rasilla M, Rosas A, Rudan P, Brajkovic D, Kucan Ž, Gušic I, Marques-Bonet T, Andrés AM, Viola B, Pääbo S, Meyer M, Siepel A, Castellano S: Ancient gene flow from early modern humans into Eastern Neanderthals. Nature 2016, 530:429–33.
Hershkovitz I, Weber GW, Quam R, Duval M, Grün R, Kinsley L, Ayalon A, Bar-Matthews M, Valladas H, Mercier N, others: The earliest modern humans outside Africa. Science 2018, 359:456–459.
Posth C, Wißing C, Kitagawa K, Pagani L, van Holstein L, Racimo F, Wehrberger K, Conard NJ, Kind CJ, Bocherens H, others: Deeply divergent archaic mitochondrial genome provides lower time boundary for African gene flow into Neanderthals. Nature communications 2017, 8:1–9.
Clarkson C, Jacobs Z, Marwick B, Fullagar R, Wallis L, Smith M, Roberts RG, Hayes E, Lowe K, Carah X, others: Human occupation of northern Australia by 65,000 years ago. Nature 2017, 547:306–310.
Tobler R, Rohrlach A, Soubrier J, Bover P, Llamas B, Tuke J, Bean N, Abdullah-Highfold A, Agius S, O’Donoghue A, others: Aboriginal mitogenomes reveal 50,000 years of regionalism in Australia. Nature 2017, 544:180–184.
Forster P, Harding R, Torroni A, Bandelt H-J: Origin and evolution of Native American mtDNA variation: a reappraisal. American journal of human genetics 1996, 59:935.
Drummond AJ, Ho SYW, Phillips MJ, Rambaut A: Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006, 4:e88.
Soares P, Ermini L, Thomson N, Mormina M, Rito T, Röhl A, Salas A, Oppenheimer S, Macaulay V, Richards MB: Correcting for purifying selection: an improved human mitochondrial molecular clock. The American Journal of Human Genetics 2009, 84:740–759.
Zuckerkandl E, Pauling L: Evolutionary divergence and convergence in proteins. In Evolving genes and proteins. Elsevier; 1965:97–166.
Rebolledo-Jaramillo B, Su MS-W, Stoler N, McElhoe JA, Dickins B, Blankenberg D, Korneliussen TS, Chiaromonte F, Nielsen R, Holland MM, others: Maternal age effect and severe germ-line bottleneck in the inheritance of human mitochondrial DNA. Proceedings of the National Academy of Sciences 2014, 111:15474–15479.
Scally A, Durbin R: Revising the human mutation rate: implications for understanding human evolution. Nature Reviews Genetics 2012, 13:745–753.
Molak M, Ho SY: Prolonged decay of molecular rate estimates for metazoan mitochondrial DNA. PeerJ 2015, 3:e821.
Henn BM, Gignoux CR, Feldman MW, Mountain JL: Characterizing the time dependency of human mitochondrial DNA mutation rate estimates. Molecular Biology and Evolution 2009, 26:217–230.
Ho SYW, Phillips MJ, Cooper A, Drummond AJ: Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Molecular Biology and Evolution 2005, 22:1561–1568.
Woodhams M: Can deleterious mutations explain the time dependency of molecular rate estimates? Molecular Biology and Evolution 2006, 23:2271–2273.
Gignoux CR, Henn BM, Mountain JL: Rapid, global demographic expansions after the origins of agriculture. Proceedings of the National Academy of Sciences 2011, 108:6044–6049.
Britten RJ: Rates of DNA sequence evolution differ between taxonomic groups. Science 1986, 231:1393–1398.
Hasegawa M, Kishino H: Heterogeneity of tempo and mode of mitochondrial DNA evolution among mammalian orders. The Japanese Journal of Genetics 1989, 64:243–258.
Torroni A, Rengo C, Guida V, Cruciani F, Sellitto D, Coppa A, Calderon FL, Simionati B, Valle G, Richards M, et al.: Do the four clades of the mtDNA haplogroup L2 evolve at different rates? The American Journal of Human Genetics 2001, 69:1348–1356.
Maca-Meyer N, González AM, Pestano J, Flores C, Larruga JM, Cabrera VM: Mitochondrial DNA transit between West Asia and North Africa inferred from U6 phylogeography. BMC genetics 2003, 4:15.
Howell N, Elson JL, Turnbull DM, Herrnstadt C: African Haplogroup L mtDNA sequences show violations of clock-like evolution. Mol. Biol. Evol. 2004, 21:1843–54.
Merriwether DA, Hodgson JA, Friedlaender FR, Allaby R, Cerchio S, Koki G, Friedlaender JS: Ancient mitochondrial M haplogroups identified in the Southwest Pacific. Proceedings of the National Academy of Sciences 2005, 102:13034–13039.
Pierron D, Chang I, Arachiche A, Heiske M, Thomas O, Borlin M, Pennarun E, Murail P, Thoraval D, Rocher C, et al.: Mutation rate switch inside Eurasian mitochondrial haplogroups: impact of selection and consequences for dating settlement in Europe. PLoS One 2011, 6.
Duchêne S, Lanfear R, Ho SYW: The impact of calibration and clock-model choice on molecular estimates of divergence times. Mol. Phylogenet. Evol. 2014, 78:277–89.
Battistuzzi FU, Filipski A, Hedges SB, Kumar S: Performance of relaxed-clock methods in estimating evolutionary divergence times and their credibility intervals. Mol. Biol. Evol. 2010, 27:1289–300.
Fu Q, Mittnik A, Johnson PLF, Bos K, Lari M, Bollongino R, Sun C, Giemsch L, Schmitz R, Burger J, Ronchitelli AM, Martini F, Cremonesi RG, Svoboda J, Bauer P, Caramelli D, Castellano S, Reich D, Pääbo S, Krause J: A revised timescale for human evolution based on ancient mitochondrial genomes. Curr. Biol. 2013, 23:553–9.
Rieux A, Eriksson A, Li M, Sobkowiak B, Weinert LA, Warmuth V, Ruiz-Linares A, Manica A, Balloux F: Improved calibration of the human mitochondrial clock using ancient genomes. Mol. Biol. Evol. 2014, 31:2780–92.
Larruga JM, Marrero P, Abu-Amero KK, Golubenko MV, Cabrera VM: Carriers of mitochondrial DNA macrohaplogroup R colonized Eurasia and Australasia from a southeast Asia core area. BMC Evol. Biol. 2017, 17:115.
Kimura M: The neutral theory of molecular evolution: a review of recent evidence. Jpn. J. Genet. 1991, 66:367–86.
Balloux F, Lehmann L: Substitution rates at neutral genes depend on population size under fluctuating demography and overlapping generations. Evolution 2012, 66:605–11.
Watterson G: On the number of segregating sites in genetical models without recombination. Theoretical population biology 1975, 7:256–276.
Nei M, Li WH: Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. U.S.A. 1979, 76:5269–73.
Tajima F: Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 1989, 123:585–95.
Cox MP: Accuracy of molecular dating with the rho statistic: deviations from coalescent expectations under a range of demographic models. Hum. Biol. 2008, 80:335–57.
Macaulay V, Soares P, Richards MB: Rectifying long-standing misconceptions about the ρ statistic for molecular dating. PLoS ONE 2019, 14:e0212311.
Rogers AR, Harpending H: Population growth makes waves in the distribution of pairwise genetic differences. Mol. Biol. Evol. 1992, 9:552–69.
Slatkin M, Hudson RR: Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics 1991, 129:555–62.
Donnelly P, Tavaré S: Coalescents and genealogical structure under neutrality. Annu. Rev. Genet. 1995, 29:401–21.
Van Oven M, Kayser M: Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum. Mutat. 2009, 30:E386–94.
Bandelt HJ, Forster P, Röhl A: Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 1999, 16:37–48.
Richter D, Grün R, Joannes-Boyau R, Steele TE, Amani F, Rué M, Fernandes P, Raynal J-P, Geraads D, Ben-Ncer A, Hublin J-J, McPherron SP: The age of the hominin fossils from Jebel Irhoud, Morocco, and the origins of the Middle Stone Age. Nature 2017, 546:293–296.
Cabrera VM, Marrero P, Abu-Amero KK, Larruga JM: Carriers of mitochondrial DNA macrohaplogroup L3 basal lineages migrated back to Africa from Asia around 70,000 years ago. BMC Evol. Biol. 2018, 18:98.
Liu W, Martinón-Torres M, Cai Y, Xing S, Tong H, Pei S, Sier MJ, Wu X, Edwards RL, Cheng H, Li Y, Yang X, de Castro JMB, Wu X: The earliest unequivocally modern humans in southern China. Nature 2015, 526:696–9.
Mijares AS, Détroit F, Piper P, Grün R, Bellwood P, Aubert M, Champion G, Cuevas N, De Leon A, Dizon E: New evidence for a 67,000-year-old human presence at Callao Cave, Luzon, Philippines. J. Hum. Evol. 2010, 59:123–32.
Westaway KE, Louys J, Awe RD, Morwood MJ, Price GJ, Zhao J-X, Aubert M, Joannes-Boyau R, Smith TM, Skinner MM, Compton T, Bailey RM, van den Bergh GD, de Vos J, Pike AWG, Stringer C, Saptomo EW, Rizal Y, Zaim J, Santoso WD, Trihascaryo A, Kinsley L, Sulistyanto B: An early modern human presence in Sumatra 73,000-63,000 years ago. Nature 2017, 548:322–325.
Rasmussen M, Guo X, Wang Y, Lohmueller KE, Rasmussen S, Albrechtsen A, Skotte L, Lindgreen S, Metspalu M, Jombart T, Kivisild T, Zhai W, Eriksson A, Manica A, Orlando L, De La Vega FM, Tridico S, Metspalu E, Nielsen K, Ávila-Arcos MC, Moreno-Mayar JV, Muller C, Dortch J, Gilbert MTP, Lund O, Wesolowska A, Karmin M, Weinert LA, Wang B, Li J, Tai S, Xiao F, Hanihara T, van Driem G, Jha AR, Ricaut F-X, de Knijff P, Migliano AB, Gallego Romero I, Kristiansen K, Lambert DM, Brunak S, Forster P, Brinkmann B, Nehlich O, Bunce M, Richards M, Gupta R, Bustamante CD, Krogh A, Foley RA, Lahr MM, Balloux F, Sicheritz-Pontén T, Villems R, Nielsen R, Wang J, Willerslev E: An Aboriginal Australian genome reveals separate human dispersals into Asia. Science 2011, 334:94–8.
Westaway KE, Morwood MJ, Roberts RG, Rokus AD, Zhao J, Storm P, Aziz F, van den Bergh G, Hadi P, Jatmiko, de Vos J: Age and biostratigraphic significance of the Punung Rainforest Fauna, East Java, Indonesia, and implications for Pongo and Homo. J. Hum. Evol. 2007, 53:709–17.
Van den Bergh GD, Li B, Brumm A, Grün R, Yurnaldi D, Moore MW, Kurniawan I, Setiawan R, Aziz F, Roberts RG, Suyono, Storey M, Setiabudi E, Morwood MJ: Earliest hominin occupation of Sulawesi, Indonesia. Nature 2016, 529:208–11.
Zaidi AA, Wilton PR, Su MS-W, Paul IM, Arbeithuber B, Anthony K, Nekrutenko A, Nielsen R, Makova KD: Bottleneck and selection in the germline and maternal age influence transmission of mitochondrial DNA in human pedigrees. Proc. Natl. Acad. Sci. U.S.A. 2019, 116:25172–25178.
Ronquist F, Sanmartín I: Phylogenetic methods in biogeography. Annual Review of Ecology, Evolution, and Systematics 2011, 42:441–464.
Drummond AJ, Rambaut A, Shapiro B, Pybus OG: Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 2005, 22:1185–92.

Table 1: Coalescence age for the Figure 1 tree using the compound rho

| Period | Lineages | Mutations | Rho | i/(i + 1) | μ | 1/μ | Years |
|---|---|---|---|---|---|---|---|
| | | | 1.50 | 0.67 | 1.44 × 10⁻⁴ | 6944 | 10 416 |
| | | | 2.00 | 0.75 | 1.61 × 10⁻⁴ | 6211 | 12 422 |
| | | | 0.50 | 0.80 | 1.72 × 10⁻⁴ | 5814 | 2 907 |
| | | | 1.00 | | 2.15 × 10⁻⁴ | 4651 | 4 651 |

Coalescence age for the Figure 1 tree: 30 396 ± 5 513 years
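The arithmetic behind Table 1 can be checked with a short sketch. It assumes, from the tabulated values only, that each period contributes rho × (1/μ) years, with 1/μ rounded to whole years, and that the coalescence age is the sum over periods. The function and variable names are illustrative and do not come from the paper, and the ± 5 513 uncertainty is not reproduced here.

```python
# Rows of Table 1 as (rho, mu) pairs, copied from the table.
# Names are hypothetical; only the numbers come from the source.
TABLE_1_ROWS = [
    (1.50, 1.44e-4),
    (2.00, 1.61e-4),
    (0.50, 1.72e-4),
    (1.00, 2.15e-4),
]


def period_years(rho: float, mu: float) -> float:
    """Years contributed by one period: rho times 1/mu.

    Assumption: 1/mu is rounded to whole years before multiplying,
    which is what the tabulated 1/mu column suggests.
    """
    years_per_mutation = round(1.0 / mu)
    return rho * years_per_mutation


def compound_age(rows) -> float:
    """Sum the per-period contributions to get the coalescence age."""
    return sum(period_years(rho, mu) for rho, mu in rows)


if __name__ == "__main__":
    for rho, mu in TABLE_1_ROWS:
        print(f"rho={rho:.2f}  mu={mu:.2e}  years={period_years(rho, mu):.0f}")
    print(f"coalescence age: {compound_age(TABLE_1_ROWS):.0f} years")
```

Under these assumptions the per-period values and their sum agree with the Years column and the reported mean age. The tabulated rates also appear to equal the last row's rate (2.15 × 10⁻⁴) multiplied by the i/(i + 1) factor of each period, but that reading is inferred from the numbers rather than stated in the table.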

Table 2: Coalescence age estimates for several human mtDNA evolutionary events

| Haplogroup split | Evolutionary event | Mean age in years | 95% confidence interval |
|---|---|---|---|
| L0/L1'2'5'6'4'3 | Most recent African common ancestor | 317 814 ya | (352 755–282 873 ya) |
| L3'4 | Out of Africa | 165 610 ya | (190 833–140 387 ya) |
| | Return to Africa of L3 | 112 829 ya | (133 648–92 010 ya) |
| | Reaching the Pacific | 106 752 ya | (127 003–86 501 ya) |
| | Reaching Australia | 108 034 ya | (128 406–87 662 ya) |
| | Reaching the Philippines | 111 545 ya | (132 244–90 846 ya) |
| | Reaching New Guinea | 112 070 ya | (132 818–91 322 ya) |
| | Expansion into the Americas | 37 701 ya | (49 735–25 667 ya) |
