Counterbalancing the time-dependence effect on the Human Mitochondrial DNA Molecular Clock

doi:10.21203/rs.2.17533/v3

Methodology article

This work is licensed under a CC BY 4.0 License

Journal Publication

published 29 Jun, 2020

Read the published version in BMC Evolutionary Biology →

Version 3

Background: The molecular clock is an important genetic tool for estimating evolutionary timescales. However, the detection of a time-dependent effect on substitution rate estimates complicates its application. It has been suggested that demographic processes could be the main cause of this confounding effect. In the present study, I propose a new algorithm for estimating the coalescent age of phylogenetically related sequences, taking into account the observed time-dependent effect on the molecular rate detected by others.

Results: By applying this method to real human mitochondrial DNA trees with shallow and deep topologies, I obtained significantly older molecular ages for the main events of human evolution than were previously estimated. These ages are in close agreement with the most recent archaeological and paleontological records favoring the emergence of early anatomically modern humans in Africa 315 ± 34 thousand years ago (kya) and the presence of recent modern humans outside of Africa as early as 174 ± 48 thousand years ago. Furthermore, during the implementation process, I demonstrated that in a population with fluctuating sizes, the probability of fixation of a new neutral mutant depends on the effective population size, which is in better accordance with the fact that under the neutral theory of molecular evolution, the fate of a molecular mutation is mainly determined by random drift.

Conclusions: I suggest that the demographic history of populations has a more decisive effect than purifying selection and/or mutational saturation on the time-dependent effect observed for the substitution rate, and I propose a new method that corrects for this effect.

Evolutionary Biology

Human evolution

molecular clock

mutation rate.

During the last three decades, mitochondrial DNA (mtDNA) variation has played a dominant role in studies of human evolution. Recently, the analysis of this small molecule has been superseded by the analysis of whole genomes. However, before completely turning the page on this type of analysis, it would be worthwhile to resolve the evident contradictions between mtDNA molecular clock time estimates and those recently proposed on the basis of paleontological and archaeological data.

A key achievement of early mtDNA analyses was the dating of the most recent common female ancestor of all living women to approximately 200 kya and the placement of her origin in Africa [1]. However, hominin fossils and associated Middle Stone Age artifacts from Jebel Irhoud in Morocco have recently been dated to 315 ± 34 kya [2]. These older dates were genetically confirmed in a study of ancient African genomes that estimated modern human divergence to have occurred at 350 to 260 kya [3].

Another controversial milestone of the mtDNA molecular clock is the dating of the dispersal of modern humans out of Africa to 50 to 70 kya based on the coalescence age of macrohaplogroup L3 [4]. This timing is not in agreement with the presence of early modern human remains in the Levant at the Skhul and Qafzeh caves dated to approximately 80-120 kya [5], the presence of Middle Stone Age industries on the Arabian Peninsula with similar dates of approximately 80-130 kya [6–8], the recent discovery of unequivocally modern human teeth dated to 80-120 kya in southern China [9], or the recently reported detection of ancient gene flow from early modern humans to the ancestors of eastern Neanderthals more than 100 kya [10]. Furthermore, the most recent discovery of a Homo sapiens maxilla at Misliya Cave, Israel, dated to 177-194 kya [11], could push the exit of Homo sapiens from Africa back significantly. Curiously, these dates, which fall within the last interstadial of marine isotope stage 7 (MIS-7), are in agreement with the age proposed for ancient African hominin introgression into European Neanderthals [12].

On the other hand, the optically stimulated luminescence (OSL) dating of stratigraphically undisturbed basal stone tool assemblages in Madjedbebe [13] placed the human colonization of Australia at approximately 65 kya with minor age uncertainties of only ± 3-4 kyr. This date is significantly older than the 43-47 kya coalescence age recently estimated from Australian aboriginal mitogenomes [14].

Under the current molecular rate estimates given for different sequences of the human genome and the accepted constancy of the molecular clock over time, all of these older archaeological and fossil dates are evidently in conflict with the analogous molecular estimates. Initially, the molecular coalescence ages were calculated by means of simple statistics such as the still popular rho statistic [15], based on the average number of polymorphisms observed in a set of related sequences. Subsequently, more sophisticated Bayesian-based methods using relaxed clock phylogenies have been implemented [16]. However, the simple and the complex methods, when applied to several outstanding events in human history, have provided similar age timeframes for them [17]. Thus, it could be that all the abovementioned old human dispersals detected in the archaeological record represent failed dispersals that did not contribute to the present-day gene pool of modern humans, or that the molecular clock approach needs additional adjustment.

In this paper, I try to demonstrate that under neutral molecular theory conditions, using an overall mitogenome germ-line mutation rate and taking into account past fluctuations in the effective population size deduced from any tree topology, it is possible to obtain slower molecular substitution rates that are more in line with these fossil-based calibrations.

Application of the new Rho Statistic to the main Events in Human Evolution. To apply the new rho statistic described in the Methods section to real human mtDNA data, a rooted tree showing the relationships of the sampled sequences is necessary. Using coalescent methodology, we could obtain a probabilistic tree. However, in the case of human mtDNA, a thoroughly validated phylogenetic tree [18], constructed using the Network program [19], is already available. In this tree, mutations are placed hierarchically from the tips to the roots, and multiple hits identified by network reticulations have been resolved according to the relative mutation rate of the positions involved [17]. Thus, following this standard, I constructed an African mtDNA genome-based phylogenetic tree using 86 previously published complete mtDNA sequences in which all the main African haplogroups are represented (Figure S1). Likewise, using 142 published complete mtDNA sequences, I constructed a second phylogenetic tree for Australasian-specific haplogroup P (Figure S2). Finally, to test a more recent human colonization, I constructed a third tree including 48 already published complete mtDNA sequences belonging to the Americas-specific haplogroup B2 (Figure S3). Using these trees, I applied the proposed time-dependent-based estimator to calculate the coalescence ages of several essential nodes in human history (Tables S1 to S11).

I found a TMRCA for all the extant human African mtDNAs of 315 801 ± 17 827 years (Table 2 and S2), which is highly compatible with the recent archaeological and paleontological estimations of modern human origin of approximately 315 000 ya[2, 20].

Recently, an early out-of-Africa hypothesis for modern humans carrying haplogroup L3 precursor lineages within a favorable time window of approximately 125 000 ya was proposed [21], which is in accord with the age calculated here for the L3'4 split in Africa of 165 610 ± 12 869 ya, as a lower boundary (Tables 2 and S3). Furthermore, this age timeframe is compatible with the presence of modern humans in the Levant [5] and in China [22] at approximately 100 000 ya.

In the same paper, a return to Africa of basal L3 lineages around 75 kya was also suggested. Again, the age of 112 829 ± 10 622 ya calculated with the method reported here for the L3 African expansion makes this suggestion feasible (Tables 2 and S4). Furthermore, under this new temporal window, the great morphological variability of the Skhul/Qafzeh remains and the corresponding wide range of ages (120-80 kya) could easily fit into the whole molecular period proposed elsewhere [21], beginning with the out-of-Africa expansion of early modern humans and ending with their early return to the same continent carrying basal L3 lineages (125-75 kya). However, it should be noted that a return to Africa from the Arabian Peninsula would also be supported by the dates estimated from the archaeological record of the region.

On the other hand, the TMRCA obtained for Australasian haplogroup P (103 267 ± 10 332 ya) was also in agreement with an early presence of modern humans in Asia (Tables 2 and S6, S7, S8, S9). Thus, this timing indicates a lower boundary for the colonization of the Philippines [23], Sumatra [24] and Australia [13] of approximately 65 000 to 73 000 ya. It should be mentioned that, from the genome sequencing of an Aboriginal Australian [25], it was deduced that Aboriginal Australians are descendants of a human dispersal into Eastern Asia that occurred approximately 62-75 kya.

Finally, the time of human expansion to the American Continent deduced from the haplogroup B2 phylogeny was approximately 37,000 ya (Table 2, and Table S11). This age supports a pre-Clovis occupation of the New World, well before the last glacial maximum.

The mtDNA ages calculated for the main human movements out of Africa applying the method proposed here are in remarkable concordance with those obtained from the most recent paleontological and archaeological records, which were conversely in open conflict with the hitherto most recent mtDNA molecular data [26]. However, as the ages calculated here have wide statistical confidence intervals (Table 2), different models could be adjusted to fall within their timeframes. For example, we might assume an earlier out-of-Africa expansion matching the Misliya maxilla dated to 177-194 kya; then, the Skhul and Qafzeh remains dated to approximately 80-130 kya might signal the return to Africa of the carriers of the basal mtDNA haplogroup L3 lineages, rather than the out-of-Africa expansion of early anatomically modern humans as proposed here. Similarly, the controversial presence of H. sapiens in Java [27] and Sulawesi [28] as early as 120 kya would fit into the age window of Australasian mtDNA haplogroup P (Table 2). Future archaeological discoveries and more precise fossil dating will help to determine the most appropriate model.

It is worth mentioning that a return to Africa of carriers of the basal maternal L3 and paternal E lineages, as we proposed previously [21], dated here to approximately 75 kya, has received strong support from a recent study that provides evidence of Neanderthal sequences present in modern Africans, most likely as the result of the early back-migration of putative Eurasian groups to Africa[29]. Interestingly, there is a possibility of testing whether these Eurasian groups were effectively carriers of the mtDNA L3 and Y-chromosome E lineages because, were this the case, native African groups such as the eastern African Hadza and Sandawe, the southern African Khoesan and the central African pygmies should exhibit comparatively less Neanderthal ancestry introgressed into their genomes than the group that returned to Africa later.

On the other hand, there is also recently published archaeological evidence that shows that modern humans could have colonized the central Siberian Arctic as early as 45 kya [30]. This gives rise to the possibility of an earlier entry into the Americas, as proposed here.

Finally, one must be aware that in addition to its wide uncertainty range, the method proposed here depends on the accuracy of several external assumptions. One is the overall germline mtDNA mutation rate estimated for the studied species. For example, after this article was written, a new paper [32] was published that addresses germline mtDNA variability within humans with an experimental design as rigorous as that used by Rebolledo-Jaramillo et al. [31]. The authors estimated the overall germline mtDNA mutation rate per site per generation to be 4.72 x 10^-7 (95% bootstrap CI: 3.93 – 5.52 x 10^-7), which was approximately 75% higher than the rate used here. Thus, it would be desirable to reach a consensus average rate once more empirical estimates have accumulated. The same applies to the average human generation interval, which is necessary to convert generations into years.

Additionally, this algorithm is highly dependent on the degree to which the tree reflects the demographic history of the analyzed population and therefore on the demographic parameters assumed in the construction of the tree for either phylogenetic [33] or coalescent methods [34].

To a lesser degree, this algorithm is also sensitive to the sample size used to construct the tree to identify nodes that include less frequent lineages that are still present in the extant population. Fortunately, in the case of humans, mtDNA diversity has been exhaustively sampled at both the continental and population levels in recent decades, so the risk of missing rare lineages is very low.

Taking into account the time dependence of the mtDNA evolutionary rate in humans as proposed here and choosing a conservative mtDNA germ-line mutation rate, as experimentally obtained by others [31], has resulted in a significantly slower mtDNA molecular clock in humans in such a way that all the main events of human history dated by paleontological and archaeological methods fit within this new mtDNA temporal scale without the necessity of external node calibrations. Finally, it should be stressed that in my opinion, the applicability of this method to other demographic scenarios and other markers susceptible to being represented in phylogenetic trees, such as the Y-chromosome and the nonrecombining portions of the autosomes, deserves further investigation.

The applied Human Mitochondrial mutation rate. The efficiency of the molecular clock [35] is based on the reliability of several implicit assumptions, such as a) the correctness of the mutation rate point estimate (µ) of the gene under study; b) the constancy with time of the rate of molecular substitution (Ɵ); and c) the rate homogeneity among the different lineages involved in the phylogeny. To address the first point, in this study, I used the full-length mtDNA germ-line mutation rate of 1.3 x 10^-8 (interquartile range, 4.2 x 10^-9 to 4.1 x 10^-8) mutations per site per year (assuming a generation time of 20 years) and its derived rate scalar of one mutation every 4651 years, as estimated by others [31]. This mtDNA mutation rate is approximately ten times lower than the estimates in most pedigree studies, a difference that the authors attribute to their analysis of two tissues, which allowed them to discard somatic heteroplasmies. In this respect, it must be mentioned that second-generation massive sequencing has made possible the direct calculation of the human germline genomic mutation rate, which reduced the phylogenetic mutation rate by half, thus doubling the estimated divergence dates of Africans and suggesting that crucial events in human evolution occurred earlier than suggested previously [36].
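As a rough check on how a per-site rate translates into a rate scalar, the following sketch converts the quoted 1.3 x 10^-8 mutations per site per year into an expected waiting time per mutation for the whole molecule. The genome length of 16,569 bp (the revised Cambridge Reference Sequence) and the function name are assumptions that are not stated in the text. Under them, the result is close to, but not identical with, the quoted 4651 years, presumably because the published per-site rate is rounded.

```python
# Assumption: mitogenome length of 16,569 bp (rCRS); not stated in the text.
MTDNA_LENGTH_BP = 16_569
RATE_PER_SITE_PER_YEAR = 1.3e-8  # germ-line rate quoted from [31]


def years_per_mutation(rate_per_site_per_year: float, n_sites: int) -> float:
    """Expected waiting time, in years, for one mutation anywhere in the molecule."""
    return 1.0 / (rate_per_site_per_year * n_sites)


scalar = years_per_mutation(RATE_PER_SITE_PER_YEAR, MTDNA_LENGTH_BP)
print(f"about {scalar:.0f} years per mutation")
```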

Accounting for the time-dependent effect on the Rate of Molecular Evolution. It is well established that rates of molecular evolution are not constant at interspecific or intraspecific levels [37]. In general, they decline with increasing divergence time, but the rate of decay differs among taxa. This time-dependent pattern has also been observed for human mtDNA in both coding and noncoding regions [38, 39]. Purifying selection on deleterious mutations and mutation saturation have been suggested as the main forces responsible for this rate decay over time [39]. However, the unrealistically large effective population sizes required to explain the long-term persistence of significantly deleterious mutations cast doubt on whether purifying selection alone can explain the observed rate acceleration [40]. It has also been found to be unlikely that the apparent decline in rates over time is due to mutational saturation [39]. Congruently, correcting for the effects of purifying selection and saturation has only slightly modified the mtDNA evolutionary mutation rate, providing molecular times that are still in apparent contradiction with archaeological and paleontological ages [17, 39]. Demographic processes such as serial bottlenecks and expansions have also been proposed to explain the differences in rate estimates over time [41]. It seems evident that some adjustment should be implemented to correct the time dependency of the molecular clock. In this paper, I propose a practical approach for counteracting the time-dependent effect on molecular rate estimates, taking into account tree topologies.

Approaching the Lack of Mutation Rate homogeneity between Lineages. Since some of the earliest molecular analyses, it has been observed that rates of homologous nuclear DNA sequence evolution differ between taxonomic groups [42], which can be extended to mtDNA[43]. Later, significant differences in the rates of molecular evolution between mtDNA human lineages were also detected at the haplogroup level[44–48]. Different relaxed molecular-clock methods have been implemented to incorporate rate variation among lineages[49, 50]. However, the application of these methods to human mtDNA has yielded age estimates for the main milestones of human evolution that are in agreement with previous molecular estimates[51, 52]. In this paper, when distributing the mutations of lineages with significant rate differences within coalescent periods, I used a simple proportionality criterion. I allowed a window of 0 to 5 mutations between sequences within coalescent periods, as it has been demonstrated that under a Poisson distribution, even over an extended period of 10,000 years, lineages that still carry the same mutations as their common ancestor could still exist, in addition to lineages that have accumulated five new mutations, with probabilities higher than 0.05 percent[53]. Another technical problem is the mutation distribution among coalescent periods of the isolated sequences that directly radiate from ancestral nodes. To resolve this issue, I used a weighted distribution calculated by multiplying the number of mutations in the isolates by the number of mutations between internodes in each coalescent period and then dividing the result by the total number of mutations at all the internodes.
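The two numerical ideas in the preceding paragraph can be sketched as follows. The first function evaluates the Poisson probability of a lineage carrying exactly k new mutations after 10,000 years, using the rate scalar of one mutation every 4651 years. The second applies the proportional weighting described for isolated sequences. Function names and the isolate counts are hypothetical, chosen only for illustration.

```python
import math

YEARS_PER_MUTATION = 4651  # rate scalar quoted in the text


def poisson_pmf(k: int, lam: float) -> float:
    """Probability of exactly k events under a Poisson distribution with mean lam."""
    return math.exp(-lam) * lam**k / math.factorial(k)


def allocate_isolate_mutations(isolate_mutations, internode_mutations_by_period):
    """Spread the mutations of an isolated lineage over coalescent periods,
    in proportion to the internode mutations observed in each period."""
    total = sum(internode_mutations_by_period)
    return [isolate_mutations * g / total for g in internode_mutations_by_period]


# Expected number of mutations per lineage over 10,000 years.
lam = 10_000 / YEARS_PER_MUTATION
p_zero = poisson_pmf(0, lam)
p_five = poisson_pmf(5, lam)
print(f"P(0 mutations) = {p_zero:.3f}, P(5 mutations) = {p_five:.3f}")

# Hypothetical isolate with 6 mutations; internode counts of 2, 3 and 1 per period.
shares = allocate_isolate_mutations(6, [2, 3, 1])
print(shares)
```

Both probabilities come out well above the 0.05 percent threshold mentioned in the text, and the allocated shares always add up to the number of mutations in the isolate.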

Effect of Fluctuating Population Size on Mutation Substitution Rates. Within a population, the fixation time of a mutation (forward) or the coalescence time to the most recent common ancestor (backward) is usually calculated by the estimator Ɵ = 4N_eµ (where N_e is the effective population size). For the haploid mtDNA genome, Ɵ equals 2N_efµ (N_ef being the female effective population size). Kimura demonstrated that under strict neutral theory parameters, the rate of substitution equals the mutation rate [54]. However, the same author warned that a clear distinction exists between the mutation rate (µ) and mutation substitution (Ɵ). The former refers to the change in genetic material at the individual level, and the latter refers to that at the population level [54]. Thus, only when N_e is constant across generations, whether small or large, is Ɵ equal to µ. This holds because with large constant sizes, the number of new mutations incorporated into the population (N_eµ) increases, but by the same token, the probability of fixation (1/N_e) decreases. In contrast, with small constant sizes, the number of new mutations decreases, but the probability of fixation of any of the mutations increases to a similar degree. It is widely accepted that N_e has fluctuated greatly during human history and that global exponential population growth has occurred in recent times [41]. In this paper, I have taken into account the changes in population size that occurred backward in time and their influence on the rate of gene substitution. When the population size fluctuates across generations, the probability of fixation of a neutral mtDNA variant (q) is no longer the constant 1/N. It will depend on the difference in the population size of the next generation (N₁) with respect to the initial size (N₀): (see Equation 1 in the Supplementary Files)

For example, if N₁ is twice the size of N₀, q equals 2/N₀, and, on the contrary, if N₁ is half the size of N₀, q equals 1/(2N₀). As a consequence, the rate of substitution for neutral mutations in a population of fluctuating size depends on the change in size between generations: (see Equation 2 in the Supplementary Files)
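The two worked cases above are consistent with a fixation probability of the form q = (N₁/N₀)(1/N₀). The sketch below assumes that form, which is inferred from those cases and not taken from Equation 1 in the Supplementary Files. Combined with N₀µ new mutations entering the population per generation, it gives a substitution rate of µN₁/N₀, which reduces to µ when the size is constant. Function names and numeric values are illustrative only.

```python
import math


def fixation_probability(n0: float, n1: float) -> float:
    """q for a new neutral variant when the size changes from n0 to n1.

    Assumed form (n1 / n0) * (1 / n0), inferred from the worked cases in the text.
    """
    return (n1 / n0) / n0


def substitution_rate(mu: float, n0: float, n1: float) -> float:
    """New mutations entering the population (n0 * mu) times their fixation probability."""
    return n0 * mu * fixation_probability(n0, n1)


mu = 1.0e-4   # illustrative mutation rate per generation
n0 = 1000.0   # illustrative initial population size

q_growth = fixation_probability(n0, 2 * n0)   # population doubles
q_decline = fixation_probability(n0, n0 / 2)  # population halves
print(q_growth, q_decline)

# With a constant size, the substitution rate equals the mutation rate.
print(math.isclose(substitution_rate(mu, n0, n0), mu))
```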

Using a different approach, the dependence on the population size of the substitution rate at neutral genes was already demonstrated for populations with fluctuating sizes and overlapping generations[55].

As human populations have been growing exponentially for several centuries, we should counterbalance this effect from the present-day generation (N_n) going backward in time by inverting the fraction between consecutive generations (N_{n-1}/N_n). Note that this dependence might explain the differences in rate estimates over time observed empirically [38]. I will take this important relationship into consideration for the calculation of Ɵ.

A new Rho Statistic for estimating Coalescent Ages. Several statistics based on DNA polymorphism exist for estimating the parameter Ɵ. One such statistic, S, the number of segregating sites per nucleotide in a sample of sequences [56], is strongly dependent on the sample size. A second, π, is defined as the average number of nucleotide differences per site in a sample of sequences [57]. These two estimators were used to implement a statistical method for testing the neutral mutation hypothesis [58]. A third statistic, rho (ρ), is the mean number of nucleotide differences in a sample of sequences compared to their common ancestral type [15]. This last statistic is calculated from a rooted phylogenetic tree showing the relationships of the sampled sequences. The accuracy of molecular dating with the rho statistic has been questioned by some because it yields downward-biased date estimates, large asymmetric variances and a strong dependency on demographic factors [59], but it is defended by others [60]. Regardless, it is still a commonly used method for dating intraspecific mtDNA divergence events in humans. Although the distribution of pairwise nucleotide site differences between individuals has been used to detect episodes of population growth and decline [61], none of the abovementioned statistics considers the past demography of the sample in its age estimates. In this paper, I propose the use of a modified rho (ρ_m) that, by taking into account the coalescent genealogical structure, significantly improves molecular date estimation for key events in human history based on mtDNA genome data. To make explicit my modifications to the classical rho, I have depicted a real genealogy constructed from five lineages (a) and an idealized star-like phylogeny of the same five lineages (b) in Figure 1, assuming exponential population growth shortly after a severe bottleneck [62].
The number of lineages sampled is represented by n_i; t_i represents the time periods defined by progressive coalescent events from the tips to the most recent common ancestor (TMRCA) root; i represents the number of independent lineages remaining after successive coalescences; ϒ_i is the number of mutations accumulated during each coalescent period; and m_i is the number of mutations accumulated along each lineage. Mutations in the star-like phylogeny are distributed into periods following the pattern found in the real phylogeny. As the accumulation of mutations along each lineage is an individual process driven by the mutation rate, µ, and distributed as independent Poisson processes, ρ, the average number of mutations per lineage, has the same value irrespective of the topology. However, because the lineages in the real tree are not independent owing to their shared genealogy, the rate of variance decay is much slower (1/log n) than in the independent star-like tree (1/n) [63]. As a consequence, for the rho calculation, mutations within lineages in the star-like phylogeny are counted only once. In contrast, in the real phylogeny, only mutations occurring at the tips are counted once, while mutations in subsequent coalescent periods going backward to the MRCA node are counted as many times as the number of periods to which they belong. For this reason, mutations in the older periods are overrepresented in the rho calculation. Because of lineage independence, star-like phylogenies are statistically optimal for calculating the ρ and π estimators. Under this topology, as mutations along lineages are counted from the root to the tips in ρ and from tip to tip in π pairwise comparisons, the value of π is twice that of ρ. However, as rho ignores the dependence among lineages existing in the majority of the trees, it is necessary to correct for this dependence.
For this reason, I propose a modified rho statistic (ρ_m) that represents the summation of the classical rho statistics calculated for each coalescent period in the tree: (see Equation 3 in the Supplementary Files)
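The relation between π and ρ noted above for star-like phylogenies can be checked with a few lines of code. This minimal sketch works with absolute mutation counts instead of per-site values and assumes that no mutation recurs on different branches, so that two tips differ by the mutations on both of their branches. The counts and function names are hypothetical.

```python
from itertools import combinations


def rho(mutations_per_lineage):
    """Mean number of mutations separating each sampled lineage from the root."""
    return sum(mutations_per_lineage) / len(mutations_per_lineage)


def pi_star(mutations_per_lineage):
    """Mean pairwise difference in a star-like tree, where two tips differ by
    the mutations accumulated on both of their independent branches."""
    pairs = list(combinations(mutations_per_lineage, 2))
    return sum(a + b for a, b in pairs) / len(pairs)


# Hypothetical numbers of mutations along five independent lineages.
m = [3, 5, 4, 6, 2]
print(rho(m), pi_star(m))
```

Each lineage takes part in n-1 of the n(n-1)/2 pairs, which is why the pairwise mean is exactly twice the root-to-tip mean under this topology.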

This compound Poisson distribution is also Poisson distributed; therefore, the mean and variance are equal, and the standard deviation is the square root of this variance. Thus, uncertainty in the estimates was calculated using the Poisson confidence interval. Another important difference between the real and star-like genealogies is that in the former, the number of lineages decreases as a stepwise function across coalescent periods, while in the latter, the number of lineages is constant until the root is reached. Taking the number of lineages in the sample as an approximation of the effective population size, we should take into account this backward decrease in N_e to improve the estimation of the MRCA age. In an ideal coalescent model, we should have i-1 decreasing population sizes, but in real phylogenies, in addition to bifurcations, there are also multifurcations and lineages with long internal segments without any branching events. Even so, I applied the reverse proportion used to counteract the time-dependent effect on the evolutionary rate to each rho in consecutive periods going backward. That is, the mutation rate, µ, was multiplied in each i-1 period by (i-1)/i, leaving µ as calculated from the germline estimations for the most recent period, comprising the tips of all the lineages sampled. With this method, I obtained a time-dependent scaled mutation rate, Ɵ, that produced human mtDNA intraspecific ages congruent with the archaeologically and paleontologically calibrated nodes representing key events in human history. (see Equation 4 in the Supplementary Files)
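The period-by-period scaling described above can be illustrated with a small sketch. It is not a reproduction of Equations 3 and 4 in the Supplementary Files. It assumes that the rho of a period is its mutation count divided by its number of lineages, and that the rate of each period is the tip rate multiplied by the ratio of lineages in that period to lineages in the next more recent one, applied non-cumulatively. The function name and the input counts are hypothetical.

```python
YEARS_PER_MUTATION = 4651  # rate scalar quoted in the text, used for the tips


def period_ages(mutations_by_period, lineages_by_period,
                years_per_mutation=YEARS_PER_MUTATION):
    """Duration in years of each coalescent period, ordered from tips to root.

    Assumptions (not confirmed by the source equations):
      * rho of a period = mutations in the period / lineages in the period;
      * rate factor of a period = lineages in this period divided by lineages
        in the next more recent period, i.e. (i-1)/i for a bifurcating tree.
    """
    ages = []
    previous_lineages = None
    for mutations, lineages in zip(mutations_by_period, lineages_by_period):
        rho_i = mutations / lineages
        factor = 1.0 if previous_lineages is None else lineages / previous_lineages
        # A slower rate (factor < 1) means more years per mutation.
        ages.append(rho_i * years_per_mutation / factor)
        previous_lineages = lineages
    return ages


# Hypothetical tree: 5, 4, 3 and 2 lineages in successive periods going backward.
mutations = [10, 8, 6, 4]
lineages = [5, 4, 3, 2]

ages = period_ages(mutations, lineages)
corrected_age = sum(ages)
classical_age = sum(m / n for m, n in zip(mutations, lineages)) * YEARS_PER_MUTATION
print(round(classical_age), round(corrected_age))
```

Because every factor for the older periods is below one, the corrected age is always at least as old as the age obtained with a constant rate, which is the direction of the effect reported in the text.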

By performing calculations on the basis of the empirical genealogy (Figure 1a), I obtained an age of 22 277 ± 4 720 years using the standard ρ and an age of 30 396 ± 5 513 years (1.36 times greater) when using the time-dependent Ɵ estimator proposed here (Table 1).

CI: Confidence Interval

Kya: Thousand Years Ago

MIS: Marine Isotope Stage

OSL: Optically Stimulated Luminescence

TMRCA: Time to the Most Recent Common Ancestor

Ethics approval and consent to participate: Not applicable

Consent for publication: Not applicable

Availability of data and materials: All data analyzed during this study are included in this article and its Supplementary Information files.

Competing interests: The author declares that he has no competing interests.

Funding: Not applicable.

Author contributions: VMC is the sole author of this article.

Acknowledgments: I am grateful to my colleagues Ana M. González and José M. Larruga for the ideas they contributed to this work.

Author’s information: The author, VMC, is retired.

Cann RL, Stoneking M, Wilson AC: Mitochondrial DNA and human evolution. Nature 1987, 325:31–36.
Hublin J-J, Ben-Ncer A, Bailey SE, Freidline SE, Neubauer S, Skinner MM, Bergmann I, Le Cabec A, Benazzi S, Harvati K, Gunz P: New fossils from Jebel Irhoud, Morocco and the pan-African origin of Homo sapiens. Nature 2017, 546:289–292.
Schlebusch CM, Malmström H, Günther T, Sjödin P, Coutinho A, Edlund H, Munters AR, Vicente M, Steyn M, Soodyall H, Lombard M, Jakobsson M: Southern African ancient genomes estimate modern human divergence to 350,000 to 260,000 years ago. Science 2017, 358:652–655.
Soares P, Alshamali F, Pereira JB, Fernandes V, Silva NM, Afonso C, Costa MD, Musilová E, Macaulay V, Richards MB, Cerny V, Pereira L: The Expansion of mtDNA Haplogroup L3 within and out of Africa. Mol. Biol. Evol. 2012, 29:915–27.
Grün R, Stringer CB: Electron spin resonance dating and the evolution of modern humans. Archaeometry 1991, 33:153–199.
Armitage SJ, Jasim SA, Marks AE, Parker AG, Usik VI, Uerpmann H-P: The southern route “out of Africa”: evidence for an early expansion of modern humans into Arabia. Science 2011, 331:453–6.
Rose JI, Usik VI, Marks AE, Hilbert YH, Galletti CS, Parton A, Geiling JM, Černý V, Morley MW, Roberts RG: The Nubian Complex of Dhofar, Oman: An African Middle Stone Age Industry in Southern Arabia. PLoS ONE 2011, 6:e28239.
Groucutt HS, Scerri EM, Lewis L, Clark-Balzan L, Blinkhorn J, Jennings RP, Parton A, Petraglia MD: Stone tool assemblages and models for the dispersal of Homo sapiens out of Africa. Quaternary International 2015, 382:8–30.
Liu W, Martinón-Torres M, Cai Y, Xing S, Tong H, Pei S, Sier MJ, Wu X, Edwards RL, Cheng H, others: The earliest unequivocally modern humans in southern China. Nature 2015, 526:696–699.
Kuhlwilm M, Gronau I, Hubisz MJ, de Filippo C, Prado-Martinez J, Kircher M, Fu Q, Burbano HA, Lalueza-Fox C, de la Rasilla M, Rosas A, Rudan P, Brajkovic D, Kucan Ž, Gušic I, Marques-Bonet T, Andrés AM, Viola B, Pääbo S, Meyer M, Siepel A, Castellano S: Ancient gene flow from early modern humans into Eastern Neanderthals. Nature 2016, 530:429–33.
Hershkovitz I, Weber GW, Quam R, Duval M, Grün R, Kinsley L, Ayalon A, Bar-Matthews M, Valladas H, Mercier N, others: The earliest modern humans outside Africa. Science 2018, 359:456–459.
Posth C, Wißing C, Kitagawa K, Pagani L, van Holstein L, Racimo F, Wehrberger K, Conard NJ, Kind CJ, Bocherens H, others: Deeply divergent archaic mitochondrial genome provides lower time boundary for African gene flow into Neanderthals. Nature communications 2017, 8:1–9.
Clarkson C, Jacobs Z, Marwick B, Fullagar R, Wallis L, Smith M, Roberts RG, Hayes E, Lowe K, Carah X, others: Human occupation of northern Australia by 65,000 years ago. Nature 2017, 547:306–310.
Tobler R, Rohrlach A, Soubrier J, Bover P, Llamas B, Tuke J, Bean N, Abdullah-Highfold A, Agius S, O’Donoghue A, others: Aboriginal mitogenomes reveal 50,000 years of regionalism in Australia. Nature 2017, 544:180–184.
Forster P, Harding R, Torroni A, Bandelt H-J: Origin and evolution of Native American mtDNA variation: a reappraisal. American journal of human genetics 1996, 59:935.
Drummond AJ, Ho SYW, Phillips MJ, Rambaut A: Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006, 4:e88.
Soares P, Ermini L, Thomson N, Mormina M, Rito T, Röhl A, Salas A, Oppenheimer S, Macaulay V, Richards MB: Correcting for purifying selection: an improved human mitochondrial molecular clock. The American Journal of Human Genetics 2009, 84:740–759.
Van Oven M, Kayser M: Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum. Mutat. 2009, 30:E386–94.
Bandelt HJ, Forster P, Röhl A: Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 1999, 16:37–48.
Richter D, Grün R, Joannes-Boyau R, Steele TE, Amani F, Rué M, Fernandes P, Raynal J-P, Geraads D, Ben-Ncer A, Hublin J-J, McPherron SP: The age of the hominin fossils from Jebel Irhoud, Morocco, and the origins of the Middle Stone Age. Nature 2017, 546:293–296.
Cabrera VM, Marrero P, Abu-Amero KK, Larruga JM: Carriers of mitochondrial DNA macrohaplogroup L3 basal lineages migrated back to Africa from Asia around 70,000 years ago. BMC Evol. Biol. 2018, 18:98.
Liu W, Martinón-Torres M, Cai Y, Xing S, Tong H, Pei S, Sier MJ, Wu X, Edwards RL, Cheng H, Li Y, Yang X, de Castro JMB, Wu X: The earliest unequivocally modern humans in southern China. Nature 2015, 526:696–9.
Mijares AS, Détroit F, Piper P, Grün R, Bellwood P, Aubert M, Champion G, Cuevas N, De Leon A, Dizon E: New evidence for a 67,000-year-old human presence at Callao Cave, Luzon, Philippines. J. Hum. Evol. 2010, 59:123–32.
Westaway KE, Louys J, Awe RD, Morwood MJ, Price GJ, Zhao J-X, Aubert M, Joannes-Boyau R, Smith TM, Skinner MM, Compton T, Bailey RM, van den Bergh GD, de Vos J, Pike AWG, Stringer C, Saptomo EW, Rizal Y, Zaim J, Santoso WD, Trihascaryo A, Kinsley L, Sulistyanto B: An early modern human presence in Sumatra 73,000-63,000 years ago. Nature 2017, 548:322–325.
Rasmussen M, Guo X, Wang Y, Lohmueller KE, Rasmussen S, Albrechtsen A, Skotte L, Lindgreen S, Metspalu M, Jombart T, Kivisild T, Zhai W, Eriksson A, Manica A, Orlando L, De La Vega FM, Tridico S, Metspalu E, Nielsen K, Ávila-Arcos MC, Moreno-Mayar JV, Muller C, Dortch J, Gilbert MTP, Lund O, Wesolowska A, Karmin M, Weinert LA, Wang B, Li J, Tai S, Xiao F, Hanihara T, van Driem G, Jha AR, Ricaut F-X, de Knijff P, Migliano AB, Gallego Romero I, Kristiansen K, Lambert DM, Brunak S, Forster P, Brinkmann B, Nehlich O, Bunce M, Richards M, Gupta R, Bustamante CD, Krogh A, Foley RA, Lahr MM, Balloux F, Sicheritz-Pontén T, Villems R, Nielsen R, Wang J, Willerslev E: An Aboriginal Australian genome reveals separate human dispersals into Asia. Science 2011, 334:94–8.
Petraglia MD: Trailblazers across Arabia. Nature 2011, 470:50–51.
Westaway KE, Morwood MJ, Roberts RG, Rokus AD, Zhao J, Storm P, Aziz F, van den Bergh G, Hadi P, Jatmiko, de Vos J: Age and biostratigraphic significance of the Punung Rainforest Fauna, East Java, Indonesia, and implications for Pongo and Homo. J. Hum. Evol. 2007, 53:709–17.
Van den Bergh GD, Li B, Brumm A, Grün R, Yurnaldi D, Moore MW, Kurniawan I, Setiawan R, Aziz F, Roberts RG, Suyono, Storey M, Setiabudi E, Morwood MJ: Earliest hominin occupation of Sulawesi, Indonesia. Nature 2016, 529:208–11.
Chen L, Wolf AB, Fu W, Li L, Akey JM: Identifying and Interpreting Apparent Neanderthal Ancestry in African Individuals. Cell 2020, 180:677–687.
Pitulko VV, Tikhonov AN, Pavlova EY, Nikolskiy PA, Kuper KE, Polozov RN: Early human presence in the Arctic: Evidence from 45,000-year-old mammoth remains. Science 2016, 351:260–263.
Rebolledo-Jaramillo B, Su MS-W, Stoler N, McElhoe JA, Dickins B, Blankenberg D, Korneliussen TS, Chiaromonte F, Nielsen R, Holland MM, others: Maternal age effect and severe germ-line bottleneck in the inheritance of human mitochondrial DNA. Proceedings of the National Academy of Sciences 2014, 111:15474–15479.
Zaidi AA, Wilton PR, Su MS-W, Paul IM, Arbeithuber B, Anthony K, Nekrutenko A, Nielsen R, Makova KD: Bottleneck and selection in the germline and maternal age influence transmission of mitochondrial DNA in human pedigrees. Proc. Natl. Acad. Sci. U.S.A. 2019, 116:25172–25178.
Ronquist F, Sanmartín I: Phylogenetic methods in biogeography. Annual Review of Ecology, Evolution, and Systematics 2011, 42:441–464.
Drummond AJ, Rambaut A, Shapiro B, Pybus OG: Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 2005, 22:1185–92.
Zuckerkandl E, Pauling L: Evolutionary divergence and convergence in proteins. In Evolving genes and proteins. Elsevier; 1965:97–166.
Scally A, Durbin R: Revising the human mutation rate: implications for understanding human evolution. Nature Reviews Genetics 2012, 13:745–753.
Molak M, Ho SY: Prolonged decay of molecular rate estimates for metazoan mitochondrial DNA. PeerJ 2015, 3:e821.
Henn BM, Gignoux CR, Feldman MW, Mountain JL: Characterizing the time dependency of human mitochondrial DNA mutation rate estimates. Molecular Biology and Evolution 2009, 26:217–230.
Ho SYW, Phillips MJ, Cooper A, Drummond AJ: Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Molecular Biology and Evolution 2005, 22:1561–1568.
Woodhams M: Can deleterious mutations explain the time dependency of molecular rate estimates? Molecular Biology and Evolution 2006, 23:2271–2273.
Gignoux CR, Henn BM, Mountain JL: Rapid, global demographic expansions after the origins of agriculture. Proceedings of the National Academy of Sciences 2011, 108:6044–6049.
Britten RJ: Rates of DNA sequence evolution differ between taxonomic groups. Science 1986, 231:1393–1398.
Hasegawa M, Kishino H: Heterogeneity of tempo and mode of mitochondrial DNA evolution among mammalian orders. The Japanese Journal of Genetics 1989, 64:243–258.
Torroni A, Rengo C, Guida V, Cruciani F, Sellitto D, Coppa A, Calderon FL, Simionati B, Valle G, Richards M, others: Do the four clades of the mtDNA haplogroup L2 evolve at different rates? The American Journal of Human Genetics 2001, 69:1348–1356.
Maca-Meyer N, González AM, Pestano J, Flores C, Larruga JM, Cabrera VM: Mitochondrial DNA transit between West Asia and North Africa inferred from U6 phylogeography. BMC genetics 2003, 4:15.
Howell N, Elson JL, Turnbull DM, Herrnstadt C: African Haplogroup L mtDNA sequences show violations of clock-like evolution. Mol. Biol. Evol. 2004, 21:1843–54.
Merriwether DA, Hodgson JA, Friedlaender FR, Allaby R, Cerchio S, Koki G, Friedlaender JS: Ancient mitochondrial M haplogroups identified in the Southwest Pacific. Proceedings of the National Academy of Sciences 2005, 102:13034–13039.
Pierron D, Chang I, Arachiche A, Heiske M, Thomas O, Borlin M, Pennarun E, Murail P, Thoraval D, Rocher C, others: Mutation rate switch inside Eurasian mitochondrial haplogroups: impact of selection and consequences for dating settlement in Europe. PLoS One 2011, 6.
Duchêne S, Lanfear R, Ho SYW: The impact of calibration and clock-model choice on molecular estimates of divergence times. Mol. Phylogenet. Evol. 2014, 78:277–89.
Battistuzzi FU, Filipski A, Hedges SB, Kumar S: Performance of relaxed-clock methods in estimating evolutionary divergence times and their credibility intervals. Mol. Biol. Evol. 2010, 27:1289–300.
Fu Q, Mittnik A, Johnson PLF, Bos K, Lari M, Bollongino R, Sun C, Giemsch L, Schmitz R, Burger J, Ronchitelli AM, Martini F, Cremonesi RG, Svoboda J, Bauer P, Caramelli D, Castellano S, Reich D, Pääbo S, Krause J: A revised timescale for human evolution based on ancient mitochondrial genomes. Curr. Biol. 2013, 23:553–9.
Rieux A, Eriksson A, Li M, Sobkowiak B, Weinert LA, Warmuth V, Ruiz-Linares A, Manica A, Balloux F: Improved calibration of the human mitochondrial clock using ancient genomes. Mol. Biol. Evol. 2014, 31:2780–92.
Larruga JM, Marrero P, Abu-Amero KK, Golubenko MV, Cabrera VM: Carriers of mitochondrial DNA macrohaplogroup R colonized Eurasia and Australasia from a southeast Asia core area. BMC Evol. Biol. 2017, 17:115.
Kimura M: The neutral theory of molecular evolution: a review of recent evidence. Jpn. J. Genet. 1991, 66:367–86.
Balloux F, Lehmann L: Substitution rates at neutral genes depend on population size under fluctuating demography and overlapping generations. Evolution 2012, 66:605–11.
Watterson G: On the number of segregating sites in genetical models without recombination. Theoretical population biology 1975, 7:256–276.
Nei M, Li WH: Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. U.S.A. 1979, 76:5269–73.
Tajima F: Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 1989, 123:585–95.
Cox MP: Accuracy of molecular dating with the rho statistic: deviations from coalescent expectations under a range of demographic models. Hum. Biol. 2008, 80:335–57.
Macaulay V, Soares P, Richards MB: Rectifying long-standing misconceptions about the ρ statistic for molecular dating. PLoS ONE 2019, 14:e0212311.
Rogers AR, Harpending H: Population growth makes waves in the distribution of pairwise genetic differences. Mol. Biol. Evol. 1992, 9:552–69.
Slatkin M, Hudson RR: Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics 1991, 129:555–62.
Donnelly P, Tavaré S: Coalescents and genealogical structure under neutrality. Annu. Rev. Genet. 1995, 29:401–21.

Table 1: Coalescence age for the Tree in Figure 1 using the compound rho
Period	Lineages	Mutations	Rho	i/(i + 1)	μ	1/μ	Years
2	2	3	1.50	0.67	1.44 × 10⁻⁴	6944	10 416
3	3	6	2.00	0.75	1.61 × 10⁻⁴	6211	12 422
4	4	2	0.50	0.80	1.72 × 10⁻⁴	5814	2 907
5	5	5	1.00	1.00	2.15 × 10⁻⁴	4651	4 651
Coalescence age for Figure 1 tree: 30 396 ± 5 513
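
The arithmetic behind Table 1 can be retraced with a short Python sketch. It assumes, from the table columns alone, that rho is the number of mutations divided by the number of lineages in each period, and that the years for a period are rho multiplied by 1/μ rounded to whole years. The function and variable names are illustrative and do not come from the article, and the ± 5 513 uncertainty is not reproduced here.

```python
# Rows of Table 1: (lineages, mutations, mu exactly as printed in the table).
TABLE_1_ROWS = [
    (2, 3, 1.44e-4),
    (3, 6, 1.61e-4),
    (4, 2, 1.72e-4),
    (5, 5, 2.15e-4),
]


def period_years(lineages, mutations, mu):
    """Years contributed by one period (assumed: rho times rounded 1/mu)."""
    rho = mutations / lineages          # "Rho" column
    years_per_mutation = round(1 / mu)  # "1/mu" column, in whole years
    return rho * years_per_mutation


def coalescence_age(rows):
    """Sum the per-period years to obtain the compound-rho coalescence age."""
    return sum(period_years(*row) for row in rows)


per_period = [period_years(*row) for row in TABLE_1_ROWS]
total_years = coalescence_age(TABLE_1_ROWS)
print(per_period, total_years)
```

Under these assumptions the per-period values match the "Years" column and their sum matches the coalescence age reported for the Figure 1 tree.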

Table 2: Coalescence age estimates for several human mtDNA-based evolutionary events

Haplogroup split	Evolutionary event	Mean age in years	95% confidence interval
L0/L1'2'5'6'4'3	Most recent African common ancestor	317 814 ya	(352 755-282 873 ya)
L3'4	Out of Africa	165 610 ya	(190 833-140 387 ya)
L3	Return to Africa of L3	112 829 ya	(133 648-92 010 ya)
P	Reaching the Pacific	106 752 ya	(127 003-86 501 ya)
P	Reaching Australia	108 034 ya	(128 406-87 662 ya)
P	Reaching Philippines	111 545 ya	(132 244-90 846 ya)
P	Reaching New Guinea	112 070 ya	(132 818-91 322 ya)
B2	Expansion Americas	37 701 ya	(49 735-25 667 ya)
