Significance of Linkage Disequilibrium and Epistasis on the Genetic Variances in Non-Inbred and Inbred Populations

doi:10.21203/rs.3.rs-646130/v2


Research Article


This work is licensed under a CC BY 4.0 License


Background The influence of linkage disequilibrium (LD), epistasis, and inbreeding on the genotypic variance continues to be an important area of investigation in genetics and evolution. Although the current knowledge about biological pathways and gene networks implies that epistasis is important in determining quantitative traits, the empirical evidence for a range of species and traits is that the genetic variance is mostly additive. This is confirmed by some recent theoretical studies. However, because these investigations have assumed linkage equilibrium, only additive effects, or simplified assumptions for the two- and high-order epistatic effects, the objective of this investigation was to provide additional information about the impact of LD and epistasis on the genetic variances in non-inbred and inbred populations, using a simulated data set.

Results The epistatic variance in generation 0 corresponded to 1 to 10% of the genotypic variance, with 30% of epistatic genes, but it corresponded to 5 to 45% assuming 100% of epistatic genes. After 10 generations of random cross or selfing the ratio epistatic variance/genotypic variance increased in the range of 15 to 1,079%. The epistatic variances are maximized assuming dominant epistasis, duplicate genes with cumulative effects, and non-epistatic gene interaction. A minimization occurs with complementary, recessive, and dominant and recessive epistasis. In non-inbred populations, the genetic covariances have negligible magnitude compared with the genetic variances. In inbred populations, except for duplicate epistasis, the sum of the epistatic covariances was in general negative and with magnitude higher than the non-additive variances, especially under 100% of epistatic genes.

Conclusions The LD level for genes, even under a relatively low gene density, has a significant effect on the genetic variances in non-inbred and inbred populations. Assuming digenic epistasis, the additive variance is in general the most important component of the genotypic variance in non-inbred and inbred populations. The ratio epistatic variance/genotypic variance is proportional to the percentage of interacting genes and increases with random cross and selfing. In general, the additive x additive variance is the most important component of the epistatic variance. The maximization of the epistatic variance depends on the allele frequency, LD level, and epistasis type.

Epigenetics & Genomics

linkage disequilibrium

epistasis

inbreeding

genetic variances.

The basic knowledge on the genetics of the quantitative traits was provided by RA Fisher [1], including the partition of the genotypic value in effects due to individual genes, allelic interaction (dominance), and non-allelic interaction (epistasis). Further, he also recognized the significance of the linkage phase between genes on the population variance and on the correlation between relatives. The influence of linkage disequilibrium (LD), epistasis, and inbreeding on the genotypic variance continues to be an important area of investigation in genetics and evolution [2–4]. Assuming linkage equilibrium and three to five loci interaction, A Maki-Tanila and WG Hill [4] concluded that most of the genotypic variance is additive, regardless of order of interaction, allele frequencies, and type and magnitude of interaction effects. Another main finding was that the majority of the epistatic variance is due to digenic interactions. Assuming LD, WG Hill and A Maki-Tanila [3] showed that variances are generally higher with positive LD and that the ratio epistatic variance/genotypic variance is largest with negative LD. Both studies showed that the epistatic variance is increased by increasing the heterozygosity. However, this has no impact on the relative magnitude of the epistatic variance because the additive and epistatic variances increase in similar proportions.

Based on the additive model, J Clo, J Ronfort and D Abu Awad [2] showed that assuming stabilizing selection and high mutation rates, self-pollinated populations are able to accumulate genetic variation through negative LD. Using a meta-analysis of quantitative traits heritability, J Clo, L Gay and J Ronfort [5] confirmed previous theoretical and empirical evidence that self-pollinated populations exhibit lower levels of additive variance for quantitative traits. However, the decrease in the additive variance is compensated by the non-additive components of the genotypic variance. Because of negative consequences (inbreeding depression), geneticists agree that inbreeding should be efficiently controlled to maintain adequate genetic diversity in the populations [6, 7]. However, self-pollination has been deliberately used in maize hybrid breeding (currently to a lesser extent due to the doubled-haploid technology). For self-pollinated crops, the development of varieties involves selection over generations of increasing inbreeding. In these populations the inbreeding has an impact on the genetic variances and covariance between relatives [8].

Although the current knowledge about biological pathways and gene networks implies that epistasis is important in determining quantitative traits, the empirical evidence for a range of species and traits is that the genetic variance is mostly additive [9, 10]. Based on theoretical models, WG Hill, ME Goddard and PM Visscher [10] concluded that this occurs because of the high difference in allelic frequencies. They also concluded that, in outbred populations, the detection of epistasis is difficult unless the epistatic effects are large and the gene frequencies are intermediate. TFC Mackay [9] emphasizes that, because epistasis regularly determines quantitative traits, it has consequences for plant and animal breeding, evolutionary biology, and human genetics. Recent studies on genomic selection and GWAS including epistasis have confirmed that most of the genetic variance is additive [11–14]. However, incomplete LD at low marker density can indicate epistasis when the trait determination is purely additive [15].

The most important quantitative genetics theory for modelling epistasis was developed by O Kempthorne [16]. CC Cockerham [17] also provided a significant contribution. If modelling only inbreeding, LD, or epistasis is a difficult task for the quantitative geneticists, jointly modelling the three events is a challenge. An impressive approach for two genes theory in quantitative genetics assuming inbreeding, LD, and epistasis was presented by BS Weir and CC Cockerham [18]. Because of the complexity of the expressions for the genetic variances and covariance between relatives, they concluded that “the result is of little use”. That is, the functions do not allow assessing the influence of LD, epistasis, and inbreeding on the genetic variability and the degree of relationship in the populations. Further, because recent investigations based on theoretical models have assumed linkage equilibrium, only additive effects, or simplified assumptions for the two- and high-order epistatic effects, the objective of this study was to provide additional information about the impact of LD and epistasis on the genetic variances in non-inbred and inbred populations, using a simulated data set.

The analysis of the parametric LD in the populations shows that the LD level depends mainly on the gene density (Additional File Fig. 1). The higher LD level was observed under high gene density (one gene each cM). Regardless of the gene density, the LD level is generally higher for the closest genes. Because the LD is predominantly positive, 10 generations of random cross significantly decreased the LD level of the populations. The decrease was higher for the density of one gene each five cM, regardless of the population (approximately 95% for r², on average). The average r² decrease for the density of one gene each cM was 81%. The LD level showed only a slight decrease after 10 generations of selfing, regardless of the population (approximately 14% for r², on average).

To characterize the magnitude of the genotypic variance components in non-inbred and inbred populations with contrasting LD levels, under no epistasis, we assumed a density of one gene each five cM. In generation 0, the r² in the high LD population is 2,395 times greater than the r² in the low LD population, on average. Compared with the populations with intermediate LD, the r² in the high LD population, generation 0, is 372 and 502 times greater, on average. Because the populations with high and low LD levels have an average allele frequency of 0.5, the decrease in the population mean due to inbreeding and the genotypic and additive variances are maximized, relative to the populations with average allele frequency lower (0.3) or higher (0.7) than 0.5. The same is true for the dominance variance in the non-inbred populations. After 10 generations of selfing, the decreases in the population means were 15 and 17% for the populations with low and high LD level, respectively (Additional File Fig. 2). Regardless of the LD level and the degree of inbreeding, the additive variance is the most important component of the genotypic variance. The LD level has a marked effect on the additive and dominance variances. The additive variance in the population with high LD is 6.8 times greater than the additive variance in the population with low LD in generation 0, 2.9 times greater after 10 generations of random cross, and 5.7 times greater after 10 generations of selfing. Concerning the dominance variance and the covariance between additive and dominance values, there is a lower difference between their magnitudes in the populations with low and high LD levels. In the non-inbred populations, the dominance variance assuming high LD is approximately two times greater than the dominance variance under low LD, regardless of the generation. In the populations with intermediate LD level, the decreases in the population mean due to inbreeding are similar.
In both improved and not improved populations, there was also a significant decrease in the additive variance with random crosses (approximately 60%) and an increase with selfing (approximately 60% too). The additive variance is greater in the not improved population, regardless of the generation. In both populations the additive variance is in general intermediate to the values observed for the populations with high and low LD level. The dominance variance significantly decreased with random cross or selfing, regardless of the level of LD (approximately 12 to 97%).

To characterize the components of the genotypic variance in non-inbred and inbred populations with high LD level, under epistasis, we also assumed the density of one gene each five cM. Regardless of the type of epistasis and the percentage of interacting genes, there are non-significant changes in the population mean along 10 generations of random cross (−0.5 to 0.3%; remember that the average decrease in the r² values was approximately 95%) (Additional File Fig. 3). With 10 generations of selfing, regardless of the percentage of epistatic genes, except for duplicate and dominant epistasis with 100% of interacting genes, the inbreeding decreased the population mean by 2 to 28% (remember that the decrease assuming no epistasis was 17%).

Regardless of the type of epistasis, the ratio epistatic variance/genotypic variance is proportional to the percentage of the epistatic genes. The epistatic variance in generation 0 corresponded to 1 to 10% (dominant epistasis) of the genotypic variance, with 30% of epistatic genes, but it corresponded to 5 to 45% (duplicate epistasis) assuming 100% of epistatic genes (Additional File Figs. 4 to 10). In general, irrespective of the type of epistasis and the percentage of epistatic genes, after 10 generations of random cross or selfing the ratio epistatic variance/genotypic variance increased in the range of 15 to 1,079%. This occurred because the decrease in the genotypic variance was much higher than the decrease in the epistatic variance with random cross. With selfing, this occurred because the increase in the genotypic variance was much lower than the increase in the epistatic variance or because the genotypic variance decreased while the epistatic variance increased. With one exception, regardless of the type of epistasis and the percentage of epistatic genes, the most important component of the genotypic variance is also the additive variance. The additive variance decreased with random cross and increased with selfing. With duplicate epistasis and 100% of epistatic genes, the additive x additive variance was higher than the additive variance, after three generations of selfing. Except for dominant epistasis, duplicate genes with cumulative effects, and non-epistatic genic interaction, the additive variance was 1.1 to 6 times greater assuming 30% of epistatic genes, compared with 100% of epistatic genes, for both random cross and selfing. Assuming dominant epistasis, duplicate genes with cumulative effects, and non-epistatic genic interaction, the additive variance was 1.5 to 2.7 times higher with 100% of interacting genes, compared with 30% of interacting genes.

For the epistatic variances, their magnitudes are much lower than the additive variance (Additional File Figs. 4 to 10). The additive x additive variance is the most important epistatic variance. Generally, an insignificant variation in the epistatic variances was observed throughout 10 generations of random cross (−13 to 6%), regardless of the type of epistasis and the percentage of the epistatic genes. A significant increase in the additive x additive, additive x dominant, and dominant x additive variances occurred with selfing (114 to 863%), regardless of the percentage of epistatic genes and the type of epistasis. When inbreeding increased, the dominant x dominant variance significantly decreased in the population with high LD (76 to 86%) but increased in the other populations (11 to 175%). The epistatic variances are maximized assuming dominant epistasis, duplicate genes with cumulative effects, and non-epistatic gene interaction. A minimization of the epistatic variances occurs with complementary, recessive, and dominant and recessive epistasis. In non-inbred populations, the genetic covariances have negligible magnitude compared with the genetic variances. In inbred populations, except for duplicate epistasis, the sum of the epistatic covariances was in general negative and with magnitude higher than the non-additive variances, especially under 100% of epistatic genes.

For the populations with intermediate and low LD levels, the previous inferences hold but the genotypic and genetic variances are generally lower than the values for the population of high LD level, regardless of generation, type of epistasis, and percentage of epistatic genes, as exemplified assuming 30% of epistatic genes showing all types of epistasis (Fig. 1). With no exception, the additive variance is also the most important component of the genotypic variance, regardless of the generation. Further, assuming an admixture of the epistasis types and 30% of interacting genes, the ratio epistatic variance/genotypic variance in the high LD population is lower than the ratio in the low LD population (Fig. 2), regardless of the degree of inbreeding (30 to 60%). Note that both populations have the same average allele frequency (0.5). Compared to the populations with intermediate LD, the ratio epistatic variance/genotypic variance under high LD is greater relative to the non-inbred population with average allele frequency of 0.7 (approximately 10 to 60%) but lower relative to the other populations, regardless of the generation and degree of inbreeding (approximately 30 to 80%) (Fig. 2).

Assuming an admixture of the types of epistasis and 30% of epistatic genes, the genetic variances in the non-inbred high LD population are higher than the values observed in the non-inbred low LD population (1.2 to 5.1 times higher). In the inbred populations, in general, the additive, additive x additive, and additive x dominance variances are greater under high LD but the dominance and the dominance x dominance variances are lower (Figs. 1 and 2).

WG Hill, ME Goddard and PM Visscher [10] emphasize that the knowledge about the relative magnitudes of the additive, dominance, and epistatic variances is important in evolutionary biology, medicine, and agriculture. However, the investigation about the joint significance of LD, epistasis, and inbreeding on the genetic variances for a quantitative trait is a challenge, even fixing a trait, i.e., even fixing the number of genes, the allele frequencies, and the degrees of dominance.

One main reason is that the theory available is too complex to allow the assessment of the relative magnitudes of the genetic variances [3, 4, 10, 19]. The other main reason is the large number of combinations between levels of LD (say, low to high) and inbreeding (say, not inbred to completely inbred) with distinct percentage of epistatic genes (say 30 to 100%), degree of epistasis (say, digenic to a high order), and type of epistasis (up to seven types of digenic epistasis, complementary or duplicate trigenic or high-order epistasis, or an admixture of types).

BS Weir and CC Cockerham [18] derived very complex functions for the components of the genotypic variance assuming a two-gene model with inbreeding, LD, and epistasis and concluded that they are of “little use”. T Wang and ZB Zeng [19] only highlight that their theoretical results serve as a framework to understand and properly interpret estimates of the genetic effects and variance components in a QTL mapping experiment. The theoretical models investigated by WG Hill, ME Goddard and PM Visscher [10], assuming linkage equilibrium, predict high proportions of additive variance even in the presence of non-additive gene action. Assuming also linkage equilibrium, the theoretical results from A Maki-Tanila and WG Hill [4] showed that the epistatic variance is small compared to the additive variance, even assuming high heterozygosity. They also emphasize that the majority of the epistatic variance is due to two-locus interaction. Based on theoretical models including LD, WG Hill and A Maki-Tanila [3] confirmed that most of the genotypic variance in a segregating population is additive.

Because the main conclusion from the previously described studies is that most of the genotypic variance is additive, we believe that our simulation-based study provides significant additional knowledge about the influence of LD and epistasis on the genetic variances in non-inbred and inbred populations. Our study has a strong theoretical background on quantitative genetics. We assumed low to high LD levels for genes, not inbred to completely inbred populations, 30 and 100% of epistatic genes, and the seven types of digenic epistasis. Although there is evidence for high-order epistasis, pairwise interaction can contribute substantially to phenotypic variation between individuals [4, 20].

Our results agree with the main findings from WG Hill and A Maki-Tanila [3], A Maki-Tanila and WG Hill [4], and WG Hill, ME Goddard and PM Visscher [10], that LD significantly affects the genetic variances and that most of the genotypic variance is additive. However, from the analyses assuming an admixture of the types of epistasis and 30% of interacting genes, the ratio epistatic variance/genotypic variance was maximized in the populations with intermediate LD and average allele frequency of 0.3 (9 to 10%) and low LD and average allele frequency of 0.5 (10 to 22%), regardless of the generation and degree of inbreeding. The ratio was minimized in the populations with intermediate LD and average allele frequency of 0.7 (3 to 10%) and high LD and average allele frequency of 0.5 (3 to 8%). Our results also give support to the main conclusions of J Clo, J Ronfort and D Abu Awad [2], who assumed an additive model under LD and distinct selfing rates. The differences observed for outcrossing species rely on their assumption of negative LD.

An important aspect to be also discussed is the unavailability of epistatic variance estimates from field phenotypic data. Most of the empirical evidence of epistasis comes from QTL mapping studies [9, 10] simply because when analyzing field data, there is no previous knowledge if there is epistasis. Further, even assuming digenic epistasis, linkage equilibrium, and non-inbred population, it would be necessary to estimate six independent variances and covariances between relatives to estimate the six genetic variances. Comparing estimates of the narrow and broad sense heritabilities only provides evidence of non-additive effects. Recently, however, some estimates of epistatic variances have been provided in studies involving genomic selection [21, 22]. In these studies, the epistatic variance ranged from 0 to 9.5% of the phenotypic variance.

Our main findings from a simulation-based study supported by quantitative genetics theory involving LD, epistasis, and inbreeding were: 1) the LD level for genes, even under a relatively low gene density, has a significant effect on the genetic variances in non-inbred and inbred populations; 2) assuming digenic epistasis, the additive variance is in general the most important component of the genotypic variance in non-inbred and inbred populations; 3) the ratio epistatic variance/genotypic variance is proportional to the percentage of interacting genes and increases with random cross and selfing; 4) in general, the additive x additive variance is the most important component of the epistatic variance; and 5) the maximization of the epistatic variance depends on the allele frequency, level of LD, and epistasis type. Two important implications of our results are that selection based on breeding value prediction remains the best approach for population improvement and that cross- and self-pollinated populations keep a non-negligible amount of genetic variation for quantitative traits to allow their adaptive potential to environmental changes, assuming LD and epistasis.

Additive and dominance genetic values in inbred populations

Assume initially a single biallelic gene (A/a) determining a quantitative trait, where A is the gene that increases the trait expression, and a population derived by n generations of selfing from a Hardy-Weinberg equilibrium population (generation 0). Defining ${M}_{F}^{1}$ and ${M}_{F}^{2}$ as the means of the inbred population after an allelic substitution for the genes A and a, respectively, the average effect of the allelic genes in the inbred population are ${\alpha }_{A}^{\left(n\right)}={M}_{F}^{1}-{M}_{F}=q\alpha +2Fpqd$ and ${\alpha }_{a}^{\left(n\right)}={M}_{F}^{2}-{M}_{F}=-p\alpha +2Fpqd$, where ${M}_{F}=m+\left(p-q\right)a+2pqd- 2Fpqd=M- 2Fpqd$ is the inbred population mean, p and q are the allelic frequencies, $\alpha$ is the average effect of an allelic substitution, $F$ is the inbreeding coefficient, and M is the non-inbred population mean. Thus, the additive values in the inbred population are ${A}_{AA}^{\left(n\right)}=2q\alpha +4Fpqd={A}_{AA}^{\left(0\right)}+4Fpqd$, ${A}_{Aa}^{\left(n\right)}=\left(q-p\right)\alpha +4Fpqd={A}_{Aa}^{\left(0\right)}+4Fpqd$, and ${A}_{aa}^{\left(n\right)}=-2p\alpha +4Fpqd={A}_{aa}^{\left(0\right)}+4Fpqd$, where ${A}^{\left(0\right)}$ is the additive value in the non-inbred population. Note that $E\left({A}^{\left(n\right)}\right)=4Fpqd$. Expressing the genotypic values in the inbred population as a function of ${M}_{F}$, we have:

$${G}_{AA}={M}_{F}+{A}_{AA}^{\left(0\right)}+\left(-2{q}^{2}d+2Fpqd\right)={M}_{F}+{A}_{AA}^{\left(0\right)}+\left({D}_{AA}^{\left(0\right)}+2Fpqd\right)={M}_{F}+{A}_{AA}^{\left(0\right)}+{D}_{AA}^{\left(n\right)}$$

$${G}_{Aa}={M}_{F}+{A}_{Aa}^{\left(0\right)}+\left(2pqd+2Fpqd\right)={M}_{F}+{A}_{Aa}^{\left(0\right)}+\left({D}_{Aa}^{\left(0\right)}+2Fpqd\right)={M}_{F}+{A}_{Aa}^{\left(0\right)}+{D}_{Aa}^{\left(n\right)}$$

$${G}_{aa}={M}_{F}+{A}_{aa}^{\left(0\right)}+\left(-2{p}^{2}d+2Fpqd\right)={M}_{F}+{A}_{aa}^{\left(0\right)}+\left({D}_{aa}^{\left(0\right)}+2Fpqd\right)={M}_{F}+{A}_{aa}^{\left(0\right)}+{D}_{aa}^{\left(n\right)}$$

Note that in the inbred population, $E\left({A}^{\left(0\right)}\right)=E\left({D}^{\left(n\right)}\right)=0$ but $E\left({D}^{\left(0\right)}\right)=-2Fpqd$. Note also that the additive value in the non-inbred population is the additive value in the inbred population expressed as deviation from its mean $\left({A}^{\left(0\right)}={A}^{\left(n\right)}-4Fpqd\right)$ and the dominance value in the inbred population is the dominance value in the non-inbred population expressed as deviation from its mean $\left({D}^{\left(n\right)}={D}^{\left(0\right)}+2Fpqd\right)$. This implies that, in the inbred population, $E\left(G\right)={M}_{F}$.
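The single-gene decomposition above can be checked numerically. The following is a minimal sketch, not the authors' simulation code: all numeric values are hypothetical, and the genotype probabilities under inbreeding ($p^2+Fpq$, $2pq(1-F)$, $q^2+Fpq$) are the standard ones, assumed here because they are not restated in this section. The names `A0`, `D0`, `Dn`, and `expect` are illustrative only.

```python
# Single biallelic gene; every numeric value below is hypothetical.
p, q = 0.3, 0.7            # frequencies of alleles A and a
m, a, d = 10.0, 2.0, 1.0   # midpoint, additive and dominance deviations (as in M_F)
F = 0.75                   # inbreeding coefficient

alpha = a + (q - p) * d              # average effect of an allelic substitution
M = m + (p - q) * a + 2 * p * q * d  # non-inbred population mean
M_F = M - 2 * F * p * q * d          # inbred population mean

# Assumed (standard) genotype probabilities in the inbred population.
freq = {"AA": p * p + F * p * q, "Aa": 2 * p * q * (1 - F), "aa": q * q + F * p * q}
G = {"AA": m + a, "Aa": m + d, "aa": m - a}  # genotypic values

# Additive and dominance values of the non-inbred population.
A0 = {"AA": 2 * q * alpha, "Aa": (q - p) * alpha, "aa": -2 * p * alpha}
D0 = {"AA": -2 * q * q * d, "Aa": 2 * p * q * d, "aa": -2 * p * p * d}
# Dominance values of the inbred population: D(n) = D(0) + 2Fpqd.
Dn = {g: D0[g] + 2 * F * p * q * d for g in D0}


def expect(values):
    """Expectation over the inbred genotype probabilities."""
    return sum(freq[g] * values[g] for g in freq)


# Each genotypic value equals M_F + A(0) + D(n).
for g in G:
    assert abs(G[g] - (M_F + A0[g] + Dn[g])) < 1e-9

print("E[A(0)] =", expect(A0))
print("E[D(0)] =", expect(D0), "versus -2Fpqd =", -2 * F * p * q * d)
print("E[D(n)] =", expect(Dn))
print("E[G] - M_F =", expect(G) - M_F)
```

Under these assumptions the expectations of ${A}^{(0)}$ and ${D}^{(n)}$ should vanish up to floating-point error, while the expectation of ${D}^{(0)}$ should match $-2Fpqd$.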

Genetic variances in inbred populations in LD

Assume now two linked biallelic genes (A/a and B/b) determining a quantitative trait and a non-inbred population in LD (generation 0). Assume dominance but initially no epistasis. After n generations of selfing, the genotypic variance for the two genes in the inbred population is (see the genotype probabilities in the Additional File Appendix) ${\sigma }_{G}^{2\left(n\right)}={\sigma }_{A}^{2\left(n\right)}+{\sigma }_{D}^{2\left(n\right)}+2{\sigma }_{A,D}^{\left(n\right)}$, where:

${\sigma }_{A}^{2\left(n\right)}=\left(1+F\right)\left(2{p}_{a}{q}_{a}{\alpha }_{a}^{2}+2{p}_{b}{q}_{b}{\alpha }_{b}^{2}\right)+2\left[2+{c}_{1}\left(1-2{r}_{ab}\right)\right]{\varDelta }_{ab}^{(-1)}{\alpha }_{a}{\alpha }_{b}=\left(1+F\right){\sigma }_{A}^{2\left(0\right)}+ 2\left[{c}_{1}\left(1-2{r}_{ab}\right)-2F\right]{\varDelta }_{ab}^{(-1)}{\alpha }_{a}{\alpha }_{b}$ is the additive variance,

${\sigma }_{D}^{2\left(n\right)}=\left(1-{F}^{2}\right)\left(4{p}_{a}^{2}{q}_{a}^{2}{d}_{a}^{2}+4{p}_{b}^{2}{q}_{b}^{2}{d}_{b}^{2}\right)+F\left[4{p}_{a}{q}_{a}{\left({p}_{a}-{q}_{a}\right)}^{2}{d}_{a}^{2}+4{p}_{b}{q}_{b}{\left({p}_{b}-{q}_{b}\right)}^{2}{d}_{b}^{2}\right]+ 8\left\{\left(1-F\right)\left({c}^{n}-1+F\right){p}_{a}{q}_{a}{p}_{b}{q}_{b}+\left({p}_{a}-{q}_{a}\right)\left({p}_{b}-{q}_{b}\right)\left[\left(1-F\right){c}^{n}-\left(1-2F\right)+{c}_{1}\left(1-2{r}_{ab}\right)/2\right]{\varDelta }_{ab}^{(-1)}/2+ \left(1-F\right){c}^{n}{{\varDelta }_{ab}^{(-1)}}^{2}\right\}{d}_{a}{d}_{b}=\left(1-{F}^{2}\right){\sigma }_{D}^{2\left(0\right)}+F{D}_{2}+8\left\{\left(1-F\right)\left({c}^{n}-1+F\right){p}_{a}{q}_{a}{p}_{b}{q}_{b}+ \left({p}_{a}-{q}_{a}\right)\left({p}_{b}-{q}_{b}\right)\left[\left(1-F\right){c}^{n}-\left(1-2F\right)+{c}_{1}\left(1-2{r}_{ab}\right)/2\right]{\varDelta }_{ab}^{(-1)}/2+\left[\left(1-F\right){c}^{n}-\left(1-{F}^{2}\right)\right]{{\varDelta }_{ab}^{(-1)}}^{2}\right\}{d}_{a}{d}_{b}$ is the dominance variance, and

${\sigma }_{A,D}^{\left(n\right)}=2F\left[{2p}_{a}{q}_{a}\left({p}_{a}-{q}_{a}\right){\alpha }_{a}{d}_{a}+{2p}_{b}{q}_{b}\left({p}_{b}-{q}_{b}\right){\alpha }_{b}{d}_{b}\right]+\left[2F+{c}_{1}\left(1-2{r}_{ab}\right)\right]{\varDelta }_{ab}^{\left(-1\right)}\left[\left({p}_{b}-{ q}_{b}\right){\alpha }_{a}{d}_{b}+\left({p}_{a}-{q}_{a}\right){\alpha }_{b}{d}_{a}\right]=2F{D}_{1}+\left[2F+{c}_{1}\left(1-2{r}_{ab}\right)\right]{\varDelta }_{ab}^{\left(-1\right)}\left[\left({p}_{b}-{ q}_{b}\right){\alpha }_{a}{d}_{b}+\left({p}_{a}-{q}_{a}\right){\alpha }_{b}{d}_{a}\right]$ is the covariance between additive and dominance values,

where ${\varDelta }_{ab}^{(-1)} ={P}_{AB}^{(-1)}.{P}_{ab}^{(-1)}-{P}_{Ab}^{(-1)}.{P}_{aB}^{(-1)}$ is the measure of LD in the gametic pool of generation −1 [23], where ${P}^{(-1)}$ is a haplotype probability, ${r}_{ab}$ is the recombination frequency, ${c}_{1}=2\left\{1-{\left[\left(1-2{r}_{ab}\right)/2\right]}^{n}\right\}/\left(1+2{r}_{ab}\right)$, $c=1-2{r}_{ab}\left(1-{r}_{ab}\right)$, ${\sigma }_{A}^{2\left(0\right)}=2{p}_{a}{q}_{a}{\alpha }_{a}^{2}+2{p}_{b}{q}_{b}{\alpha }_{b}^{2}+4{\varDelta }_{ab}^{(-1)}{\alpha }_{a}{\alpha }_{b}$ and ${\sigma }_{D}^{2\left(0\right)}=4{p}_{a}^{2}{q}_{a}^{2}{d}_{a}^{2}+4{p}_{b}^{2}{q}_{b}^{2}{d}_{b}^{2}+8{{\varDelta }_{ab}^{(-1)}}^{2}{d}_{a}{d}_{b}$ are the additive and dominance variances in the non-inbred population in LD [24], and ${D}_{1}$ (covariance of a and d) and ${D}_{2}$ (variance of d) are the components of the covariance of relatives from self-fertilization, assuming linkage equilibrium [8]. The other terms are the covariances between the average effects of an allelic substitution, between dominance deviations, and between the average effect of an allelic substitution and dominance deviation, for genes in LD. Because we assumed biallelic genes, $\check{H}={\sigma }_{D}^{2}$. Thus, $\left(1-{F}^{2}\right){\sigma }_{D}^{2\left(0\right)}=\left(1-F\right){\sigma }_{D}^{2\left(0\right)}+F\left(1-F\right)\check{H}$. Note that the genotypic variance derived here is a general formulation for Cockerham’s genotypic variance c_ggg [8], assuming LD. If p = q, ${\sigma }_{A,D}^{\left(n\right)}=0$.
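As an illustration of how the additive variance under selfing can be evaluated, the sketch below codes the two equivalent expressions given for ${\sigma }_{A}^{2\left(n\right)}$ and checks that they agree. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name and all input values are hypothetical, and the inbreeding coefficient is taken as $F=1-{(1/2)}^{n}$, the usual value after n generations of selfing from a non-inbred base, which this section does not state explicitly.

```python
def selfing_additive_variance(n, pa, pb, alpha_a, alpha_b, delta, r):
    """Additive variance after n generations of selfing for two genes in LD.

    Returns both forms given in the text so that they can be compared.
    delta is the LD measure in the gametic pool of generation -1 and
    r is the recombination frequency.
    """
    qa, qb = 1 - pa, 1 - pb
    F = 1 - 0.5 ** n  # assumed inbreeding coefficient under selfing
    c1 = 2 * (1 - ((1 - 2 * r) / 2) ** n) / (1 + 2 * r)
    # Additive variance of the non-inbred population without the LD term.
    le_part = 2 * pa * qa * alpha_a ** 2 + 2 * pb * qb * alpha_b ** 2
    # Additive variance of the non-inbred population in LD.
    sigma_a0 = le_part + 4 * delta * alpha_a * alpha_b
    ld_term = delta * alpha_a * alpha_b
    form1 = (1 + F) * le_part + 2 * (2 + c1 * (1 - 2 * r)) * ld_term
    form2 = (1 + F) * sigma_a0 + 2 * (c1 * (1 - 2 * r) - 2 * F) * ld_term
    return form1, form2


# Hypothetical inputs: intermediate allele frequencies, positive LD, tight linkage.
for generation in (0, 1, 5, 10):
    first, second = selfing_additive_variance(generation, 0.5, 0.5, 1.0, 0.8, 0.2, 0.05)
    print(generation, first, second)
```

With n = 0 both forms reduce to ${\sigma }_{A}^{2\left(0\right)}$, and with ${r}_{ab}=0$ they reduce to $\left(1+F\right){\sigma }_{A}^{2\left(0\right)}$, which gives two quick sanity checks on the coded expressions.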

Assuming LD but no inbreeding, the genotypic variance after n generations of random cross in the non-inbred population in LD is ${\sigma }_{G}^{2\left(n\right)}={\sigma }_{A}^{2\left(n\right)}+{\sigma }_{D}^{2\left(n\right)}$, because ${\sigma }_{A,D}^{\left(n\right)}=0$, where:

$${\sigma }_{A}^{2\left(n\right)}=2{p}_{a}{q}_{a}{\alpha }_{a}^{2}+2{p}_{b}{q}_{b}{\alpha }_{b}^{2}+4{\left(1-{r}_{ab}\right)}^{n}{\varDelta }_{ab}^{(-1)}{\alpha }_{a}{\alpha }_{b}$$

$${\sigma }_{D}^{2\left(n\right)}=4{p}_{a}^{2}{q}_{a}^{2}{d}_{a}^{2}+4{p}_{b}^{2}{q}_{b}^{2}{d}_{b}^{2}+8{\left[{{\left(1-{r}_{ab}\right)}^{n}\varDelta }_{ab}^{(-1)}\right]}^{2}{d}_{a}{d}_{b}$$

Thus, the genotypic variance can increase or decrease after n generations of random cross in a non-inbred population, depending on the sign of the LD measure. The LD value is positive for genes in coupling phase and negative for genes in repulsion phase.
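The dependence on the sign of the LD measure can be made concrete with a short sketch of the two expressions above. The function name and every input value are hypothetical, and both average effects of substitution are assumed positive so that the sign of the LD term follows the sign of ${\varDelta }_{ab}^{(-1)}$.

```python
def random_cross_variances(n, pa, pb, alpha_a, alpha_b, da, db, delta, r):
    """Additive and dominance variances after n generations of random cross."""
    qa, qb = 1 - pa, 1 - pb
    delta_n = (1 - r) ** n * delta  # LD remaining after n generations
    var_a = (2 * pa * qa * alpha_a ** 2 + 2 * pb * qb * alpha_b ** 2
             + 4 * delta_n * alpha_a * alpha_b)
    var_d = (4 * pa ** 2 * qa ** 2 * da ** 2 + 4 * pb ** 2 * qb ** 2 * db ** 2
             + 8 * delta_n ** 2 * da * db)
    return var_a, var_d


# Hypothetical coupling (delta > 0) and repulsion (delta < 0) populations.
common = dict(pa=0.5, pb=0.5, alpha_a=1.0, alpha_b=1.0, da=0.5, db=0.5, r=0.05)
coupling = [random_cross_variances(n, delta=0.2, **common)[0] for n in (0, 10)]
repulsion = [random_cross_variances(n, delta=-0.2, **common)[0] for n in (0, 10)]

print("additive variance, coupling :", coupling)
print("additive variance, repulsion:", repulsion)
```

In this sketch the additive variance falls under coupling and rises under repulsion as the LD decays, whereas the dominance term involves the squared LD measure and therefore does not change direction with its sign.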

Epistasis in non-inbred and inbred populations in LD

The quantitative genetics theory for modelling epistasis in a population in LD is a generalization of the theory proposed by O Kempthorne [16], who assumed a non-inbred population in linkage equilibrium and any number of alleles. We assumed biallelism. It should be emphasized that Kempthorne’s theory allows a generalization from two to three or more interacting genes. But fitting three or more interacting genes in a population in LD is a challenge because the genotype probabilities for three or more genes in LD are too complex to derive. Furthermore, only complementary and duplicate epistasis can be easily defined for three or more epistatic genes.

Assume now that the two previously defined genes are epistatic. The genotypic value is [16]:

$${G}_{ijkl}=M+{\alpha }_{i}^{1}+{\alpha }_{j}^{1}+{\alpha }_{k}^{2}+{\alpha }_{l}^{2}+{\delta }_{ij}^{1}+{\delta }_{kl}^{2}+{\left({\alpha }^{1}{\alpha }^{2}\right)}_{ik}+{\left({\alpha }^{1}{\alpha }^{2}\right)}_{jk}+{\left({\alpha }^{1}{\alpha }^{2}\right)}_{il}+{\left({\alpha }^{1}{\alpha }^{2}\right)}_{jl}+{ \left({\alpha }^{1}{\delta }^{2}\right)}_{ikl}+{\left({\alpha }^{1}{\delta }^{2}\right)}_{jkl}+{\left({{\delta }^{1}\alpha }^{2}\right)}_{ijk}+{\left({{\delta }^{1}\alpha }^{2}\right)}_{ijl}+{\left({{\delta }^{1}\delta }^{2}\right)}_{ijkl}=M+A+D+AA+AD+ DA+DD$$

where AA, AD, DA, and DD are the additive x additive, additive x dominance, dominance x additive, and dominance x dominance epistatic genetic values.

The parametric values of the 36 parameters for the nine genotypic values are obtained by solving the equations $\beta ={\left(X\text{'}VX\right)}^{-1}X\text{'}Vy$, under the restrictions defined by O Kempthorne [16], where $X$ is the incidence matrix, $V=diagonal\left\{{f}_{ij}^{\left(n\right)}\right\}$ is the diagonal matrix of the genotype probabilities, and $y$ is the vector of the genotypic values $\left({G}_{ij}\right)$ (i, j = 0, 1, and 2).
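The weighted least-squares solution above can be illustrated on a reduced problem. The sketch below is an assumption-laden simplification: it fits only an intercept and two allele-count regressors to the nine genotypic values, it assumes linkage equilibrium to build the genotype probabilities, and it omits Kempthorne's restrictions and the remaining parameters of the full 36-parameter model. All names are hypothetical.

```python
import numpy as np


def weighted_least_squares(X, f, y):
    """Solve beta = (X'VX)^{-1} X'Vy with V = diag(f)."""
    V = np.diag(f)
    return np.linalg.solve(X.T @ V @ X, X.T @ V @ y)


def hardy_weinberg(p):
    """Genotype probabilities indexed by the number of copies of the first allele."""
    q = 1.0 - p
    return {2: p * p, 1: 2 * p * q, 0: q * q}


# Nine two-gene genotypes, indexed as (i, j) with i, j = 2, 1, 0.
genotypes = [(i, j) for i in (2, 1, 0) for j in (2, 1, 0)]
fa, fb = hardy_weinberg(0.6), hardy_weinberg(0.3)

# Genotype probabilities under linkage equilibrium (an assumption of this sketch).
f = np.array([fa[i] * fb[j] for i, j in genotypes])

# Reduced incidence matrix: intercept, copies of allele A, copies of allele B.
X = np.array([[1.0, i, j] for i, j in genotypes])

# Purely additive genotypic values, used here only as a check of the solver.
y = np.array([10.0 + 2.0 * i + 3.0 * j for i, j in genotypes])

beta = weighted_least_squares(X, f, y)
```

Because the illustrative genotypic values are exactly additive, the solver returns the intercept and the two regression coefficients used to build them; with dominance or epistasis the same call would return the frequency-weighted best linear fit.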

O Kempthorne [16] provided explicit functions for all effects because he assumed linkage equilibrium. Assuming LD makes it very difficult to derive such functions, but the following results hold:

1) the expectation of the breeding value is zero regardless of the degree of inbreeding in the population.

2) the expectation of the dominance value is $E{\left(D\right)}^{\left(n\right)}={p}_{a}{q}_{a}F\left({\delta }_{AA}-{2\delta }_{Aa}{+\delta }_{aa}\right)+{p}_{b}{q}_{b}F\left({\delta }_{BB}-{2\delta }_{Bb}{+\delta }_{bb}\right)$; then, defining the dominance value in an inbred population as the dominance value expressed as deviation from its mean $\left({D}^{\left(n\right)}=D-E{\left(D\right)}^{\left(n\right)}\right)$, $E\left({D}^{\left(n\right)}\right)=0$.

3) the expectation of the additive x additive value is zero only if there is no LD.

4) the expectation of the additive x dominance value is zero only if F = 0 or p = q for all genes.

5) the expectation of the dominance x additive value is zero only if F = 0 or p = q for all genes.

6) the expectation of the dominance x dominance value is zero only if F = 0 and there is no LD.

Thus, defining the additive x additive, additive x dominance, dominance x additive, and dominance x dominance epistatic values as deviations from their means, ${AA}^{\left(n\right)}=AA-E{\left(AA\right)}^{\left(n\right)}$, ${AD}^{\left(n\right)}=AD-E{\left(AD\right)}^{\left(n\right)}$, ${DA}^{\left(n\right)}=DA-E{\left(DA\right)}^{\left(n\right)}$, and ${DD}^{\left(n\right)}=DD-E{\left(DD\right)}^{\left(n\right)}$, the genotypic value in an inbred population can be expressed as

$$G=M+E{\left(D\right)}^{\left(n\right)}+E{\left(AA\right)}^{\left(n\right)}+E{\left(AD\right)}^{\left(n\right)}+E{\left(DA\right)}^{\left(n\right)}+E{\left(DD\right)}^{\left(n\right)}+A+{D}^{\left(n\right)}+{AA}^{\left(n\right)}+{ AD}^{\left(n\right)}+{DA}^{\left(n\right)}+{DD}^{\left(n\right)}={M}_{F}+A+{D}^{\left(n\right)}+{AA}^{\left(n\right)}+{AD}^{\left(n\right)}+{DA}^{\left(n\right)}+{DD}^{\left(n\right)}$$

This implies that $E\left(G\right)={M}_{F}$. If F = 0 then

$$G=M+E\left(AA\right)+E\left(DD\right)+A+D+\left[AA-E\left(AA\right)\right]+AD+DA+\left[DD-E\left(DD\right)\right]={M}^{*}+A+D+{AA}^{*}+AD+DA+{DD}^{*}$$

where,

$E\left(AA\right)=2{\varDelta }_{ab}^{(-1)}\left({\alpha }_{A}{\alpha }_{B}-{\alpha }_{A}{\alpha }_{b}-{\alpha }_{a}{\alpha }_{B}+{\alpha }_{a}{\alpha }_{b}\right)$ and $E\left(DD\right)={\left[{\varDelta }_{ab}^{(-1)}\right]}^{2}\left({{\delta }_{AA}\delta }_{BB}-2{{\delta }_{AA}\delta }_{Bb}+{{\delta }_{AA}\delta }_{bb}-2{{\delta }_{Aa}\delta }_{BB}+4{{\delta }_{Aa}\delta }_{Bb}-2{{\delta }_{Aa}\delta }_{bb}+{{\delta }_{aa}\delta }_{BB}-2{{\delta }_{aa}\delta }_{Bb}+{{\delta }_{aa}\delta }_{bb}\right)$.

This implies that $E\left(G\right)={M}^{*}$. If F = 0 and there is no LD,

$$G=M+A+D+AA+AD+DA+DD$$

where the linear components are those defined by O Kempthorne [16]. This implies that $E\left(G\right)=M$.
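The expression for $E\left(AA\right)$ in a non-inbred population can be checked by direct enumeration over the gametes that unite at random. This is a minimal sketch under stated assumptions: the additive x additive allelic effects are parametrized as `c * u[i] * v[k]`, a hypothetical convenience chosen only because it satisfies frequency-weighted zero-sum restrictions of the Kempthorne type; the function and variable names are illustrative.

```python
from itertools import product


def expected_aa(pa, pb, delta, c):
    """Return E(AA) by enumeration and by the closed form 2 * LD * contrast."""
    qa, qb = 1.0 - pa, 1.0 - pb
    # Alleles coded 1 (A or B) and 0 (a or b).
    u = {1: qa, 0: -pa}   # sum_i p_i * u_i = 0
    v = {1: qb, 0: -pb}   # sum_k p_k * v_k = 0
    aa = {(i, k): c * u[i] * v[k] for i in (0, 1) for k in (0, 1)}

    # Gamete probabilities with LD value delta.
    gametes = {
        (1, 1): pa * pb + delta,
        (1, 0): pa * qb - delta,
        (0, 1): qa * pb - delta,
        (0, 0): qa * qb + delta,
    }

    # Random union of gametes (i, k) and (j, l); AA value as in the model.
    enumerated = 0.0
    for (i, k), (j, l) in product(gametes, repeat=2):
        value = aa[i, k] + aa[j, k] + aa[i, l] + aa[j, l]
        enumerated += gametes[i, k] * gametes[j, l] * value

    contrast = aa[1, 1] - aa[1, 0] - aa[0, 1] + aa[0, 0]
    return enumerated, 2.0 * delta * contrast


enumerated, closed_form = expected_aa(pa=0.6, pb=0.3, delta=0.05, c=2.0)
```

In this parametrization the contrast equals `c`, so both values equal `2 * delta * c`, and both vanish when there is no LD, in agreement with result 3 above.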

In non-inbred populations in LD, only the additive and dominance values are not correlated. The genotypic variance in these populations is, in simplified form,

$${\sigma }_{G}^{2\left(0\right)}={\sigma }_{A}^{2\left(0\right)}+{\sigma }_{D}^{2\left(0\right)}+{\sigma }_{AA}^{2\left(0\right)}+2{\sigma }_{A,AA}^{\left(0\right)}+2{\sigma }_{D,AA}^{\left(0\right)}+\dots$$

where

$${\sigma }_{AA}^{2\left(0\right)}={f}_{22}^{\left(0\right)}{\left[\left({4\alpha }_{A}{\alpha }_{B}\right)\right]}^{2}+\dots +{f}_{00}^{\left(0\right)}{\left[\left({4\alpha }_{a}{\alpha }_{b}\right)\right]}^{2}-{\left[E{\left(AA\right)}^{\left(0\right)}\right]}^{2}$$

$${\sigma }_{A,AA}^{\left(0\right)}=2{\varDelta }_{ab}^{(-1)}\left[{\alpha }^{A}\left({\alpha }_{A}{\alpha }_{B}{-\alpha }_{A}{\alpha }_{b}+{\alpha }_{a}{\alpha }_{B}-{\alpha }_{a}{\alpha }_{b}\right)+{\alpha }^{B}\left({\alpha }_{A}{\alpha }_{B}{-\alpha }_{a}{\alpha }_{B}+{\alpha }_{A}{\alpha }_{b}-{\alpha }_{a}{\alpha }_{b}\right)\right]$$

$${\sigma }_{D,AA}^{\left(0\right)}=-4{\varDelta }_{ab}^{(-1)}\left[{{p}_{a}{q}_{a}d}_{a}\left({\alpha }_{A}{\alpha }_{B}{-\alpha }_{A}{\alpha }_{b}-{\alpha }_{a}{\alpha }_{B}+{\alpha }_{a}{\alpha }_{b}\right)+{{p}_{b}{q}_{b}d}_{b}\left({\alpha }_{A}{\alpha }_{B}{-\alpha }_{a}{\alpha }_{B}-{\alpha }_{A}{\alpha }_{b}+{\alpha }_{a}{\alpha }_{b}\right)\right]$$

where, to avoid confusion, ${\alpha }^{A}$ and ${\alpha }^{B}$ are the average effects of an allelic substitution.

The assumption of LD makes it very difficult to derive the components of the genotypic variance (additive, dominance, and epistatic variances and the covariances between these effects), even assuming non-inbred populations, biallelic genes, and only digenic epistasis. With respect to the types of digenic epistasis, the following can be defined [25, 26]:

Complementary (${G}_{22}={G}_{21}={G}_{12}={G}_{11}$ and ${G}_{20}={G}_{10}={G}_{02}={G}_{01}={G}_{00}$; proportion of 9:7 in an F₂).
Duplicate (${G}_{22}={G}_{21}={G}_{20}={G}_{12}{=G}_{11}={G}_{10}={G}_{02}={G}_{01}$; proportion of 15:1 in an F₂).
Dominant (${G}_{22}={G}_{21}={G}_{20}={G}_{12}{=G}_{11}={G}_{10}$ and ${G}_{02}={G}_{01}$; proportion of 12:3:1 in an F₂).
Recessive (${G}_{22}={G}_{21}={G}_{12}={G}_{11}$, ${G}_{02}={G}_{01}$, and ${G}_{20}={G}_{10}={G}_{00}$; proportion of 9:3:4 in an F₂).
Dominant and recessive (${G}_{22}={G}_{21}={G}_{12}={G}_{11}={G}_{20}={G}_{10}={G}_{00}$ and ${G}_{02}={G}_{01}$; proportion of 13:3 in an F₂).
Duplicate genes with cumulative effects (${G}_{22}={G}_{21}={G}_{12}={G}_{11}$, and ${G}_{20}={G}_{10}={G}_{02}={G}_{01}$; proportion of 9:6:1 in an F₂).
Non-epistatic genic interaction (${G}_{22}={G}_{21}={G}_{12}={G}_{11}$, ${G}_{20}={G}_{10}$, and ${G}_{02}={G}_{01}$; proportion of 9:3:3:1 in an F₂).
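The F₂ proportions quoted for the seven types follow from grouping the nine genotypes by equal genotypic value and summing their F₂ frequencies (in sixteenths). The sketch below encodes each definition as lists of genotype labels; the dictionary layout and names are illustrative assumptions, and a genotype not listed in a definition (for example $G_{00}$ under duplicate epistasis) is placed in its own class.

```python
# F2 frequency, in quarters, of a single-gene genotype with 2, 1, or 0 copies.
F2_WEIGHT = {2: 1, 1: 2, 0: 1}

# Each class groups the genotypes G_ij that share one genotypic value.
EPISTASIS_CLASSES = {
    "complementary": [["22", "21", "12", "11"], ["20", "10", "02", "01", "00"]],
    "duplicate": [["22", "21", "20", "12", "11", "10", "02", "01"], ["00"]],
    "dominant": [["22", "21", "20", "12", "11", "10"], ["02", "01"], ["00"]],
    "recessive": [["22", "21", "12", "11"], ["02", "01"], ["20", "10", "00"]],
    "dominant and recessive": [["22", "21", "12", "11", "20", "10", "00"], ["02", "01"]],
    "duplicate with cumulative effects": [["22", "21", "12", "11"],
                                          ["20", "10", "02", "01"], ["00"]],
    "non-epistatic": [["22", "21", "12", "11"], ["20", "10"], ["02", "01"], ["00"]],
}


def f2_ratio(classes):
    """Sum the F2 frequencies (out of 16) of the genotypes in each class."""
    return [
        sum(F2_WEIGHT[int(g[0])] * F2_WEIGHT[int(g[1])] for g in group)
        for group in classes
    ]


ratios = {name: f2_ratio(classes) for name, classes in EPISTASIS_CLASSES.items()}
# ratios["complementary"] is [9, 7]; ratios["recessive"] is [9, 3, 4]
```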

Simulated data sets

Because the magnitude of the components of the genotypic variance generally cannot be inferred from the previous functions, all means and genetic variances and covariances were computed from simulated data sets provided by the software REALbreeding (available upon request). This software uses the quantitative genetics theory that was described in the previous sections and in JMS Viana [24]. REALbreeding has been used to provide simulated data in investigations in the areas of genomic selection [27], GWAS [28], QTL mapping [29], linkage disequilibrium [30], population structure [31], and heterotic grouping/genetic diversity [32].

The software simulates individual genotypes for genes and molecular markers and phenotypes in three steps using user inputs. The first step (genome simulation) is the specification of the number of chromosomes, molecular markers, and genes as well as marker type and density. The second step (population simulation) is the specification of the population(s) and sample size or progeny number and size. A population is characterized by the average frequency for the genes (biallelic) and markers (first allele). The final step (trait simulation) is the specification of the individual phenotypes. In this stage, the user specifies the minimum and maximum genotypic values for homozygotes (to compute the a deviations), the minimum and maximum phenotypic values (to avoid outliers), the direction and degree of dominance (to compute the dominance deviations/d), and the broad sense heritability. The current version allows the inclusion of digenic epistasis, gene x environment interaction, and multiple traits (up to 10), including pleiotropy. The population mean (M), additive (A), dominance (D), and epistatic (AA, AD, DA, and DD) genetic values or general and specific combining ability effects (GCA and SCA) or genotypic values (G) and epistatic values (I), depending on the population, are calculated from the parametric gene effects and frequencies and the parametric LD values. The phenotypic values ($P$) are computed assuming error effects $\left(E\right)$ sampled from a normal distribution ($P=M+A+D+AA+AD+DA+DD+ E=G+E$ or $P=M+GCA1+GCA2+SCA+I+E=G+E$).

The population in LD is generated by crossing two populations in linkage equilibrium followed by a generation of random cross. This generation of random cross aims to generate a population in Hardy-Weinberg equilibrium. Thus, generation 0 (the founder population) is a population in Hardy-Weinberg equilibrium, in LD for linked genes and molecular markers, in which the individuals are not related. The parametric LD in this population is ${\varDelta }_{ab}^{(-1)}=\left[\left(1-2{r}_{ab}\right)/4\right]\left({p}_{a1}-{p}_{a2}\right)\left({p}_{b1}-{p}_{b2}\right)$, where the indexes 1 and 2 stand for the allele frequencies in the parental populations.
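The parametric LD of the founder population can be computed directly from the recombination fraction and the parental allele frequencies. A minimal sketch of that formula follows; the function and argument names are illustrative assumptions, not part of REALbreeding.

```python
def founder_ld(r_ab, pa1, pa2, pb1, pb2):
    """LD between genes a and b in the population obtained by crossing two
    populations in linkage equilibrium (indexes 1 and 2 are the parents)."""
    return (1.0 - 2.0 * r_ab) / 4.0 * (pa1 - pa2) * (pb1 - pb2)


# Completely linked genes fixed for alternative alleles in the two parents.
max_coupling = founder_ld(0.0, 1.0, 0.0, 1.0, 0.0)

# Unlinked genes (r = 0.5) show no LD regardless of the parental frequencies.
unlinked = founder_ld(0.5, 0.9, 0.1, 0.8, 0.2)

# Parental frequency differences of opposite sign give negative LD (repulsion).
repulsion = founder_ld(0.1, 0.2, 0.8, 0.8, 0.2)
```

The formula makes explicit that LD in generation 0 requires both linkage (r below 0.5) and a difference in allele frequency between the parental populations at both genes.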

The quantitative genetics theory for epistasis does not solve the challenge of studying genetic variability and covariance between relatives in populations, using simulated data sets, even assuming simplified scenarios such as linkage equilibrium and no inbreeding. Because the genotypic values for any two interacting genes are not known, there are infinitely many sets of genotypic values that satisfy the specifications of each type of digenic epistasis. For example, fixing the gene frequencies (the population) and the parameters m, a, d, and d/a (degree of dominance) for each gene (the trait), the solutions ${G}_{22}={G}_{21}={G}_{12}={G}_{11}$ = 5.25 and ${G}_{20}={G}_{10}={G}_{02}={G}_{01}={G}_{00}$ = 5.71 or ${G}_{22}={G}_{21}={G}_{12}={G}_{11}$ = 6.75 and ${G}_{20}={G}_{10}={G}_{02}={G}_{01}={G}_{00}$ = 2.71 both define complementary epistasis, but the genotypic values are not the same.

The solution implemented in the software allows the user to control the magnitude of the epistatic variance (V(I)), relative to the magnitudes of the additive and dominance variances (V(A) and V(D)). As an input for the user, the software requires the ratio V(I)/(V(A) + V(D)) for each pair of interacting genes (a single value; for example, 1.0). Then, for each pair of epistatic genes the software samples a random value for the epistatic value ${I}_{22}$ (the epistatic value for the genotype AABB), assuming ${I}_{22}\sim N\left(0, V\left(I\right)\right)$. Then, the other epistatic effects and genotypic values are computed.

We simulated grain yield assuming 400 genes in 10 chromosomes of 200 and 50 cM (40 genes/chromosome). The average density was approximately one gene every five and one cM, respectively. We generated five populations, two with high LD level and one with low LD level, all three with an average allele frequency of 0.5, and two populations with intermediate LD level and an average frequency for the favorable genes of 0.3 (not improved) and 0.7 (improved). We defined positive dominance (average degree of dominance of 0.6), maximum and minimum genotypic values for homozygotes of 160 and 30 g plant⁻¹, and maximum and minimum phenotypic values of 180 and 10 g plant⁻¹. The broad sense heritability was 20%. For each population we assumed an additive-dominance model and an additive-dominance model with digenic epistasis, defining 100% and 30% of interacting genes. Concerning the ratio V(I)/(V(A) + V(D)), the analyses assuming ratios 1, 10, and 100 showed that increasing the ratio from 1 to 10 and 100 increased the epistatic variances but also increased the additive and dominance variances. Because the main conclusions for the greater ratios were essentially the same as those provided by ratio 1, we present only the results for ratio 1. With epistasis, we assumed a single type or an admixture of the seven types. We varied the degree of inbreeding from 0.0 to 1.0, assuming 10 generations of selfing. We also assumed 10 generations of random crosses. The population size was 5,000 per generation.
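The range of inbreeding covered by 10 generations of selfing can be made explicit with the standard recurrence for the inbreeding coefficient under self-fertilization, $F_n = 1 - (1/2)^n$ when $F_0 = 0$. This is a textbook result used here as an assumption about how the degree of inbreeding maps to generations; it is not taken from the REALbreeding documentation, and the function name is illustrative.

```python
def selfing_inbreeding(n, f0=0.0):
    """Inbreeding coefficient after n generations of selfing,
    starting from a population with inbreeding coefficient f0."""
    return 1.0 - (1.0 - f0) * 0.5 ** n


# Degree of inbreeding for generations 0 to 10 of selfing.
trajectory = [selfing_inbreeding(n) for n in range(11)]
```

Under this assumption the population is non-inbred in generation 0, half inbred after one generation, and within 0.1% of complete inbreeding after 10 generations, which is consistent with the stated range of 0.0 to 1.0.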

The characterization of the LD in the populations was based on the parametric Δ, r², and D’ values for the 40 genes in chromosome 1, which were provided by REALbreeding (it should be similar for the other chromosomes). The heatmaps were processed using the R package pheatmap. Assuming no epistasis, the software provides the parametric additive and dominance genetic values and the parametric genetic variances and covariances. Assuming epistasis, the software provides the parametric additive, dominance, and epistatic genetic values. Thus, under epistasis, the genetic variances and covariances were computed from the parametric genetic values, using a sample size of 5,000 individuals per generation. Two important implications of our results are that selection based on breeding value prediction remains the best approach for population improvement and that cross- and self-pollinated populations keep a non-negligible amount of genetic variation for quantitative traits to allow their adaptive potential to environmental changes, assuming LD and epistasis.
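Computing the genetic variances and covariances from the per-individual genetic values amounts to estimating a covariance matrix across the components A, D, AA, AD, DA, and DD. The sketch below shows one way to do this with NumPy; the input layout (a dictionary of arrays) and all names are hypothetical, and the simulated values in the example are random placeholders rather than REALbreeding output.

```python
import numpy as np


def variance_components(values):
    """Covariance matrix of the genetic-value components.

    values maps a component name (e.g. "A", "D", "AA") to a 1-D array with
    one genetic value per individual. Diagonal entries of the result are the
    variances and off-diagonal entries are the covariances.
    """
    names = list(values)
    matrix = np.vstack([values[name] for name in names])  # rows are components
    return names, np.cov(matrix, ddof=0)


# Placeholder data for 5,000 individuals (illustration only).
rng = np.random.default_rng(0)
sample = {name: rng.normal(size=5000) for name in ("A", "D", "AA", "AD", "DA", "DD")}

names, cov = variance_components(sample)

# The genotypic variance is the sum of all variances and covariances.
genotypic_variance = cov.sum()

# Ratio of epistatic variance (variances only, no covariances) to genotypic variance.
epistatic = [names.index(k) for k in ("AA", "AD", "DA", "DD")]
epistatic_ratio = cov[epistatic, epistatic].sum() / genotypic_variance
```

Summing every entry of the matrix recovers the variance of G = A + D + AA + AD + DA + DD, so a sum of covariances that is negative and large in magnitude can make an individual variance component exceed the genotypic variance.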

LD–linkage disequilibrium; A–additive value; D–dominance value; AA–additive x additive value; AD–additive x dominance value; DA–dominance x additive value; DD–dominance x dominance value; G–genotypic value; I–epistatic value.

Ethics approval and consent to participate: Not applicable.

Consent for publication: Not applicable.

Availability of data and materials: The data set is available at https://doi.org/10.6084/m9.figshare.13607306.v2.

Competing Interests: The authors declare that they have no competing interests.

Funding: None.

Authors' contributions: JMSV designed the study, developed the software, processed the data, and wrote the manuscript. AAFG designed the study, processed the data, and revised the manuscript. All authors read and approved the final manuscript.

Acknowledgements: We thank the National Council for Scientific and Technological Development (CNPq), the Brazilian Federal Agency for Support and Evaluation of Graduate Education (Capes; Finance Code 001), and the Foundation for Research Support of Minas Gerais State (Fapemig) for financial support.

Fisher RA: The correlation between relatives on the supposition of Mendelian inheritance. Transactions of the Royal Society of Edinburgh 1918, 52(2):399–433.
Clo J, Ronfort J, Abu Awad D: Hidden genetic variance contributes to increase the short-term adaptive potential of selfing populations. Journal of Evolutionary Biology 2020, 33(9):1203–1215.
Hill WG, Maki-Tanila A: Expected influence of linkage disequilibrium on genetic variance caused by dominance and epistasis on quantitative traits. Journal of Animal Breeding and Genetics 2015, 132(2):176–186.
Maki-Tanila A, Hill WG: Influence of Gene Interaction on Complex Trait Variation with Multilocus Models. Genetics 2014, 198(1):355–367.
Clo J, Gay L, Ronfort J: How does selfing affect the genetic variance of quantitative traits? An updated meta-analysis on empirical results in angiosperm species. Evolution 2019, 73(8):1578–1590.
Hasselgren M, Noren K: Inbreeding in natural mammal populations: historical perspectives and future challenges. Mammal Review 2019, 49(4):369–383.
Howard JT, Pryce JE, Baes C, Maltecca C: Invited review: Inbreeding in the genomics era: Inbreeding, inbreeding depression, and management of genomic variability. Journal of dairy science 2017, 100(8):6009–6024.
Cockerham CC: Covariances of relatives from self-fertilization. Crop Science 1983, 23:1177–1180.
Mackay TFC: Epistasis and quantitative traits: using model organisms to study gene-gene interactions. Nature Reviews Genetics 2014, 15(1):22–33.
Hill WG, Goddard ME, Visscher PM: Data and theory point to mainly additive genetic variance for complex traits. Plos Genetics 2008, 4(2).
Vitezica ZG, Legarra A, Toro MA, Varona L: Orthogonal Estimates of Variances for Additive, Dominance, and Epistatic Effects in Populations. Genetics 2017, 206(3):1297–1307.
Forneris NS, Vitezica ZG, Legarra A, Perez-Enciso M: Influence of epistasis on response to genomic selection using complete sequence data. Genetics Selection Evolution 2017, 49.
Su G, Christensen OF, Ostersen T, Henryon M, Lund MS: Estimating additive and non-additive genetic variances and predicting genetic merits using genome-wide dense single nucleotide polymorphism markers. PloS one 2012, 7(9):e45293.
Monir MM, Zhu J: Comparing GWAS Results of Complex Traits Using Full Genetic Model and Additive Models for Revealing Genetic Architecture. Scientific Reports 2017, 7.
Misztal I, Aguilar I, Lourenco D, Ma L, Steibel J, Toro M: Emerging issues in genomic selection. Journal of animal science 2021.
Kempthorne O: The theoretical values of correlations between relatives in random mating populations. Genetics 1954, 40:153–167.
Cockerham CC: An extension of the concept of partitioning hereditary variance for analysis of covariances among relatives when epistasis is present. Genetics 1954, 39:859–882.
Weir BS, Cockerham CC: Two-locus theory in quantitative genetics. In: International Conference on Quantitative Genetics: 1976; Ames. The Iowa State University Press; 1977: 247–269.
Wang T, Zeng ZB: Models and partition of variance for quantitative trait loci with epistasis and linkage disequilibrium. BMC genetics 2006, 7.
Domingo J, Baeza-Centurion P, Lehner B: The Causes and Consequences of Genetic Interactions (Epistasis). Annual Review of Genomics and Human Genetics 2019, 20:433–460.
Chen ZQ, Baison J, Pan J, Westin J, Gil MRG, Wu HX: Increased Prediction Ability in Norway Spruce Trials Using a Marker X Environment Interaction and Non-Additive Genomic Selection Model. Journal of Heredity 2019, 110(7):830–843.
Vitezica ZG, Reverter A, Herring W, Legarra A: Dominance and epistatic genetic variances for litter size in pigs using genomic models. Genetics Selection Evolution 2018, 50.
Kempthorne O: An Introduction to Genetic Statistics. Ames: The Iowa State University Press; 1973.
Viana JMS: Quantitative genetics theory for non-inbred populations in linkage disequilibrium. Genetics and Molecular Biology 2004, 27(4):594–601.
Viana JMS: Dominance, epistasis, heritabilities and expected genetic gains. Genetics and Molecular Biology 2005, 28(1):67–74.
Viana JMS: Components of variation of polygenic systems with digenic epistasis. Genetics and Molecular Biology 2000, 23(4):883–892.
Viana JMS, Pereira HD, Piepho HP, Silva FFE: Efficiency of Genomic Prediction of Nonassessed Testcrosses. Crop Science 2019, 59(5):2020–2027.
Pereira HD, Viana JMS, Andrade ACB, Silva FFE, Paes GP: Relevance of genetic relationship in GWAS and genomic prediction. Journal of Applied Genetics 2018, 59(1):1–8.
Viana JMS, Silva FF, Mundim GB, Azevedo CF, Jan HU: Efficiency of low heritability QTL mapping under high SNP density. Euphytica 2017, 213(1).
Andrade ACB, Viana JMS, Pereira HD, Pinto VB, Fonseca ESF: Linkage disequilibrium and haplotype block patterns in popcorn populations. PloS one 2019, 14(9):e0219417.
Viana JMS, Valente MSF, Silva FF, Mundim GB, Paes GP: Efficacy of population structure analysis with breeding populations and inbred lines. Genetica 2013, 141(7–9):389–399.
Viana JMS, Risso LA, Oliveira deLima R, Fonseca e Silva F: Factors affecting heterotic grouping with cross-pollinating crops. Agronomy Journal 2020.


Editorial decision: Major revision
10 Dec, 2021
Reviews received at journal
13 Oct, 2021
Reviews received at journal
06 Oct, 2021
Reviewers agreed at journal
16 Sep, 2021
Reviewers agreed at journal
12 Sep, 2021
Reviewers invited by journal
12 Sep, 2021
Editor assigned by journal
12 Sep, 2021
Editor invited by journal
08 Sep, 2021
Submission checks completed at journal
08 Sep, 2021
First submitted to journal
28 Jun, 2021
