Statistical properties of GRAMMAR-lambda for binary diseases
For the two genomic datasets, phenotypes were simulated to be controlled by 40, 200, and 1000 QTNs at the low (0.2), moderate (0.5), and high (0.8) genomic heritability, respectively. The GRAMMAR-Lambda, a test at once is comparable with the four competing methods—GRAMMAR, GMMAT, LTMLM, and SAIGE. Association results are displayed selectively in Figure 1 for the Q-Q profiles and Figure 2 for ROC profiles (Figure 1S and Figure 2S for details); the genomic controls are estimated in Table 1S. Under genomic control of exact 1, GRAMMAR-Lambda performs stably and with high statistical power, irrespective of how much the QTNs control the quantitative traits and how complex the population structures are. In contrast, GMMAT exhibits the same statistical power as GRAMMAR-Lambda, but with slightly low, even instable genomic control. For GRAMMAR, genomic controls are the lowest among GRAMMAR-Lambda and the four competing methods, the population structure is more complex, and the false negative rate is higher. Although LTMLM detected most QTNs for all simulated phenotypes in maize and SAIGE provided higher statistical power for those controlled by 1,000 QTNs at the genomic heritability of 0.2, they produce strong false positive errors. For humans, there were no distinct differences in statistical properties of GRAMMAR-Lambda and the four competing methods, although GRAMMAR produced a slight false negative error.
Furthermore, GRAMMAR-Lambda could jointly analyze multiple QTN candidates chosen from a test at once at a significance level of 0.05. For convenience of comparison, statistical powers of detecting QTNs are depicted together in Figure 1 selectively and in Supplementary Figure 1S in detail. Through backward multiple regression analysis, GRAMMAR-Lambda offered significant improved statistical power, but also kept almost perfect genomic control that was infinitely close to 1. Even at the highest false positive rates, LTMLM was inferior to GRAMMAR-Lambda with joint analysis in terms of statistical power.
Sensitivity to estimate genomic heritability
For normally distributed traits, GRAMMAR-Lambda does not need to estimate genomic heritability. To test whether this finding fits the binary diseases or not, we pre-specify the genomic heritability at 0.5 to analyze the simulated phenotypes controlled by 200 QTNs at different levels of heritabilities (see Figure 3S). Figure 3 compares the statistical behaviors of GRAMMAR-Lambda and the other methods by specifying and estimating genomic heritability. As shown in each plot, both QQ and ROC curves obtained by pre-specifying the genomic heritability almost overlapped with those obtained by estimating genomic heritability. Extensive simulations in optimizing GRAMMAR showed that the lower bound of the given heritability may be lower for binary diseases than that for normally distributed ones (data not shown). This supports the fact that while implementing GRAMMAR-Lambda for binary diseases, genomic heritability is set to 0.5 by default or to the empirical heritability of traits, if available.
Calculation of GRMs with the sampling markers
When the genomic heritability is pre-specified to 0.5, we randomly took 3 K, 5 K, 10 K, 20 K, and 25 K SNPs from the entire genomic markers to analyze the simulated phenotypes controlled by the varied numbers of QTNs at the heritability 0.5 using all methods, except LTMLM which cannot sample SNPs to estimate heritability. Changes in genomic control at the varied sampling levels of SNPs are depicted in Figure 4 for GRAMMAR-Lambda, GRAMMAR, GMMAT, and SAIGE. No competing method can stably control the positive/negative false errors using less than 20 K sampling SNPs, and SAIGE for human phenotypes simulated. Specifically, GMMAT gradually controls the positive false errors when the sampling markers increased; GRAMMAR seemed to reduce the negative false rate by sampling less markers, while SAIGE produced a serious false negative error in the complex maize population. In comparison, GRAMMAR-Lambda still retained high statistical power to detect QTNs through perfect genomic control, even by using less than 3000 sampling markers (see Figure 4S and Figure 5S).
Application of GRAMMAR-Lambda to WTCCC study
We were authorized to collect the data of 11,985 cases of six common diseases and 3,004 shared controls, genotyped at a total of 490,032 SNPs from the Wellcome Trust Case Control Consortium (WTCCC) study 1 18. For each dataset, a standard quality control (QC) procedure was performed. SNPs with MAFs < 0.01 and HWE > 0.05 were excluded, and individuals with missing rates > 0.01 were also excluded. After the QC process, the number of samples and SNPs used for generalized mixed-model association analyses was 5002 individuals (1998 cases and 3004 controls) and 409,642 SNPs for bipolar disorder (BD), 4992 individuals (1988 cases and 3004 controls) and 409,516 SNPs for coronary artery disease (CAD), 5003 individuals (1999 cases and 3004 controls) and 409,924 SNPs for rheumatoid arthritis (RA), 5005 individuals (2001 cases and 3004 controls) and 409,742 SNPs for hypertension (HT), 5004 individuals (2000 cases and 3004 controls) and 40,9674 SNPs for type I diabetes (T1D), and 5003 individuals (1999 cases and 3004 controls) and 409,805 SNPs for type II diabetes (T2D). All data analyses were performed in a CentOS Linux sever with 2.60 GHz Intel(R) Xeon(R) 40 CPUs E5-2660 v3, and 512 GB memory.
For the six common diseases, we implemented GRAMMAR-Lambda in two ways: to estimate the genomic heritability and GBVs together using all genomic markers and to estimate only GBVs by randomly sampling 5,000 SNPs with a given heritability of 0.5. The Q-Q and Manhattan profiles for the six common diseases are shown in Figure 6S and Figure 7S, which depict GRAMMAR-Lambda and the four competing methods used in simulations. We concluded that (1) under perfect genomic control, GRAMMAR-Lambda found the QTNs for each disease on each chromosome, and the numbers of the detected QTNs were not less than all the competing methods; and (2) in GRAMMAR-Lambda, joint association analyses detected more QTNs than a test at once. Compared with GRAMMAR-Lambda, GRAMMAR detected less QTNs with the lowest genomic control among all methods, while GMMAT yielded more SNPs whose -log(p) exceeded the Bonferroni corrected thresholds for CAD, T1D, T2D, and HT, but it behaved the largest genomic control. Additionally, LTMLM estimated the abnormal genomic heritabilities for CAD, BD, T2D, and HT, producing an unstable genomic control.
For better estimating the genomic heritability, furthermore, we conducted strict QC for each dataset, as performed previously in 12. Despite this, the missing heritabilities could not be normally evaluated for BD and HT. As shown in Figure 8S and Figure 9S, all methods provided clear and comparable association results, except for GRAMMAR. Interestingly, both LTMLM and GMMAT seriously underestimated the genomic heritability for each disease after strict QC. GRAMMAR-lambda could extremely efficiently and robustly map QTNs for binary diseases and did not depend on the estimation of genomic heritability and QC for genomic datasets. For each dataset with standard QC, we recorded the running times from input of genotypes and phenotype to the output of mapping QTNs for all the methods. Table 2S shows that GRAMMAR-Lambda several times to dozens of times reduced the computing time by almost dozens of times with the lowest memory footprint.