We performed a gene-based analysis examining the association between missense/LoF and brain-specific enhancer and promoter variants in previously-identified AD-associated genes and seven measures of cognitive function in South Asians across India. Using only missense/LoF variants, three genes were associated with at least one measure of cognitive function after multiple testing correction, including APOE with multiple cognitive measures. However, no genes were associated in the brain-specific promoter and enhancer analysis. The most strongly associated variants were missense SNVs with high predicted deleteriousness. One of the most significantly associated missense variants was very rare, yet appeared to be enriched in LASI-DAD compared to EA samples in public databases.
We found that APOE is significantly associated with HMSE score, general cognitive function, executive function, and orientation in the missense/LoF analysis in Model 1. Apolipoprotein E (APOE) facilitates cholesterol and phospholipid transfer between cells, and complexes with amyloid β proteins in the brain for removal, inhibiting the amyloid β plaque formation necessary for AD onset [38]. APOE alleles confer different risks for Alzheimer’s disease. Relative to the ε3 allele, ε4 is associated with increased risk of Alzheimer’s disease in EA [39], and the ε4/ε4 genotype is also associated with cognitive decline in those with Alzheimer’s disease [40]. In our analytic sample from LASI-DAD, the ε4 allele frequency is estimated to be approximately 10.9%, which is less than reported frequency among EA samples in the US (14%) [41] and while not common is still frequent. The ε4 allele has somewhat different associations with AD risk across races/ethnicities, with ε4 and associated variant effects being stronger in EA populations compared to African-Americans [42]. Rs429358 is used to differentiate between ε3 and ε4 alleles in APOE, and has a CADD score of 16.6, which places it in the top 2nd percentile of deleterious SNVs. Although previous studies with a subset of the current LASI-DAD sample did not find an association between cognitive function and rs429358 [17, 43], this was likely due to smaller sample size and/or less regional variation in the previous studies. Our reported associations with APOE are not surprising as working memory and executive function deficits are often early markers of AD [44].
Phosphatidylinositol binding clathrin assembly protein (PICALM) facilitates endocytosis of APP [45], which is needed to form β-amyloid plaques that lead to AD. PICALM was found to be associated with cognitive function in EA samples [46]. PICALM variants identified in EA have had mixed associations in East Asian samples, with which the South Asian population of India shares ancestry [47]. For example, some variants identified as associated with AD in a large meta-analysis of Chinese GWAS near PICALM [48]were not found to be associated in smaller Indian studies [49]. We found that the sentinel SNV in PICALM for HMSE score is rs779406084, a very rare variant with a high CADD score at 24, placing it in the top 0.2th percentile of all deleterious SNVs. To our knowledge, this is the first study that reports an association with rs779406084 and cognitive function. This is likely due to the rarity of this variant in EA samples (MAF = 1.5x10− 5) and an association was likely found in our study due to its comparatively higher frequency in LASI-DAD (MAF = 7.5x10− 4). Further work is needed to elucidate the specific effects of this variant on the protein and replication in other cohorts is needed.
Translocator protein (TSPO) associated protein 1 (TSPOAP1) regulates calcium channels in nerve synapses [50]. It interacts with the protein TSPO which is involved in inflammation pathways [51]. TSPOAP1 variants were associated with AD in a large transethnic AD GWAS [52]. The sentinel SNV of TSPOAP1 in LASI-DAD was rs9913145, which has a CADD score of 1.12 indicating that it is not strongly deleterious. This variant is slightly more common in LASI-DAD (MAF = 0.15) compared to EA samples (MAF = 0.12). To our knowledge, this variant has not otherwise been reported to have an association with cognitive function or dementia. Given the relatively low CADD score of the variant and the relatively common frequency in EA samples, this variant may tag a haplotype specific to South Asians within TSPOAP1 that is associated with executive function.
We found no associations between brain-specific promoter and enhancer SNVs within the known AD genes and any of the measures of cognitive function in our sample after multiple testing correction. This is likely due to promoter/enhancer SNVs having more subtle effects on AD gene expression compared to the potentially more deleterious effects from missense/LoF SNVs. We also found that annotation weights did not substantively change our analysis results. This may be because the missense/LoF variants had relatively small variance in their annotation weights and tended to be high, resulting in little additional statistical information.
We found many genes that were nominally associated with each measure of cognitive function in the missense/LoF analysis and brain-specific promoter/enhancer analysis. Genes associated with multiple cognitive measures include ADAM17, OTULIN, and ABCA7, which are involved in amyloid-β metabolism or in immune signaling [38, 53, 54]. In both Model 1 and Model 2, ABCA7, ADAM17, APOE, OTULIN, and TSPOAP1 were all at least nominally associated with three or more measures of cognitive function. These genes all play a role in cholesterol and APP metabolism (ABCA7, ADAM17, APOE) or are involved in inflammation pathways (OTULIN, TSPOAP1).
Similar gene-based analysis studies were conducted in EA samples. One large genome-wide gene-based AD study conducted in the UK BioBank on different categories of rare missense/LoF variants found that three gene regions were associated with AD parent proxy cases, including TOMM40/APOE [55]. Notably, detection of these regions depended on resolving variants into categories of high confidence and predicted loss-of-function effects. Another gene-based AD analysis conducted in the ADES-FR study found that protein-truncating rare variants and strictly damaging rare variants in TREM2, ABCA7, and SORL1 were associated with early-onset AD, but not with late onset AD [56]. Given that the previous studies focused on very specific classes of rare variants in EA samples, it is no surprise that these genes were also at least nominally associated in our study.
Although many genes were nominally associated in our analysis, few genes were significant after correction for multiple testing. This could be in part because we selected genes associated with AD, which may have weaker effects on cognitive function changes that precede AD. Further, the genes were identified through GWAS which excels in identifying primarily non-functional common variants which may be correlated with causal variants. In this study, > 95% of our analyzed variants were rare (MAF < 5%). Although we focused on variants more likely to be causal, it is possible that the sentinel SNPs identified in the GWAS were not tagging variants in the functional classes we examined, and that more genes would have reached significance if common, non-functional variation was included in our analysis. Further, the genes identified were in large cohorts of EA. Genetic differences between EA and South Asians, including allele frequency and linkage disequilibrium, could have contributed to the lack of findings. Another explanation is that genetic associations with cognitive function may be attenuated in this population due to more heterogeneity in environmental factors across India, such as sociodemographics, socio-cultural factors, and air pollution, each of which is associated with cognitive function [57–61]. Finally, the tendency toward associations being nominally significant, but not significant after multiple testing correction, may be a result of the smaller sample size.
One limitation of this study is that we examined only two classes of functional annotations. It may be that variants with other functional consequences besides missense/LoF and promoter/enhancer variants could influence associations with cognitive function. Another limitation is that we could not include insertion/deletion variants in this analysis, as annotation weights are not available; however, it is possible that these variants may have more deleterious effects on proteins and their removal may have attenuated signal. Additionally, the LASI-DAD cohort design oversamples LASI participants with higher cognitive impairment risk, which may result in different observed genetic associations with cognitive function compared to studies sampled in other ways [17]. Finally, cognitive measures may have been biased due to administering the tests in many different languages [43]. However, no systematic bias with respect to language has been detected in LASI-DAD [62].
Our study also has several strengths. The prioritization and aggregation of SNVs based on their actual or predicted functional consequences likely increased signal for associations between the genes and cognitive function by focusing on variants that are more likely to have causal effects. Gene-based analysis with functional annotation also more directly links SNVs disease etiology, allowing a greater understanding of the types of variation within these genes that contribute to cognitive function. Additionally, to our knowledge, our study is the first to examine gene-based, rare variant associations with cognitive function in South Asians living in India. Thus, this work addresses an important health disparity in an understudied population [63]. Furthermore, our cohort presented a unique genetic environment to study potentially novel associations with understudied genetic variants due to its large genetic heterogeneity, unique subpopulations, and unique genetic ancestry [43, 47, 64, 65]. Finally, we examined several measures of cognitive function, which allows us to determine which specific cognitive domains are associated with each gene.
In conclusion, we found that three genes (APOE, PICALM, and TSPOAP1) associated with Alzheimer’s disease in EA are also associated with measure of cognitive functions in South Asians living in India, with the association primarily driven by missense/LoF SNVs. Associations were in part driven by rare, deleterious alleles, including a very rare SNV enriched in LASI-DAD compared to EA. Future functional studies are needed to verify and characterize SNVs found within this study.