Next, we compared the geographic distribution and allele ages of the most frequent 7 ACE variants that are associated with Alzheimer’s diseases (in the Table 2) based on the 1000 Genomes Project (1KGP), African Genome Variation database (AGVP), and the gnomAD datasets. The results are summarized in Table S3 and indicate that four are globally rare variants occurring at ~ 1% or lower frequency across continents in both 1KGP and gnomAD datasets. Two of these, T887M (rs3730043) and R1250Q (rs4980), had similar frequencies in Africa and in Europe in the 1KGP dataset. N1007K (rs142947404) was found only in two of the 1KGP populations, Italian (TSI) and Peruvian (PEL), leading to the speculation of a possible link between the distribution of the variant to South European gene flow to South Americas during the last millennia. However, due to the rarity of this variant much larger and targeted datasets are required for further characterization. Variant E738K (rs148995315) was absent in 1KGP and was extremely rare and restricted only to European ancestry populations in the much larger gnomAD dataset.27
The other three SNPs demonstrated a trend towards being African-specific, having several fold higher frequencies in African populations compared to all other continents. The allele ages of these variants (Table S3), estimated using the Human Genome Dating Atlas, suggest that at least two of these evolved prior to the out-of-Africa migration but remained in the continent until recently. The most dramatic of these is R1257S ACE mutation (rs4364), which occurs in approximately 13% in those of African descent but is absent or extremely rare in populations from the other continents. This variant exhibited almost two-fold higher frequency in West African Niger-Congo speakers (MSL and GWD) compared to Niger-Congo speakers form Central-West (YRI and ESN) and East Africa (LWK and Baganda) (Figure S1). Moreover, the much lower frequency (about 2%) of these variants in the non-Niger-Congo speaking Ethiopian population further highlights the extreme intra-continental variation of this SNP.
SNP D563G (rs12709426) demonstrated a similar trend (shown in Table S3, Figure S1) of higher frequency in West African populations, although the differences in allele frequencies within the continent was not as extreme. In contrast, A232S (rs4303) exhibited frequency differences within each of the African geographic regions rather than between them.
Recent whole genome studies, especially those based on African populations, have shown that the survey of global distribution of variants may identify curated disease-related variants (such as those in the ACMG list) in need of revisions.28 Given the commonality of the three ACE variants described above in Niger-Congo speakers and the absence of known elevations in AD in Africa, specifically in West Africa, suggests that associations of these three SNPs to AD is highly questionable. However, it remains possible that these variants may confer increased risk in conjunction with other variants such as the Epsilon allele of APOE.20 Moreover, their extreme differentiation within the continent suggests that at least some of these alleles may be associated to one or more key biological functions. Further investigation is needed to evaluate these possibilities.
We also attempted to trace these possible AD-associated ACE mutations (Table 2) to the published Neanderthal genomes (https://neandertal.ensemblgenomes.org/index.html), focusing on the four European related variants (rs3730043, rs4980, rs142947404, rs148995315). None of these variants were found in the few Neanderthal genomes available. Given the pattern of distribution within Europe and the extremely low frequency of variants, much larger datasets of Neanderthal, ancient, and current Europeans genomes would be required for investigating the direction of flow of these alleles within the continent.
Localization of AD-associated ACE mutations and possible mechanism of its action.
Blood ACE levels associated with these ACE mutations are listed in Table S2 and Table 2. These data combined with previously published literature suggest several possible mechanisms for the association of these ACE mutations with AD.
1) Low levels of ACE expression: This possible mechanism is the most simple and straightforward. Low ACE expression (and activity) occurs because of one non-functional allele in ACE gene, or due to indels or stop-codons (Group II in Table S2 with 131 mutations). A convincing example of this mechanism is provided by 6 such mutations described above.25 However, the combined frequency of such ACE mutations is not high – less than 0.2% in the general population. Gene therapy (CRISPR-based) may theoretically help such patients but is not currently available.
2) Low levels of ACE activity: This potential mechanism realysed when mutations occur in the active center residues - discussed in29. However, only such mutations that occur in the N domain active center are likely to be associated with AD by this mechanism. Only the N domain active center hydrolyzes Aβ42, which is an important step for amyloid deposition.4
3) Low levels of surface ACE protein expression: Another potential mechanism involves mutations that result in decreased surface ACE expression (similar to what we described previously for the Q1069R mutation11). Examples include mutations in the C domain, such as E738K, T887M and N1007K (mature somatic ACE numbering listed in Table 2 and shown on Fig. 1), as well as many others shown in Group III (missense mutations) of Table S2. These mutations could contribute to AD through decreased ACE function. Other examples of such transport-deficient ACE mutations in the C domain are R828H (rs146089353) and R1180P (rs5381166970), which are confirmed to cause low ACE surface expression.15,30 Some of these mutations may be detected using mAbs directed to the epitopes involving these mutations (e.g., T887M using mAb 8H1 as shown in Fig. 4 in31), potentially providing a clinically useful diagnostic test.
It is more complicated to determine the possible mechanisms for AD-associated ACE mutations localized in the N domain. We previously identified an ACE mutation in the N domain that results in lower blood ACE activity and is likely transport-deficient: rs373025 (Y215C).10 This mutation is occurring with fairly high frequency–1068 per 100,000 (Table 2, Table S2) and has been associated with Alzheimer’s disease.23,24 The predicted negative effect of the Y215C substitution on ACE expression (Table S1), including PolyPhen-2 (Table S2), is supported by the significant decrease in blood ACE levels we observed when ACE phenotyping the carriers of this mutation was performed.10 These observations in combination with the abovementioned studies prompted us to hypothesize that in addition to Alzheimer’s disease development, genetically determined low ACE expression may be a contributing factor for systemic sclerosis/scleroderma pathophysiology in some patients. Deficiency in ACE due to ACE inhibition, genetic manipulation, and/or mutations also causes other severe disease phenotypes, including defects in fetal development, hypotension, inability to concentrate urine, structural renal defects, anemia, and reduced male fertility (reviewed in9). Therefore, it is logical to hypothesize that some individuals with low (borderline) values of ACE expression in utero due to this Y215C mutation, may develop kidney abnormalities during embryonic development (similar to the effects of taking ACE inhibitors during pregnancy32). As a result, patients with such ACE mutations may have a decreased rate of survival into their 70s, when late onset Alzheimer’s disease is most likely to become clinically detectable.
R230H is another example of a transport-deficient ACE mutation in the N domain (rs370903033, listed in Table S2) that is associated with low ACE activity15 as is likely A232S (rs4303, listed in Table 2 with localization shown on Fig. 2). Based upon comparison of the hinge regions found in “open" and "closed” forms of ACE (described in33), the A232S mutation is unlikely to have a pronounced effect on ACE enzymatic machinery or binding of ACE substrates. This mutation is on the surface in the region that does not move during opening and closing. The effect on substrate binding is likely to be minimal because A232 is relatively distant from the binding site for regular (short) peptides. Binding of longer substrates (such as Aβ42) may be affected, but the effect is expected to be minimal due to the small difference in bulk between alanine and serine sidechains. The A232S mutation also is unlikely to affect the important salt bridge between R231-E590.30 The mutation is on an adjacent side of the helix that includes R231, points aways from R231, and is too small to cause substantial interference with the salt bridge between R231 and E590. A232 is located in a hydrophobic cavity formed by M267 and V268. Hypothetically, this A232S mutation could affect the strength of hydrophobic interactions at this site by reducing the hydrophobic contact and forming a new hydrogen bond with M267 (C = O) in an adjacent short helix. How this effect may alter the protein/epitope shape/interactions is difficult to predict without additional extensive computational simulations (and blood samples from carriers of this mutation are not available for our analysis).
Additional ACE mutations that cause transport deficiency are likely to be identified among 1047 missense ACE mutations listed in Table S2. GWAS analysis, which was instrumental in finding AD-associated ACE mutations (listed in Table 2), can identify only correlations between common variants with lower impact on risk for AD and cannot determine rare coding variants with high pathogenicity.25 Fifty-one Group I mutations in the signal peptide (Table S2) also relate to transport-deficiency. We analyzed blood ACE levels (described in Supplementary Material) from a study quantifying 4907 plasma proteins (including ACE) in 35,559 Icelanders using SomaScan technology.34 There were 22 missense ACE mutations found in this Iceland cohort, including at least 3 that were associated with substantial decreases in blood ACE levels (Table S4 and Fig. S2). It is likely that several ACE mutations from this list will be transport-deficient, including mutation Y215C (rs37300025)10), which occurs in more than 1% of the general population.
The combined frequency of all transport-deficient ACE mutations may be significant (more than 1.5% in the general population), which theoretically may be treated using a combination of chemical (sodium butyrate) and pharmacological (central-acting ACE inhibitors) chaperones and proteosome inhibitors to restore surface expression of the mutant ACE.11 We speculate that a positive effect on cognitive functions of central acting ACE inhibitors in some patients with AD35 may be explained by restoration of the impaired transport of the unrecognized mutant ACEs to the cell surface by ACE inhibitors, which are known to be effective pharmacological chaperones for transport-deficient ACE mutants.11
4) Impaired Aβ42 cleavage: In addition to transport-deficient ACE mutations that may be detected by lower blood ACE activity (e.g., Y215C10), other ACE mutations in the N domain could demonstrate normal ACE activity with conventional (short) substrates but have compromised binding and hydrolysis of longer substrates, such as Aβ42. An example of such divergence between Aβ42/Aβ40 converting activity and angiotensin-converting activity was demonstrated just recently for cells expressing ACE and presenilin 1 mutations. PS1 deficiency (quite unexpectedly) impaired both Aβ42/ Aβ40 converting activity and angiotensin-converting activity. Some PS1 mutants restored angiotensin-converting activity, but were not able to restore Aβ42/ Aβ40 converting activity.36
It is intriguing to hypothesize that patients with the D563G mutation (and possibly others in the N domain) may be associated with AD because this mutation is likely to specifically decrease Aβ42 cleavage (or binding) by the N domain active center due to significant conformational changes. However, this mutation may not prevent cleavage of conventional substrates by the C domain active center. There are two relevant observations. First, D563 is located on the side opposite to the “lid” opening of the protein, which is where ACE substrates enter the binding site (Fig. 3). Second, there are several acidic and basic residues in proximity to D563 that form an extensive network of electrostatic and hydrogen bond interactions. Although D563 is excluded as part of any hinge region,37 most likely because it does not move, D563 still may play a key mechanistic role by serving as a rigid spring that holds the bottom and the top portions of the ACE “jaws” and provides the necessary plasticity to allow binding of the substrates and their release after hydrolysis is complete. The D563G mutation leads to a loss of multiple interactions and conformational rigidity because glycine does not have a sidechain, likely resulting in a loss of stiffness of the “spring” and disruption of the catalytic machinery.
Figure 3B demonstrates that the D563G mutation is located in the center of the epitope for the potent anti-catalytic antibody 3A5.38,39 In the contrast to classical anti-catalytic mAbs which bind to the closed entrance of N domain active center for substrates,39 the anti-catalytic effect of mAb 3A5 is based on conformational changes in the whole N domain. After binding of this mAb to the N domain, no other mAbs could interact with the N domain.38 Binding by mAb 3A5 in the region of D563 likely disrupts the dynamics of “lid” opening and closing that leads to a conformation of ACE that is not recognized by other antibodies. Because these mAbs38 were developed for the conformation of ACE where the D563 “spring” functions normally, they are unable to recognize the new conformation of ACE caused by the binding of mAb 3A5 to the D563 area. Based upon this information, we also can predict that mAb 2H4 changes binding to this mutant (D563G) (Fig. 3B, see also Fig. S4 in31).
5) ACE mutations in the cytoplasmic tail: Patients with R1250Q and R1257S mutations localized in the cytoplasmic tail of ACE could be associated with AD pathophysiology by another mechanism that may include impaired phosphorylation signaling.40 To understand the effect of these mutations on ACE interactions with the membrane, we performed MD simulations of the transmembrane and cytoplasmic domains of ACE (Asp1222–Ser1277) in 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) lipid membrane for WT ACE and both R1250Q and R1257S mutants (Fig. 4).
In WT ACE, the average angle between the transmembrane helix, which in this simulation spans from Gln 1224 to Leu 1247, and the Z-axis of the lipid bilayer is 16 degrees (Fig. 4B). In the R1250Q mutant, the transmembrane helix shifts toward the extracellular space and spans from Arg 1227 to Gln 1249, and its angle with the membrane bilayer is 23.5 degrees. (Fig. 4C). Unlike in WT ACE, the neutral, albeit still polar, glutamine sidechain of Q1250 is much less effective at interacting with the negatively charged phosphate groups, allowing the transmembrane helix to tilt until it reaches the next positively charged sidechain at R1255. In the R1257S mutant, the transmembrane helix is formed between R1227 and L1247, whereas its average angle with the lipid bilayer is 8 degrees. (Fig. 4D). The rationale for this decrease in the angle in R1257S mutant is not immediately obvious.
Visual analysis of the MD trajectory suggests that the charged residues in the cytoplasmic portion of WT ACE form coulombic interactions with the phosphate groups of POPC. This is expected because the cytoplasmic portion of WT ACE contains five arginine residues, R1250, R1255, R1257, R1261, R1275, and only two glutamic acid residues, E1271, E1273. Distribution of the proximal positive charges of the arginine residues on the inner surface of the membrane is likely to create a lateral tension, leading to a 16-degree tilt in the orientation of the helix.
In the R1257S mutant, the transmembrane helix remains anchored at R1250 on the inner part of the lipid bilayer, similarly to what occurs in WT ACE. We hypothesized that because the number of positive charges in the R1257S (and R1250Q) mutant is reduced by one (out of 5 positive and 2 negative residues), the requirements for membrane association of the charged residues in the cytoplasmic portion may be relaxed, making the interactions between the transmembrane domain of ACE and the lipid bilayer more dominant relative to those of the cytoplasmic domain. To investigate this possibility, we analyzed the distances between Q1225, located in the extracellular portion of ACE, and the amino acids with charged sidechains (R1250, R1255, R1257, R1261, E1271, E1273, R1275) in the cytoplasmic region of ACE. We observed that during MD simulation, these distances in both the mutants are greater, and overall distribution is less focused, suggesting that their effect on the transmembrane helix orientation is indeed diminished. A representative example is shown in Fig. S3 for the distances between Q1225 and R1275 in WT ACE and both mutants.
These findings suggest that there is crosstalk between the extracellular and cytoplasmic portions of ACE. Even for mutations located in the cytoplasmic portion of ACE, the geometry of the extracellular N and C domains may be affected with respect to (1) the distance from the membrane, (2) the orientation of the domains relative to the membrane, and (3) the degree of dimerization. Hence, despite their cytoplasmic location, these mutations are likely to have direct effects on the conformation of ACE at the membrane, and possibly on the extent of ACE dimerization Thus, they are predicted to alter ACE catalytic activity and substrate specificity, especially involving long substrates such as Aβ42.
Increased female susceptibility to Alzheimer’s disease was recently reported in the carriers of one of the ACE mutations in the cytoplasmic tail - R1250Q (rs4980). Twelve of the 13 AD patients with this mutation were women.20 We have detected significant differences in the conformation of urinary ACE in men compared to women, which may be caused by differential glycosylation (specifically, sialylation) of kidney ACE (i.e., the source of ACE in urine).41 Protein glycosylation is involved in proper protein folding, protein quality control, transport of proteins to specific organelles and sensitivity to shedding,8 including for ACE, which has ACE 17 potential glycosylation sites. The sex-specific differences in tissue ACE glycosylation that we have identified41 may be associated with differential disease susceptibility. One example is provided by structural differences between male and female ACE in certain neurons. Differences in ACE glycosylation in brain (striatal ACE) and lung ACE may be responsible for significant differences in substrate specificity of some brain-specific ACE substrates (i.e., substance P and substance K), at least in rats.42 Therefore, it is logical to suggest that the R1250Q mutation may significantly impair Aβ42 cleavage in female carriers of this ACE mutation but not exhibit a similar harmful effect in males.
In addition to the increased female prevalence in AD patients with the R1250Q ACE mutation, the lifetime risk for AD is nearly twofold greater in woman than in man.43 Therefore, we speculate that gender-specific differences in ACE glycosylation, such as we have identified,41 may occur in other LoF ACE mutations as well. An intriguing possibility raised by this hypothesis is that the increased prevalence of functional pain syndrome in women44 may be caused in part by these gender differences in tissue ACE because one of the pain mediators, substance P, is a specific substrate for brain ACE.42
Conformational fingerprinting of ACE in human brain.
We previously demonstrated that the mAb binding pattern to ACE derived from different organs is tissue-specific and determined by alterations in ACE glycosylation/sialylation that differently occur in different organs/tissues-i.e., the concept of ACE conformational fingerprinting.45 To characterize brain ACE, we compared the “conformational fingerprint” binding patterns of 22 mAbs to different epitopes on the ACE N and C domain31 with that for our “gold standard” -lung ACE, derived from endothelial cells,45 and with kidney ACE, which originates from proximal tubule epithelial cells (Fig. 5). Substantial differences in the conformational fingerprints of ACE from brain homogenates compared to lung homogenates (Fig. 5A) may be attributed to differential ACE glycosylation in the endothelial cells of the lung and in ACE-positive cells in frontal cortex of the brain (endothelial cells and neurons.8,9 Specific glycosylation sites in the ACE protein include Asn82 (localized in the epitopes for mAbs 3A5/3G8/5B3), Asn45 (mAbs 6C8/6H6/2D1) and perhaps Asn131 (epitope for mAb 2D7), Asn685 (epitope for mAbs 1E10/3C10/4E3/2H9), perhaps Asn913 (possibly in the epitope for mAb 8H1) and Asn1196 (epitope for mAb 4C12)-see epitope mapping previously described in31.
A surprising observation is that the conformational fingerprint of brain ACE is very similar (although not identical) to the conformational fingerprint of kidney ACE (Fig. 5B), This similarity contrasts with previously described differences in mAb binding patterns (i.e., likely glycosylation pattern) between ACE from other epithelial cells, specifically from prostate and from adrenal glands, which have embryonic origin from neural crest (Fig. 3A and B in45). Theoretically, differences in mAb binding patterns in brain ACE compared to lung ACE may be due to differential ACE glycosylation in the brain endothelial cells and cortical neurons (the source of ACE in the brain) compared to the lung. Moreover, brain ACE demonstrated an additional, lower MW ACE band42, 46 with an alternative glycosylation pattern.46
Additional support for the hypothesis that glycosylation (and perhaps more specifically, sialylation) may determine the observed differences in mAb binding to brain (and kidney) ACE versus lung ACE (Fig. 5) is derived from calculation of the 2H9/2D1 binding ratio. We recently determined that this ratio is a marker of the extent of ACE sialylation at certain glycosylation sites in ACE. Binding of mAbs having Asn45 or Asn117 in their epitopes was dramatically decreased if these glycosylation sites were sialylated.41 This prior study demonstrated that the 2H9/2D1 binding ratio for female urinary ACE (~ 10) was 4-fold higher in comparison to male urinary ACE, while the 2H9/2D1 binding ratio for female and male lung tissues were similar and relatively low (about 3.0). In the present study, we observe that the 2H9/2D1 binding ratio for human male brain and kidney ACEs are extremely low at approximately 4-fold less than for human male lung ACE (Fig. 5, insert). This observation matches well with the established fact that lung ACE contains ~ 6-fold more sialic acids than kidney ACE.47 Therefore, we conclude that the similarity in conformational fingerprints for brain and kidney ACEs (Fig. 5) are attributed primarily to differences in ACE sialylation.