PCA analysis indicated that all study subjects were ethnically East Asian (Supplementary Figure 1). The genomic inflation factor calculated using a set of 1767 negative control SNPs in regions included on Immunochip for studies of reading and writing disabilities, psychosis and schizophrenia was 1.03 (lambda(1000)=1.02). No evidence of statistical inflation is seen in the Q-Q plot (Supplementary Figure 2). After quality control and imputation, 15,748 SNPs across the MHC (from 25-35Mb, hg18) were available for analysis in 1,482 cases and 1,512 controls. Imputed HLA-B allele frequencies amongst controls in the current study were not significantly different from those in previously reported directly genotyped studies (P>0.05), confirming the high accuracy of HLA imputation, particularly at two-digit resolution [3].
HLA-B Associations
The strongest SNP association with AS observed was a missense variant of HLA-B, rs1071652 (SNP-B*31432180_CG, odds ratio (OR)=180, P=4.45×10-256, Figure 1). The previously reported east Asian HLA-B*27 tagSNP rs13202464 (31452562) [11] was also found significantly associated with AS (OR=58.73, P=1.92×10-211). Controlling for rs1071652, residual association is seen with HLA-B*27 (4.48×10-22) and SNP rs41553720 (SNP-B*31432843_A, P=3.87×10-31), indicating that combinations of SNPs are currently required to tag HLA-B*27 in East Asian populations, in contrast to the situation in European descent populations [12].
After SNP imputation in the MHC region, the expected strong association was observed with HLA-B*27 (odds ratio (OR)=205, P=5.76×10-244, Table 2). Controlling for the HLA-B*27 association and studying other HLA-B alleles, risk association is seen with HLA-B*40 at suggestive level (OR=1.65, P=2.54x10-4). Controlling for both HLA-B*27 and –B*40, no association was observed in 2-digit HLA-B allele with MAF > 1% (only rare alleles HLA-B*53 and HLA-B*38 were associated at suggestive level, P-values were 2.9x10-4 and 3.3x10-4, respectively).
At amino acid level the strongest association seen in uncontrolled analysis was with histidine at position 114 in HLA-B (P=7.24×10-241), followed by multiple HLA-B amino acids including lysine at 70 (P=1.49×10-237) and asparagine 97 (P=2.51×10-237) (Table 3). Asparagine 97 and histidine 114 were previously reported to be the main amino acid determining HLA-B associations with AS in European-descent and Korean populations respectively [1, 2].
Conditional analyses for these individual amino acids and their combinations reveals that only association of histidine 114 can be attenuated by conditioning on asparagine 97 (Table 4). No other individual amino acid explains the association of the other amino acids. For HLA-B alleles, both combinations of lysine 70 + asparagine 97 and lysine 70 + histidine 114, but not asparagine 97 + histidine 114, controlled for association with any other 2-digit HLA-B allele (P>0.0017, correcting for 30 2-digit HLA-B alleles tested). Controlling for all of lysine 70, asparagine 97 and histidine 114, the strongest HLA amino-acid association remains with several positions including HLA-B position 97 (serine, found on HLA-B*7, *8, *15, *2707, *40, *41, *48; OR=2.14, P=2.14×10-5).
Controlling for HLA-B*27 alone or in combination with HLA-B*40 did not fully control for the association of asparagine 97, lysine 70 or histidine 114 (P<5x10-8 for both analyses, Table 3).
Non-HLA-B susceptibility loci in the MHC.
Considering HLA alleles other than HLA-B, several HLA -A, -C and Class II alleles showed significant associations (Table 2). Controlling for the HLA-B*27 association, independent risk association was confirmed with HLA-C*15 (OR=2.13, P=9.30×10-19) and HLA-C*1502 (OR=7.62, P=6.78×10-23), both associations at amino acid level tagged by a leucine 116 in HLA-C (P=6.61x10-21) located in the HLA-C epitope binding groove. Stepwise conditional analyses on both HLA-B*27 and -C*15 demonstrated significant associations with HLA-DQB1*04 (OR=2.13, P=7.91×10-5) after correcting for multiple comparisons (499 signals across MHC, as defined by regions with LD r2<0.2, Bonferroni correction threshold=10-4). Conditioning on both HLA-B*27 and HLA-B*40, association was confirmed with HLA-C*15 (OR=4.97, P=9.88×10-17) and HLA-DQB1*04 (OR=2.42, P=1.86×10-6).
ERAP1 variants in association with AS.
The key ERAP1 variant associated with AS is rs30187 (ccc-5-96150086-T-C, chr5:96150086[hg18], encoding K528R) [4, 13](Table 5). It has previously been observed in European populations that the association with the variant rs30187 in the ERAP1 locus is restricted to HLA-B*27-positive subjects, or HLA-B*40-positive, HLA-B27-negative subjects, consistent with epistatic interactions. Here we investigated the possibility of interaction between the HLA-B*27, HLA-B*40 alleles and the previous reported tag SNP of ERAP1 locus (rs30187) [4]. When testing for interaction with the HLA-B*27 alleles, we found that rs30187-A risk allele increased the risk of disease in the strata where HLA-B*27 was present (OR=1.29; P=2.71×10-6) (Table 5), but no association was seen in HLA-B27-negative cases (OR=1.06, P=0.61). No evidence of interaction was observed between rs30187 and the HLA-B *40 allele, although the power to identify this was low as the number of HLA-B27-negative cases was low.