The demographic and clinical characteristics of 1031 study participants included in our association study are presented in Table 1. All participants were genotyped by Affymetrix 500 k SNP Array. A total of 607,153 SNPs passed through quality control (Supplementary Figure S1). These SNPs filtered minor allele frequency of < 1% and a call rate of < 95%.
To demonstrate that there is no genetic stratification in the population, we performed a principal component analysis on the SNPs of all participants. The first two principal components show absence of population structure (Supplementary Figure S2). To identify susceptibility SNPs for HBV infection, we performed a GWAS in HBV infection similar with previous design [8, 9]. HBV clearance was used as a control group versus ASPI, CHB, DC, HCC as HBV chronic infection (case group). We observed associations of two novel MHC loci with progression to certain HBV stages (SNP: rs2395166, Gene: HLA-DRA, P = 1.42 × 10− 7; SNP: rs615672, Gene: HLA-DRB1, P = 8.54 × 10− 7) and two reported MHC locus (SNP: rs3077, Gene: HLA-DPA1, P = 6.60 × 10− 9; SNP: rs9277542, Gene: HLA-DPB1, P = 1.53 × 10− 8) (Table 2; Fig. 1). These MHC loci variants replicated association results of previous studies affirming that MHC gene alleles confer risks of susceptibility of HBV infection in East Asian. Interestingly, we founded that these reported MHC loci (rs2395166:C, rs615672:G, rs3077:A, rs9277542:T, rs9277341:T) present significant differences in allele frequency between East Asian and non-East Asian population in gnomAD database (Table 3), as well as the differences between HBV infection group and HBV clearance group. Since different groups may not present an identical minor allele, here, we used the derived allele against the ancestral allele for studying the allele frequency across different populations. The derived allele frequencies in East Asian are much closer to the HBV chronic infection group, while other populations, such as European, are much closer to the HBV clearance group. These genetic differences may suggest a selective signal in non-East Asian population versus East Asian population. To confirm this, we firstly build a phylogenetic tree based on these loci and then showed the genetic diversity in world-wide populations, in which the East Asian population is at the root. We set the East Asian as the ancestral group in these loci according to the derived allele frequencies and the phylogenetic tree. Subsequently, we identified two strong phylogenetic signals (HLA-DPA1, HLA-DPB1) in the European population (Fig. 2) via FST method. Haplotype bifurcation diagrams of the two core SNPs (rs3077, rs9277542) presented that the resisted allele lead to a long-range, and a high frequency homozygosity in European population (Fig. 3), confirming the natural genetic selection. These evidences revealed that the resisted alleles were under positive selection in European population strongly. We estimated the historic population size and then showed these two loci (HLA-DPA1, HLA-DPB1) were under selection during the past 26,000 years (Supplementary Figure S3). These results may provide a context for the racking influence of HBV infectious diseases in history.
Table 2
The significance of HBV-related outcomes study
Case-control studies | SNP | Gene | P value | OR | Minor Allele | Minor Allele Frequency | Report | λ |
Case (n) | Control (n) | Case | Control |
Infection (587) | Clearance (275) | rs2395166 | HLA-DRA | 1.42 × 10− 7 | 0.4534 | C | 0.1269 | 0.2182 | MHC region | 1.003 |
rs615672 | HLA-DRB1 | 8.54 × 10− 7 | 0.5697 | G | 0.405 | 0.5276 | MHC region |
rs3077 | HLA-DPA1 | 6.60 × 10− 9 | 0.5007 | A | 0.2675 | 0.4145 | (Kamatani et al., 2009) |
rs9277542 | HLA-DPB1 | 1.53 × 10− 8 | 0.5353 | T | 0.3735 | 0.5347 | (Kamatani et al., 2009) |
CHB (93) | ASPI (92) | rs1264473 | GRHL2 | 1.57 × 10− 6 | 3.931 | C | 0.4402 | 0.1957 | Novel | 1.052 |
HCC (214) | CHB (93) | rs2833856 | EVA1C | 1.62 × 10− 6 | 0.3515 | C | 0.2243 | 0.4086 | Novel | 1.022 |
HCC (214) | DC (188) | rs4661093 | ETV3 | 2.26 × 10− 6 | 2.841 | A | 0.2104 | 0.0882 | Novel | 1.022 |
Abbreviations: OR, odds ratio; ASPI, asymptomatic persistence infection; CHB, chronic hepatitis B; DC, decompensated cirrhosis; HCC, hepatocellular carcinoma; MHC, major histocompatibility complex; λ: statistic of genomic control. |
Table 3: Divided allele frequency of significant SNPs in MHC region
SNP
|
Study
|
Population
|
Derived Allele
|
Derived Allele Frequency
|
|
P value
|
Case-control studies
|
gnomAD
|
|
East Asian vs non-East Asian
|
Infection
|
Clearance
|
Healthy
|
East Asian
|
non-East Asian
|
|
rs2395166
|
our study
|
Chinese
|
C
|
0.127
|
0.218
|
0.183
|
0.128
|
0.367
|
|
4.367×10-195
|
rs615672
|
our study
|
Chinese
|
G
|
0.405
|
0.528
|
0.512
|
0.388
|
0.590
|
|
6.003×10-111
|
rs3077
|
our study
|
Chinese
|
A
|
0.268
|
0.415
|
0.379
|
0.283
|
0.723
|
|
0
|
Kamatani et al., 2009 [5]
|
Japanese
|
0.245
|
-
|
0.392
|
|
Guo et al., 2011 [55]
|
Chinese
|
0.314
|
0.447
|
0.443
|
|
Nishida et al., 2012 [56]
|
Japanese, Korean
|
0.213
|
0.393
|
-
|
|
Wong et al., 2013 [57]
|
Southern Chinese
|
0.206
|
0.276
|
0.288
|
|
rs9277341
|
our study
|
Chinese
|
T
|
0.142
|
0.242
|
0.196
|
0.159
|
0.582
|
|
0
|
Guo et al., 2011 [55]
|
Chinese
|
0.133
|
0.237
|
0.237
|
|
rs9277542
|
our study
|
Chinese
|
T
|
0.374
|
0.535
|
0.482
|
0.339
|
0.627
|
|
2.005×10-225
|
Kamatani et al., 2009 [5]
|
Japanese
|
0.246
|
-
|
0.437
|
|
Abbreviations: East Asian, East Asian population in gnomAD database; non-East Asian, combined all other population excepted East Asian in gnomAD database; P value, compared allele frequency between East Asian and non-East Asian population via fisher exact test. Derived Allele was accessed from human ancestral genome (Ensembl-59).
To identify new susceptibility locus for HBV-related outcomes, we performed association studies for CHB, DC, and HCC. Significantly, we observed three associated gene SNP loci: 1) (SNP: rs1264473, Gene: GRHL2, P = 1.57 × 10− 6) associated with CHB versus ASPI; 2) (SNP: rs2833856, Gene: EVA1C, P = 1.62 × 10− 6) associated with HCC versus CHB; and 3) (SNP: rs4661093, Gene: ETV3, P = 2.26 × 10− 6) associated with HCC versus DC (Table 2; Fig. 1). No SNP associated with DC versus CHB were apparent.
HBV clearance, ASPI, CHB, DC, and HCC are progressive stages post HBV infection [4]. We hypothesized that the host genetic factor contributes to the development of outcomes, as well as to the individual outcome. To investigate this hypothesis, we test two progressive stages upon HBV infection: 1.) HBV infection itself (CHB, ASPI, and HBV clearance) and 2.) development of CHB (CHB, DC, and HCC). We performed a chi-square test for trend in proportions of allele to identify SNPs increasing risk of HBV-related outcomes in the progressive stages. We observed association with one novel locus (SNP: rs1537862, Gene: LACE1, P = 1.85 × 10− 6), one association with a reported locus (SNP: rs9277542, Gene: HLA-DPB1, P = 1.50 × 10− 9), and two association variants at MHC genes (SNP: rs615672, Gene: HLA-DRB1, P = 1.39 × 10− 6; SNP: rs3128923, Gene: HLA-DPA2, P = 2.06 × 10− 6) with trend test of allele frequency across three outcomes (Table 4; Fig. 4A). The three reported MHC genes were demonstrated to play a critical role in the resistance of HBV infection, and two (HLA-DPB:rs9277542, HLA-DRB1:rs9277542) were identified to be associated with HBV clearance (Table 2). We did not observe any SNPs achieve genome-wide significant association with development of CHB; Two additional loci (SNP: rs6942409, Gene: AC011288.2, P = 3.08 × 10− 6) and the HCC associated locus (SNP: rs2833856, Gene: EVA1C, P = 1.62 × 10− 5) were associated with increased risk of DC and HCC during the development of CHB (Table 5; Fig. 4B).
Table 4
The significance of progressive HBV infection study
SNP | Gene | P value | Resistant Allele | Resistant Allele Frequency | | Related Risk |
CHB | ASPI | Clearance | | CHB (vs ASPI) | Clearance (vs ASPI) |
rs615672 | HLA-DRB1 | 1.39 × 10− 6 | G | 0.3297 | 0.4185 | 0.5276 | | 0.82 | 1.12 |
rs9277542 | HLA-DPB1 | 1.50 × 10− 9 | T | 0.3011 | 0.375 | 0.5347 | | 0.84 | 1.17 |
rs3128923 | HLA-DPA2 | 2.06 × 10− 6 | G | 0.3804 | 0.4348 | 0.5636 | | 0.89 | 1.14 |
rs1537862 | LACE1 | 1.85 × 10− 6 | C | 0.6 | 0.6374 | 0.7647 | | 0.92 | 1.19 |
Abbreviations: ASPI, asymptomatic persistence infection; CHB, chronic hepatitis B. RR was calculated with the comparison of CHB and ASPI, Clearance and ASPI respectively. |
Table 5
The suggestive significance of progressive CHB study
SNP | Gene | P value | Risk Allele | Risk Allele Frequency | | Related Risk |
CHB | DC | HCC | | DC (vs CHB) | HCC (vs CHB) |
rs6942409 | AC011288.2 | 3.08 × 10− 6 | G | 0.6129 | 0.7287 | 0.7958 | | 1.20 | 1.37 |
rs2833856 | EVA1C | 1.62 × 10− 5 | T | 0.5914 | 0.75 | 0.7757 | | 1.30 | 1.35 |
Abbreviations: CHB, chronic hepatitis B; DC, decompensated cirrhosis; HCC, hepatocellular carcinoma. RR was calculated with the comparison of HCC and CHB, DC and CHB respectively. |
Host genetic factors were demonstrated to influence concentrations of liver enzymes in plasma, which are widely used to indicate liver disease [37, 38] Here, to investigate the functional change in liver influenced by the HBV related loci described above, we performed a variance analysis in 10 clinical parameters of serum liver enzymes (ALT, AST, TBIL, DBIL, ALP, GGT, ALB, AFP, PTA, and PLT) between different genotypes in healthy controls (Supplementary Figure S4-9). Six loci (rs1537862, rs3128923, rs9277542, rs9277341, rs9277378, rs4661093) showed modest associations with concentrations of liver enzymes, including ALB, ALP, AFP, and PTA (Fig. 5). These associations suggest pathways linking the host genetic factors, metabolism, and liver function for understanding the mechanisms of infection and disease progression.
In sum, our study identified susceptibility SNPs associated with HBV related outcomes and SNPs increased the risk of progressive outcomes from HBV clearance to HBV chronic infection, DC, and HCC in a Chinese population (Supplementary Figure S10).