In the present study, we explored the mutations in the ATP7B gene in 101 WD probands from southern China. Forty-eight mutations were found, including 7 novel variants. These novel variants were not found in the control chromosomes. Substituted amino acids with a PolyPhen-2 score close to 1.000 could be predicted to be potentially damaging. However, insertion, deletion and premature stop mutations failed to yield acceptable results from PolyPhen-2 analysis. We regarded the insertion mutation (1510–1511 ins A) as a clearly pathogenic mutation since it caused a frameshift leading to a premature stop codon. According to the PolyPhen-2 score, the other six variants (2075 T > C (Leu692Pro), 2233 C > A (Leu745Met), 3209C > G (Pro1070Arg), 3677C > T (Thr1226Ile), 3793 G > T (Val1265Leu), 3824 T > C (Leu1275Ser)) were all predicted to affect protein function. A mutation detection rate of 80.7% was achieved in the southern cohort, but 19.3% of alleles remained unidentified. Failure to detect the remaining mutations may be explained by some objective factors, such as the primers, the PCR procedure and/or the sequence alignment. In our recent studies, the detection rate of direct sequencing could reach as high as 95% with the newly designed PCR primers and the improved amplification requirements. Another reason may be due to the presence of mutations outside the open reading frame of the gene, i.e., in the promoter, introns, the presence of gene rearrangements or possible mutations in other copper-transport chaperone gene, such as ATOX1 and COMMD1[28].
2333G > T (Arg778Leu) was the most frequent mutation in our study and was also described as the most common mutation in China [16–21, 23, 24], accounting for 18.8% of alleles studied here. The second most common mutation was 2975 C > T (Pro992Leu) among the WD patients, with an allelic frequency of 13.4%, consistent with the frequency previously reported in China [17, 19, 20, 23]. However, Hong et al [24] suggested that 3443 T > C (Ile1148Thr) was the second most common mutation instead of 2975 C > T in their cohort study of 103 Chinese WD patients. An earlier study of 114 WD patients from northern China demonstrated that 2621 C > T (Ala874Val) was the second hot-spot mutation, followed by 2975 C > T, at an allelic frequency of 6.1% [29]. We speculate that different gene-level tests or a limited number of patients is largely responsible for the differential conclusions. A large-scale or prospective study, based on the same detection standard, is imperative.
In our current study, exon 8 was the most frequent mutational site, found in 28.8% of mutant alleles, followed by exon 13 in 19.6% and exon 16 in 14.1%, indicating that these three exons could be important regions for detecting mutations.
We conducted a comprehensive analysis of the spectrum and frequency of ATP7B mutations in a large-scale sample of Chinese WD patients from more than 30 provinces, autonomous regions and municipalities of China. A total of 233 distinct mutations were detected, of which 85 were novel. The computational predictive analysis software PolyPhen-2 interpreted most of the novel missense variants as disease-causing mutations, with the exception of several benign variants (121A > G (Asn41Asp), 748G > A (Gly250Arg), 1168A > G (Ile390Val), 1426G > A (Ala476Thr), 1760C > T (Thr587Met), 3368C > T (Pro1123Leu)). It cannot be ruled out that the silent mutations interpreted as benign could affect protein function.
The most prevalent mutation in the 1302 WD patients pool was 2333G > T (Arg778Leu), in exon 8, with an allelic frequency of 28.6%. The 2333G > T (Arg778Leu) mutation is frequently found in reports of Asian patients, with an allele frequency of 12 to 50% [30], [14, 16–20]. In contrast, the 3207 C > A (His1069Gln) mutation, the most common mutation in European and North American populations, accounting for 30 to 70% of the alleles studied [30], was not detected in any Chinese patients. The next most frequent mutation in this large cohort of Chinese patients was 2975 C > T (Pro992Leu), with an allelic frequency of 13.0%. To our knowledge, the highest frequency of 2975 C > T described so far was 27% [14].
All exons except exon 1 were affected. Notably, exons 8, 13, 12 and 16 were the hot-spot exons identified in the large WD population, accounting for 73.6% of mutant alleles, consistent with previous results that 60.5–74% of mutations were located on the above hot-spot exons [31].
The spectrum of WD mutations in the large cohort of Chinese patients consisted of a small number of relatively frequent mutations and a greater number of rare mutations. This further indicated a high degree of mutational heterogeneity, in agreement with previously published findings [32], [33]. Moreover, we found that many mutations were located a short distance away, in line with the preliminary results [34]. Additionally, 64.3% patients were found to stay in a compound heterozygotic state, compared with 13.8% patients in a homozygotic state and 17.4% patients in a single heterozygotic state, which can be explained by the low percentage of consanguinity in our investigated population. No significant difference in phenotypic profiles were found when comparing homozygous or combined heterozygous patients with the patients who had only one mutation at two alleles. We suspected that, to the patients with a single mutation, the remaining unidentified mutations would probably be located in non-coding regions of the ATP7B gene. Other mutational mechanisms should also be taken into consideration.
One polymorphism with substitution of leucine with leucine at codon 770 in the transmembrane region of ATP7B has been found to be linked with the 2333 G > T (Arg778Leu) mutation. Perhaps, the coexistence of the 2333G > T mutation and the 2310C > G polymorphism would have a special effect on the ATP7B protein. Further investigation of the functional implications of both is needed.
A well-defined landscape of the genotype-phenotype correlation will promote the development of clinical studies. However, most of previous studies devoted to genotype-phenotype association have addressed rare or conflicting conclusions. Verification of this requires a cohort study. Our study significantly described a systemic and quantitative analysis of the genotype-phenotype correlation in a large cohort of Chinese patients with WD.
In the demonstration of hot-spot exons in different types of clinical presentations, we identified that exon 11 was ranked as the second most mutational exon in the mixed presentation. The difference in the proportion of patients with mutations in exon 11 between the hepatic and the mixed group was significant (P = 0.046), while the difference between the neurological and the mixed group was not significant. We could not reach the correlation between the mutations in exon 11 and the mixed manifestation. 2621 C > T (Ala874Val), the predominant mutation in exon 11, frequently presented with 2333 G > T (Arg778Leu) substitution. Krishna et al considered that the hydrophobicity and conformational stability of the hydrophobic domains, such as transmembrane domains, may be altered due to the valine amino acid [35]. We speculated that the transmembrane domain region of ATP7B with valine at the 874 domain region and with leucine at 778 could probably destabilise the formation or influence the expression of protein. Functional studies of mutations are required for the validation of our speculation. Another finding in the analysis of exon hot spots in different clinical presentations was that exon 18 ranked as the third exon with the most mutations in the hepatic presentation group, with a higher mutation frequency (10.38%) than that in the other two groups (6.25% and 3.85% in the neurological and mixed presentation, respectively). This is probably attributable to the potential association between the mutations in exon 18 and the hepatic involvement. Fortunately, we identified that 3884 C > T (Ala1295Val), one kind of the mutations in exon 18, only mutated in the patients with hepatic manifestation. Statistical analysis revealed that there was a significant association between the 3884 C > T (Ala1295Val) mutation and the hepatic phenotype, which was consistent with previous observations that mutations in the conserved ATP hinge region were associated with liver disease without neurological presentation [36], and when the mutation affected the ATP hinge, it resulted in hepatic failure [37].
Furthermore, we found a statistically significant correlation between the 2333 G > T (Arg778Leu) mutation and lower serum ceruloplasmin levels. The difference in the serum ceruloplasmin level between 2333 G > T homozygous or heterozygous patients and non-2333 G > T patients was significant. A recent study in a large cohort of Chinese WD patients [1] showed that 2333 G > T was related to lower levels of ceruloplasmin as well. That study also suggested that 2333 G > T was related to younger onset age. However, in our study, we did not find the significant difference between 2333 G > T and the onset age, and we did not find a considerable difference between 2333 G > T and the hepatic manifestation either, as previously reported by Liu et al [38]. Significant difference in the age of onset was observed between 2975 C > T homozygous or combined heterozygous patients and non-2975 C > T patients. Collectively, our finding revealed that the patients with 2975 C > T mutation often presented with WD profiles at an earlier age, usually before 13.7 years old, than the patients with other mutations, while Hua et al [19] described that the patients with 2975C > T often presented WD features before 12 years old. We also found a remarkable association between 3809 A > G (Asn1270Ser) and the disease onset age. Statistical findings showed that the patients with 3809 A > G mutation usually manifested the WD features before 10.8 years old, much earlier than the patients with other mutations at two chromosomes.
We also observed a visible correlation between the onset age and the characteristic clinical manifestations. Our findings showed that the patients with mixed manifestation had a later age of onset than the groups with either liver disease or neurological phenotype. However, the two latter onset ages were not significantly different from one another, inconsistent with the understanding that patients having predominantly neuropsychiatric symptoms usually manifest symptoms later than patients with hepatic presentation [39][40][41]. Our results showed that the liver and brain could be affected by WD simultaneously.
In addition, we discovered interesting clinical differences between the symptomatic group and the asymptomatic group in terms of the presence of KF rings. KF rings of symptomatic patients were found to be significantly higher than KF rings of asymptomatic cases, consistent with the earlier findings [42]. We also found that the patients who had KF rings were significantly older at symptom onset than the cohort without KF rings, which is in line with the finding identified in our study that the patients with typical clinical manifestations significantly displayed later age of onset than the patients who were clinically asymptomatic. In summary, the cohort with clinical symptoms presented with a later age of onset and higher prevalence of KF rings than the asymptomatic cohort. One report claimed that patients with purely neurological symptoms were susceptible to KF rings [43]. Our current study did not show any clear difference in the presence of KF rings among the three groups with different clinical manifestations.
In conclusion, we characterised a complete genotypic and phenotypic profile of Chinese patients with WD. The seven novel mutations identified in the southern Chinese WD patients could considerably extend the previously established spectrum of the ATP7B mutations. Comprehensive mutation analysis will enhance the current knowledge of WD genetics in China. The findings of correlation between specific mutations and clinical features, as well as the age of onset and several clinical profiles provides new insights into the relationships between genotype and phenotype. Additional large studies are required for validation of our conclusions.