DOI: https://doi.org/10.21203/rs.2.19325/v3
Background: Hearing loss is the most common sensory defect, and it affects over 6% of the population worldwide. Approximately 50%-60% of hearing loss patients are attributed to genetic causes. Currently, more than 100 genes have been reported to cause non-syndromic hearing loss. It is possible and efficient to screen all potential disease-causing genes for hereditary hearing loss by whole exome sequencing (WES).
Methods: We collected 5 consanguineous pedigrees from Pakistan with hearing loss and applied WES in selected patients for each pedigree, followed by bioinformatics analysis and Sanger validation to identify the causal genes.
Results: Variants in 7 genes were identified and validated in these pedigrees. We identified single candidate variant for 3 pedigrees: GIPC3 (c.937T>C), LOXHD1 (c.6136G>A) and TMPRSS3 (c.941T>C). The remaining 2 pedigrees each contained two candidate variants: TECTA (c.4045G>A) and MYO15A (c.3310G>T and c.9913G>C) for one pedigree and DFNB59 (c.494G>A) and TRIOBP (c.1952C>T) for the other pedigree. The candidate variants were validated in all available samples by Sanger sequencing.
Conclusion: The candidate variants in hearing-loss genes were validated to be co-segregated in the pedigrees, and they may indicate the aetiologies of hearing loss in such patients. We also suggest that WES may be a suitable strategy for hearing-loss gene screening in clinical detection.
Hearing loss is the most common sensory defect, and it affects ~1/500 newborns [1] and 466 million people worldwide (https://www.who.int/pbd/deafness/estimates/en/). Approximately 50%~60% of hearing loss patients are attributed to genetic causes [1, 2]. Hereditary hearing loss is a genetically heterogeneous disorder [3] that can be divided into syndromic hearing loss and non-syndromic hearing loss, among which non-syndromic hearing loss is the predominant type, with a proportion of ~80% [4]. Currently, more than 100 genes have been reported to cause non-syndromic hearing loss (https://hereditaryhearingloss.org/), and the total number of genes related to hearing loss is expected to be several hundred.
There are mature gene panels for hearing-loss detection, and the genes involved range from 4 to more than 100. However, except for several genes, such as GJB2 [5-7] or SLC26A4 [8-10], most causal genes contribute a small fraction to the disorder. Therefore, in clinical detection, we may not obtain a satisfactory result by gene panel screening for many cases. As whole exome sequencing (WES) technology has rapidly developed and its cost has become less expensive, it is possible and efficient to screen all potential disease-causing genes for hereditary hearing loss by WES [11, 12].
Recessive inheritance hearing loss is worth studying because such patients usually have unaffected parents, which makes the disorder seem to be “sudden onset”, and this situation is more difficult to prevent. Consanguineous pedigrees represent a suitable natural model to study recessive disorders [13]. In Pakistan, there are numerous consanguineous pedigrees because of their customs, and these pedigrees may provide more opportunities to study and recognize such disorders [14, 15].
In this study, we collected 5 consanguineous pedigrees with hearing loss from Pakistan and applied WES to identify the causal genes. We identified several variants in hearing-loss genes that co-segregated in the pedigrees, and they may indicate the aetiologies of hearing loss in such patients.
Participants and clinical diagnosis
In the present study, we collected 5 consanguineous pedigrees containing 22 patients with hearing loss from rural areas in Pakistan. All the patients showed different degrees of hearing loss. The most likely inheritance mode for these pedigrees was autosomal recessive (Fig 1). The study was approved by the ethical committee of the National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Pakistan, and all participants provided written informed consent.
DNA extraction and whole exome sequencing
According to the manufacturer's instructions, genomic DNA was isolated from the peripheral blood leukocytes of all participants using a DNA QIAamp mini kit (Qiagen, Hilden, Germany). One patient from each pedigree was selected, and WES was performed. Exons were captured using the BGI-Exome kit V4 and sequenced by BGI-seq 500 with 100 bp paired-end sequencing.
Bioinformatics analysis
Low-quality reads were removed by SOAPnuke [16], and then the reads were mapped to the human genome reference (UCSCGRCh37/hg19) by the Burrows–Wheeler Aligner (BWA-MEM, version 0.7.10) [17]. Variants were called using the Genome Analysis Tool Kit (GATK, version 3.3) [18]. Variant Effect Predictor (VEP) [19] was used to annotate and classify all the variants. After that, all the variants were filtered based on their frequency in public databases (e.g., 1000 Genomes Project, Exome Sequencing Project and ExAC) and our in-house databases, and the variants with MAF<0.005 were retained. Then, homozygous variants and compound heterozygous variants were selected because the most likely inheritance mode for these pedigrees was autosomal recessive. Finally, we applied several variant prediction tools including SIFT [20] (http://provean.jcvi.org/), PolyPhen2 [21] (http://genetics.bwh.harvard.edu/pph2/), MutationTaster [22] (http://www.mutationtaster.org/) and CADD [23], to predict the functional impact of candidate variants [24].
Sanger validation
DNA from all available samples in the five pedigrees was Sanger sequenced to validate the variants and confirm their co-segregation in the pedigree. Forward and reverse primers were designed by Primer3. After PCR amplification, the purified product was sequenced on ABI 3730XL DNA Analyzer.
Clinical features
All the patients showed different degrees of hearing loss. In the first pedigree (HL1), all patients showed severe deafness, and one patient (III1) was selected for WES. In the second pedigree (HL2), all patients showed congenital profound deafness and muteness, and one patient (IV1) was selected for WES. In the third pedigree (HL3), all patients showed moderate deafness (their hearing loss started after seizures), and one patient (V1) was selected for WES. In the fourth pedigree (HL4), all patients showed congenital profound deafness, and one patient (IV1) was selected for WES. In the fifth pedigree (HL5), all patients showed moderate deafness, and one patient (III5) was selected for WES.
Genetic analysis
WES was applied in the selected patients. The average depth of the target region was 146X with a coverage of 99.85%, and the coverage of the target region that was sequenced at least 10 times (depth >= 10 X) was 98.20% (Table 1). For each individual, more than ten thousand variants that may influence protein were identified. After frequency filtration (MAF<0.005), approximately 15~32 exon variants were retained. Further inheritance model filtration retained 1~6 candidate variants (Table 1). All the rare variants detected in the exon region for the pedigrees are listed in the supplementary table.
We identified a stop codon lost homozygous variant, GIPC3: c.937T>C, from the patient in the HL1 pedigree, and the variant prediction tools provided a benign prediction. For the patient from the HL2 pedigree, we identified 4 variants in 2 genes. However, one gene was reported to cause autosomal dominant hearing loss, therefore, we first analyzed the other gene. Then, a homozygous variant, LOXHD1: c.6136G>A, was regarded as a candidate variant for this pedigree. The variant prediction tools indicated a damaging prediction. For the patient from the HL3 pedigree, 6 variants in 5 genes were identified at first, and further analysis indicated that only 2 genes may cause autosomal recessive deafness. Therefore, the homozygous variant, TECTA: c.4045G>A, and two compound heterozygous variants, c.3310G>T and c.9913G>C in MYO15A, were regarded as candidate variants. The variant prediction tools indicated benign prediction for both MYO15A variants and damaging prediction for the TECTA variant. For the patient from the HL4 pedigree, 3 variants in 3 genes were identified, and two of them may cause autosomal recessive deafness. The two homozygous variants were DFNB59: c.494G>A and TRIOBP: c.1952C>T. The variant prediction tools indicated damaging prediction for the first variant and benign prediction for the last variant. For the patient from the HL5 pedigree, only one homozygous candidate variant was identified, TMPRSS3: c.941T>C. The variant prediction tools indicated a damaging prediction. None of these variants were previously reported to cause hearing loss. The detailed information is listed in Table 2.
In summary, we identified one most likely causing variant for the HL1, HL2 and HL5 pedigrees and two most likely causing candidate variants for the HL3 and HL4 pedigrees.
Sanger validation
To validate co-segregation in the pedigree, we applied Sanger sequencing to all available samples. In total, 6 samples (II1~2 and III1~4) were sequenced for the HL1 pedigree, 6 samples (III1~2 and IV1~4) were sequenced for the HL2 pedigree, 4 samples (IV1 and V1~3) were sequenced for the HL3 pedigree, and 21 samples (I1, II1~11 and III1~9) were sequenced for the HL5 pedigree. For the HL4 pedigree, the initial samples collected were degraded, and we failed to collect additional samples. All the variants selected for Sanger sequencing were co-segregated in the pedigrees except for the HL4 pedigree (Fig 2).
In this study, we identified several variants in genes reported to cause hearing loss that co-segregated in the pedigrees. For the HL1 pedigree, the variant in the GIPC3 gene may cause the disorder. GIPC3 encodes a 312-residue protein that contains three predicted low-complexity regions and a central conserved PDZ domain [25]. The PDZ domain of GIPC3 is required for the survival of spiral ganglion and hair cells in the mouse ears. This gene was reported to cause autosomal recessive deafness 15, non-syndromic genetic deafness and audiogenic seizures. Currently, 11 pathogenic variants have been reported in this gene in ClinVar.
For the HL2 pedigree, the variant in the LOXHD1 gene was likely the causative variant. LOXHD1 encodes a highly conserved stereociliary protein consisting of 15 polycystin-1/lipoxygenase/alpha-toxin (PLAT) domains that facilitate protein interactions with the plasma membrane [26]. Loxhd1 in mice plays a crucial role in maintaining the normal function of cochlear hair cells [27]. It was reported to cause disorders including autosomal recessive deafness 77. Currently, 28 pathogenic variants have been reported in this gene in ClinVar.
For the HL3 pedigree, a homozygous variant in TECTA and two compound heterozygous variants in MYO15A were the likely candidate variants. TECTA encodes a protein that contains 2,155 amino acids and is one of the major non-collagenous glycoproteins of the tectorial membrane, a non-cellular matrix overlying the cochlear neuroepithelium that lies over stereocilia of the hair cells and is critical for the mechanical amplification and transmission of sound [28, 29]. This gene was reported to cause autosomal recessive deafness 21, and 40 pathogenic variants were reported in this gene in ClinVar. The protein encoded by MYO15A is a member of the unconventional myosin superfamily and plays an indispensable role in the graded elongation of stereocilia and actin organization in hair cells of the inner ear, which are essential for normal hearing function [30]. MYO15A was reported to cause autosomal recessive deafness 3, and 112 pathogenic variants were reported in this gene in ClinVar. Considering the transmission of variants in the pedigree (this was a consanguineous pedigree), we thought that the homozygous variant in TECTA was more likely to cause the disorder in this pedigree than the compound heterozygous variants in MYO15A.
For the HL4 pedigree, homozygous variants were detected in both DFNB59 and TRIOBP. DFNB59 encodes a protein that contains 352 amino acids and plays a crucial role in auditory nerve signaling transmission [31]. It was reported to cause autosomal recessive deafness 59, and 9 pathogenic variants were reported in this gene in ClinVar. TRIOBP encodes a protein containing 652 amino acids that plays a role in the regulation of adherens junctions as well as the reorganization of the actin cytoskeleton [32]. Actually, little is known about the exact function of TRIOBP, and the multiple roles of this gene raised the issue of why pathogenic variants in this gene do not lead to pathologies other than isolated hearing loss. This gene was reported to cause autosomal recessive deafness 28, and 26 pathogenic variants were reported in this gene in ClinVar. The variant in TRIOBP was annotated as likely benign in the deafness variation database. Therefore, we thought the variant in DFNB59 was more likely to be responsible for the disorder in this pedigree than the TRIOBP variant.
For the HL5 pedigree, the variant in TMPRSS3 may cause the disorder. The protein encoded by this gene contains a serine protease domain, a transmembrane domain, an LDL receptor-like domain, and a scavenger receptor cysteine-rich domain. It plays an important role in activating the ENaC sodium channel, which is regulated by serine protease activity [33], and it maintains a low Na+ concentration in the endolymph of the inner ear [34]. TMPRSS3 was reported to cause autosomal recessive deafness 8, and 23 pathogenic variants were reported in this gene in ClinVar.
We calculated the density of reported pathogenic variants in these genes, which were 11.7/kb, 4.2/kb, 6.2/kb, 10.6/kb, 8.5/kb, 3.7/kb and 16.8/kb for GIPC3, LOXHD1, TECTA, MYO15A, DFNB59, TRIOBP and TMPRSS3, respectively. The density may indicate the degree of understanding or focus for different genes. Genes with low density, such as LOXHD1 and TRIOBP, may have potential research value.
The majority of causal genes we identified for these pedigrees were not common hearing-loss genes. If we applied a common hearing-loss gene panel to screen these patients, we would obtain negative results, and the causal gene/variant for the patients would be missed. Therefore, WES may be a better strategy than panel sequencing for hearing-loss screening even in clinical detection.
In conclusion, we applied WES in five consanguineous pedigrees (one patient per pedigree) from Pakistan with hearing loss, followed by Sanger sequencing for all available samples among the pedigrees to identify the causal genes for them. Several variants in hearing-loss genes were validated to be co-segregated in the pedigrees, and they may indicate the aetiologies of hearing loss in such patients. Moreover, we suggest that WES may be a suitable strategy for hearing-loss screening in clinical detection.
Ethics approval and consent to participate
This study was approved by the ethical committee of National institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Pakistan and all participants provided written informed consent.
Consent for publication
Not applicable.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request. The data that support the findings of this study have been deposited in the CNSA (https://db.cngb.org/cnsa/) of CNGBdb with accession code CNP0000508-sub010963.
Competing interests
The authors declare that they have no competing interests.
Funding
No funding was obtained for this study.
Authors' contributions
YZ analyzed and interpreted the WES data and write the manuscript. MT analyzed and interpreted the patient data and modified the manuscript. SH analyzed and interpreted the WES data. UA performed Sanger sequencing. JZ designed the project and guided the analysis of WES data. SB designed the project, guided the analysis of patient data and helped the manuscript. All authors read and approved the final manuscript.
Acknowledgments
The authors thanked all the participating patients and the families in this study for their cooperation.
Table 1. Sequencing and variants data
Pedigrees |
HL1 |
HL2 |
HL3 |
HL4 |
HL5 |
Samples applied WES |
III1 |
IV1 |
V1 |
IV1 |
III5 |
Sequencing depth (X) |
136.98 |
136.47 |
142.36 |
143.74 |
170.79 |
Coverage (%) |
99.75 |
99.88 |
99.9 |
99.83 |
99.9 |
10X coverage (%) |
98.11 |
98.21 |
98.21 |
97.84 |
98.65 |
Exon variants with MAF<0.005 |
23 |
22 |
32 |
15 |
19 |
Variants followed recessive model |
1 |
4 |
6 |
3 |
1 |
Variants applied Sanger validation |
1 |
1 |
3 |
0 |
1 |
Table 2. Detailed information for candidate variants
Pedigree |
Variant |
RS_number |
Gene |
Strand |
DNA change |
AA change |
Type |
SIFT (score) |
PolyPhen2 (score) |
MutationTaster (score) |
CADD (score) |
ACMG classify |
HL1 |
19-3590186-T-C |
rs1466835034 |
GIPC3 |
+ |
c.937T>C |
p.*313Gluext*98 |
homozygous |
NA (NA) |
NA (NA) |
Polymorphism (not provided) |
Benign (11.43) |
Likely pathogenic (PM2+PM4+PP1+PP4) |
HL2 |
18-44063569-C-T |
rs774836161 |
LOXHD1 |
- |
c.6136G>A |
p.Glu2046Lys |
homozygous |
Damaging (0.016) |
Damaging (0.995) |
disease_causing (not provided) |
Damaging (29.2) |
Uncertain significance (PM2+PP1+PP3+PP4) |
HL3 |
11-121016765-G-A |
rs141024429 |
TECTA |
+ |
c.4045G>A |
p.Ala1349Thr |
homozygous |
Tolerated (0.058) |
Benign (0.061) |
disease_causing (not provided) |
Benign (23.2) |
Uncertain significance (PM2+PP1+PP3+PP4) |
HL3 |
17-18025424-G-T |
rs919809633 |
MYO15A |
- |
c.3310G>T |
p.Gly1104Cys |
compound heterozygous |
Damaging (0) |
Benign (0.011) |
Polymorphism (not provided) |
Benign (15.33) |
Uncertain significance (PM2+PP1+PP3+PP4) |
HL3 |
17-18069800-G-C |
rs535441567 |
MYO15A |
- |
c.9913G>C |
p.Glu3305Gln |
compound heterozygous |
Damaging (0.007) |
Damaging (0.83) |
disease_causing (not provided) |
Damaging (27.9) |
Uncertain significance (PM2+PP1+PP3+PP4) |
HL4 |
2-179320814-G-A |
NA |
DFNB59 |
- |
c.494G>A |
p.Ser165Asn |
homozygous |
Damaging (0.02) |
Damaging (0.931) |
disease_causing (not provided) |
Damaging (25.2) |
Uncertain significance (PM2+PP3+PP4) |
HL4 |
22-38120515-C-T |
rs760246167 |
TRIOBP |
+ |
c.1952C>T |
p.Ser651Phe |
homozygous |
Damaging (0.003) |
Benign (0.036) |
Polymorphism (not provided) |
Benign (15.1) |
Uncertain significance (PM2+PP3+PP4) |
HL5 |
21-43795850-A-G |
NA |
TMPRSS3 |
- |
c.941T>C |
p.Leu314Pro |
homozygous |
Damaging (0.001) |
Damaging (1) |
disease_causing (not provided) |
Damaging (28.9) |
Uncertain significance (PM2+PP1+PP3+PP4) |