Whole exome sequencing of adult Indians with apparently acquired Aplastic Anemia: initial experience at tertiary care hospital

doi:10.21203/rs.3.rs-2836149/v1

This work is licensed under a CC BY 4.0 License

Aplastic anaemia (AA) is a rare hypocellular bone marrow disease that can be acquired or constitutional. Nearly 10–30% of patients with apparently acquired AA carry mutations in the telomerase reverse transcriptase gene (TERT) that lead to bone marrow failure. TERT plays a crucial role in regulating the telomerase ribonucleoprotein complex; when this regulation fails, telomeres shorten, leading to AA. We used our benchmarked whole exome sequencing (WES) pipeline and systems bioinformatics approaches to identify sequence variants underlying AA in adult Indian subjects with apparently acquired AA. For 36 affected individuals, all treated with Cyclosporine A (CsA), we sequenced coding regions to a mean coverage of 100×, which provided sufficient depth. After downstream filtering and validation of the variant calls, we identified a set of candidate genes associated with AA. Across all samples, six genes were associated with the AA phenotype, including, as an exceptional case, one non-coding SNP in an intronic region of interferon gamma (IFNG). While these variants (in TERT (G/X), IFNG (T/C), PIGA (T/X) or (T/A), NBS1/NBN (T/X), MPL (G/C) and CYP3A5) were found across the subjects, a majority of control samples did not carry them. We demonstrate the application of WES to discover variants associated with CsA responders and non-responders in an Indian cohort.

Biological sciences/Computational biology and bioinformatics

Biological sciences/Genetics

Aplastic Anemia

Next Generation Sequencing

Systems Genomics

Exome sequencing

Aplastic anaemia (AA) is a rare hypocellular bone marrow disease in which the bone marrow contains very few hematopoietic cells^1. Nearly 10–30% of patients with apparently acquired AA carry mutations in the telomerase reverse transcriptase gene (TERT) that lead to bone marrow failure². The TERT gene maintains the telomerase ribonucleoprotein complex and plays a crucial role in its regulation; loss of this function results in short telomeres, leading to AA. Current treatment options for AA include hematopoietic stem cell transplantation (HSCT), anti-thymocyte globulin (ATG) and Cyclosporine A (CsA), which are regarded as standard therapies. HSCT is the treatment of choice for AA patients with TERT mutations. However, many of these patients respond to androgens. In view of the costs associated with HSCT and ATG, many patients are prescribed a combination of CsA and androgen therapy. CsA together with Danazol, an anabolic steroid, is administered for treating AA, and the combination has been assessed in various populations^31–33. Because patients with documented telomerase mutations do not respond to immunosuppressive therapy (the standard of care), bone marrow transplantation (BMT) is the treatment of choice for them^3,4,5,29.

Over the years, the genetic characterization of AA has progressed steadily¹⁷, with studies using whole exome sequencing (WES) and mutational screening assays.^18 Recently, Zhang et al. identified potential pathogenic genes for severe AA (SAA) and explored possible genetic variants in CD8+ T cells.¹⁹ In addition, unlike WES, targeted next generation sequencing (NGS) can detect variants with a lower allelic burden at a higher rate, and the reported somatic mutation frequency ranges between 5% and 70% across NGS studies^6,22,28. However, the occurrence of such mutations corresponds with the duration of disease, suggesting selective pressure favouring cell survival^16,23,28. Recent efforts have paved the way for identifying AA variants and their likely effects; for example, the association of homozygous MPL mutations with familial AA is well established^7,8. Although the magnitude of the problem and the morphological changes associated with AA have been studied, there are no published data on TERT mutations in Indian patients with acquired AA⁹. Our earlier pilot study attempted to identify TERT mutations in patients with apparently acquired AA, but knowledge from NGS approaches has not yet been translated to telomere dysfunction in the Indian cohort.¹⁰ This could be because of the divergent choice of treatment in those individuals. Measuring telomere length and identifying inherent candidate variants would therefore help in understanding the disease. In this study, we sought to identify candidate genetic variants associated with AA in the Indian population using WES. All 36 patients in our cohort were treated with CsA and Danazol. We defined CsA responders as those who became blood transfusion-independent or needed blood transfusion only very sparingly. The samples were analyzed for variants predicted to be associated with or causal for AA, and downstream annotation yielded bona fide variants with a marked impact on the risk of AA.

Patients and samples

The AA subjects were recruited from the Department of General Medicine, SMS Medical College and Hospital, Jaipur, during 2016–2020 with ethics approval from the institutional ethics committee and the Indian Council of Medical Research (ICMR)/Department of Health Research (DHR), Government of India, New Delhi. All methods were carried out in accordance with relevant guidelines and regulations. Informed consent was obtained from all patients after the study and its procedures had been fully explained to them. A 2 ml blood sample was drawn from a peripheral vein of each patient for WES. Standard management procedures for collecting AA samples were followed. A total of 36 AA subject samples (21 male/15 female) and 41 healthy controls (25 male/16 female), with a mean age of 32 years across the two cohorts, were sequenced. Subjects who responded to CsA and Danazol treatment were considered CsA responders, while those who underwent blood transfusion were considered non-responders (Supplementary Table 1). The treatment regimen ranged from 6 months to 1 year in follow-up cases. All raw data were analysed in-house on our local server with 1 TB RAM and 64 processors.

Exome capture and sequencing

WES was performed on 36 subjects, with genomic DNA (gDNA) extracted using the QIAamp® DNA Blood Mini Kit (Cat. No. 51104). The quality of the gDNA was checked using a Qubit® 2.0 Fluorometer followed by agarose gel electrophoresis. Briefly, we used 200 ng of gDNA to generate 300–350 bp fragments and performed end repair, adapter ligation and amplification. The adapter-ligated DNA was hybridized using Agilent V5 + UTRs chemistry (Human All Exon 75 Mb kit) and sequenced with paired-end reads, yielding an approximate depth of coverage of 110× and 6–8 GB of data per sample.
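As a rough plausibility check on these figures, mean depth can be approximated as the number of usable sequenced bases divided by the size of the captured target. The sketch below is only illustrative and rests on two assumptions that the text does not state: that "GB" refers to gigabases of sequence, and that every sequenced base falls on target. The function name `mean_depth` and its parameters are hypothetical.

```python
def mean_depth(total_bases: float, target_size_bp: float,
               on_target_fraction: float = 1.0) -> float:
    """Approximate mean per-base depth over a capture target.

    depth = (sequenced bases * fraction on target) / target size
    """
    if target_size_bp <= 0:
        raise ValueError("target_size_bp must be positive")
    if not 0.0 <= on_target_fraction <= 1.0:
        raise ValueError("on_target_fraction must lie in [0, 1]")
    return total_bases * on_target_fraction / target_size_bp


# 75 Mb capture target, as stated for the Human All Exon kit above.
TARGET_BP = 75e6

# Assumption: 6-8 "GB" per sample is read as 6-8 gigabases of sequence.
for gigabases in (6, 8):
    depth = mean_depth(gigabases * 1e9, TARGET_BP)
    print(f"{gigabases} Gb over a 75 Mb target: about {depth:.0f}x")
```

Under these assumptions the upper end of the reported data volume is close to the stated depth; a lower on-target fraction would reduce the estimate proportionally.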

Quality control (QC) and variant calling

After QC, we obtained sufficient reads for 33 samples. Quality assessment was done for all samples using FastQC, with raw reads checked for quality, GC bias and duplication levels. Reads were aligned to the human reference genome (hg38) using bowtie2 as part of an in-house pipeline for downstream analyses¹¹. Variant calling was performed using VarScan, with variants filtered for false discovery rate (FDR) and heterozygous ("het") mutations counted through awk/bash scripts. The VarScan somatic command was used with the mpileup option and a minimum coverage of 10× to identify possible somatic variants. In this process, the corresponding genotypes of the samples were checked to infer somatic status. Mutations predicted to be "deleterious" were shortlisted for Sanger validation.
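The heterozygous-call counting step can be illustrated with a short sketch. The study used awk/bash scripts for this; the Python version below is a stand-in written for illustration, not the authors' code. It assumes a single-sample VCF whose FORMAT column carries `GT` and `DP` keys, and it reuses the 10× minimum coverage mentioned above as a depth threshold. All function names and the example records are hypothetical.

```python
from typing import Dict, Iterable


def parse_sample_field(fmt: str, sample: str) -> Dict[str, str]:
    """Pair FORMAT keys (e.g. 'GT:DP') with the sample's values."""
    return dict(zip(fmt.split(":"), sample.split(":")))


def is_het(genotype: str) -> bool:
    """True for a called diploid genotype with two different alleles."""
    alleles = genotype.replace("|", "/").split("/")
    if len(alleles) != 2 or "." in alleles:
        return False
    return alleles[0] != alleles[1]


def count_het_calls(vcf_lines: Iterable[str], min_depth: int = 10) -> int:
    """Count heterozygous calls with DP >= min_depth in a single-sample VCF."""
    count = 0
    for line in vcf_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 10:
            continue  # no FORMAT/sample columns present
        sample = parse_sample_field(fields[8], fields[9])
        depth = sample.get("DP", "")
        if not depth.isdigit() or int(depth) < min_depth:
            continue  # missing or insufficient depth
        if is_het(sample.get("GT", "")):
            count += 1
    return count


# Hypothetical records, invented for illustration (not study data).
EXAMPLE_VCF = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tSAMPLE",
    "chr1\t100\t.\tA\tG\t.\tPASS\t.\tGT:DP\t0/1:25",  # het, deep enough
    "chr1\t200\t.\tC\tT\t.\tPASS\t.\tGT:DP\t1/1:40",  # homozygous alt
    "chr2\t300\t.\tG\tA\t.\tPASS\t.\tGT:DP\t0/1:6",   # het, too shallow
    "chr2\t400\t.\tT\tC\t.\tPASS\t.\tGT:DP\t./.:0",   # no call
]

print(count_het_calls(EXAMPLE_VCF))
```

Lowering `min_depth` admits the shallow heterozygous site as well, which shows how sensitive such counts are to the coverage threshold.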

Telomere length analysis and Validation of SNPs

To check whether the AA subjects have shortened telomeres, we used ScienCell's Absolute Human Telomere Length Quantification qPCR Assay Kit (AHTLQ; Catalog #8918). A single copy reference (SCR) primer set, which amplifies a 100 bp region on human chromosome 17, served as a reference for data normalization. The primer sets were validated by qPCR, and gel electrophoresis was run to check the amplification efficiency. All SNPs were mapped to publicly available databases, and the Ensembl Variant Effect Predictor (VEP) was used to retrieve reference SNP (rs) IDs for all SNPs at a minor allele frequency (MAF) cutoff of 0.05. Based on the WES data, the shortlisted variants were cross-checked using downstream validation resources, viz. Varsome, CADD and GERP scores. Finally, the common variants, along with the selected variants, were visualized using the Integrative Genomics Viewer (IGV), and a set of SNPs that exhibited significant association with AA were validated using Sanger sequencing (Supplementary information).
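The normalization against the SCR primer set can be illustrated with the comparative ΔΔCq calculation that is commonly used for reference-normalized qPCR. The sketch below is an assumption about how such Cq values could be combined, not the authors' stated procedure: it presumes that a reference genomic DNA sample of known telomere length is run alongside each subject and that amplification efficiency is 100% (a factor of 2 per cycle). All function and parameter names are hypothetical.

```python
def relative_telomere_length(cq_tel_sample: float, cq_scr_sample: float,
                             cq_tel_ref: float, cq_scr_ref: float) -> float:
    """Fold difference in telomere signal, sample versus reference DNA.

    delta_tel   = Cq(TEL, sample) - Cq(TEL, reference)
    delta_scr   = Cq(SCR, sample) - Cq(SCR, reference)
    delta_delta = delta_tel - delta_scr
    fold        = 2 ** (-delta_delta)   # assumes 100% efficiency
    """
    delta_tel = cq_tel_sample - cq_tel_ref
    delta_scr = cq_scr_sample - cq_scr_ref
    delta_delta = delta_tel - delta_scr
    return 2.0 ** (-delta_delta)


def absolute_telomere_length(ref_length_kb: float,
                             cq_tel_sample: float, cq_scr_sample: float,
                             cq_tel_ref: float, cq_scr_ref: float) -> float:
    """Scale the reference's known telomere length by the fold difference."""
    fold = relative_telomere_length(cq_tel_sample, cq_scr_sample,
                                    cq_tel_ref, cq_scr_ref)
    return ref_length_kb * fold


# Hypothetical Cq values: the sample's telomere signal appears one cycle
# later than the reference while the SCR signal is unchanged, so the
# sample carries half the telomere content of the reference.
print(relative_telomere_length(21.0, 25.0, 20.0, 25.0))
```

Subtracting the SCR difference corrects for unequal DNA input between the sample and the reference, which is why a single-copy locus is amplified alongside the telomere primers.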

Statistical analyses and population stratification

All statistical tests were done after checking sample relationships. Variants in the BCF/VCF files with MAF ≤ 0.05 and a minimum depth (DP) ≥ 5 were used for mapping the pathogenic variants. We checked this in multiple passes: we first calculated the mean number of heterozygous rare (MAF ≤ 0.5%) variants observed in the entire dataset and then calculated the mean and variance of these means for the genes, which we tabulated. VerifyBamID was used to infer whether the reads were contaminated across samples¹². To develop an interaction network, we used the Cytoscape cytoHubba tool to infer the clustering coefficient.³⁰
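The two thresholds described above (MAF ≤ 0.05 with DP ≥ 5 for candidate variants, and MAF ≤ 0.5% for rare heterozygous variants) can be expressed as a small filtering sketch. This is a simplified illustration rather than the authors' implementation: it summarizes rare heterozygous calls per sample instead of per gene, the record layout and function names are hypothetical, and the example calls are invented and are not study data.

```python
from dataclasses import dataclass
from statistics import mean, variance
from typing import Dict, Iterable, List


@dataclass(frozen=True)
class Call:
    """One variant call in one sample (hypothetical record layout)."""
    sample: str
    gene: str
    maf: float     # population minor allele frequency, as a fraction
    depth: int     # read depth (DP) at the site
    is_het: bool   # True for a heterozygous genotype


def passes_filters(call: Call, max_maf: float = 0.05,
                   min_depth: int = 5) -> bool:
    """Keep calls with MAF <= max_maf and DP >= min_depth."""
    return call.maf <= max_maf and call.depth >= min_depth


def rare_het_counts(calls: Iterable[Call], samples: List[str],
                    rare_maf: float = 0.005,
                    min_depth: int = 5) -> Dict[str, int]:
    """Count rare heterozygous calls per sample; samples without any get 0."""
    counts = {sample: 0 for sample in samples}
    for call in calls:
        if (call.sample in counts and call.is_het
                and passes_filters(call, rare_maf, min_depth)):
            counts[call.sample] += 1
    return counts


# Invented calls for illustration only (not study data).
CALLS = [
    Call("S1", "GENE_A", 0.001, 30, True),   # rare het, kept
    Call("S1", "GENE_B", 0.030, 12, True),   # passes MAF 0.05, not rare
    Call("S2", "GENE_C", 0.004, 4, True),    # rare but too shallow
    Call("S2", "GENE_D", 0.002, 18, True),   # rare het, kept
    Call("S3", "GENE_E", 0.200, 50, False),  # common, homozygous
]
SAMPLES = ["S1", "S2", "S3"]

kept = [call for call in CALLS if passes_filters(call)]
counts = rare_het_counts(CALLS, SAMPLES)
per_sample = list(counts.values())
print(len(kept), counts, mean(per_sample), variance(per_sample))
```

Initializing every sample to zero matters here: samples with no rare heterozygous calls still contribute to the mean and variance instead of silently dropping out.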

Samples and characteristics of AA samples

We examined heterozygous variant calls at low-coverage SNP/indel sites and further explored singleton mutations at a MAF cutoff of 0.05. The variants mapped across the genes showed a major difference in functional alleles compared with unaffected samples (Fig. 1/Supplementary Table 2). We observed significant mutations, including indels and missense, frameshift and nonsense variants with altered protein sequences, associated with various gene ontology (GO) processes. Across all samples, six genes were associated with the AA phenotype, including, as an exceptional case, one non-coding SNP in an intronic region of IFNG. We infer that three-fifths of these mutations, located on chromosomes 12, X and 8, are likely to be associated with AA. While these variants were found across the subjects, a majority of control/unaffected samples did not carry them. This could be because coverage was lower in certain samples, viz. 1, 11, 5, 6, 20, 19, 16, 15 and 28, and the resulting VCFs contained no more than 1 million SNPs. Among the genes, IFNG was not reported, while PIGA and NBS mutations were found significantly across all subpopulations, and those with different alleles were deemed novel. IFNG polymorphisms have been reported in relation to the clinical characteristics of AA and have also been linked to the efficacy of immunosuppressive therapy.^20,21

Effect of variants on CsA response

The overall variant detection rate was split between the six variants across the genes reported in the samples, viz. TERT (G/X), IFNG (T/C), PIGA (T/X) or (T/A), NBS1/NBN (T/X), MPL (G/C) and CYP3A5. These were checked for statistical significance as discussed earlier. The last of these, CYP3A5, is a gene known for its role in the mode of action of CsA immunosuppression, and the variant is reported in ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/variation/226021/). While a large number of the variants are in TERT, whether remission of the disease is associated with non-genetic TERT or genetic TERT could not be established, as the treatment period varied widely, from 2.5 months to four years, in follow-up cases. We also could not check germline mutations owing to the lack of family history/samples. A large number of patients, however, responded within 5 months of therapy, with partial remission (freedom from blood transfusion dependence) maintained (see Supplementary Table 1). For example, samples 40 and 51 were found to have TERT mutations; response to such intensified immunosuppression has not been widely reported, which points to the need for genotyping assays and clinical testing.

We found a low-confidence CYP3A5 variant (7:99672916; non-coding transcript), annotated as a splice acceptor variant, in sample 44¹³. We argue that although these variants are less significant and benign, the observations deserve clinical attention, as some of these variants have been associated with CsA response in end-stage renal failure, as described in the literature^14,15. It remains unclear why certain patients respond poorly to CsA and therefore depend on blood transfusion; such findings allow the pathophysiology and druggability to be studied in more detail. We attribute this to poor prognosis in some patients, which may largely be associated with somatic variant burden. We believe that this is also in agreement with increased mutation burdens where such characteristic mutational signatures are found in these cell types.^26,27 Furthermore, our telomerase assay indicates that a substantial number of cases show differences in telomere length when compared with the individual controls (see Supplementary information). The GRCh38: Chr3:169764948–169764951 variants seen in TERT are largely associated with pulmonary fibrosis and/or bone marrow failure (telomere-related), cellular senescence, homologous recombination, and autosomal dominant dyskeratosis congenita¹⁶. We find a large number of somatic variants enriched in these pathways, but because our exome data do not come from long-term follow-up of patients, this could be why the variants are not significant in them. In addition, we asked whether outcomes in AA differ significantly by sex; we observed relatively equal prevalence and prognosis among men and women.

Using WES, we screened for mutations in CsA responders and non-responders. To our knowledge, our study is the first report of pathogenic variants associated with AA samples from India. We hope to assess the mutational burden in a larger cohort in the near future.

Authors’ Contributions: SM and KMM conceptualized the study. NS, SaG and SG collected the samples. RP and AKM prepared libraries and performed NGS. PS analyzed the NGS samples, developed the strategy, and wrote the first draft. PS, KMM and SM proofread the manuscript.

Funding: SM and KMM are grateful to the Indian Council of Medical Research- Directorate of Health Research (ICMR-DHR) with the grant provided to them (Grant#GIA/70/2014-DHR dated 15-10-2014, IRIS No. 2012-25230).

Ethics and Informed consent: The subjects were recruited from the SMS Medical College, Jaipur (code:107/MC/EC/2012), India in accordance with a protocol approved by the institutional ethics committee (IEC) of the hospital. A written informed consent was duly taken from them.

Data Availability Statement: The raw reads are made available through sequence read archive project id at PRJNA780657. All Supplementary files (Sanger validation results) are available at https://drive.google.com/file/d/1Wgm-hFBlrCsGqv61knbNvbfRo-FWRokT/view?usp=sharing.

Acknowledgments: We gratefully acknowledge Dr. Neal Young, Chief of the Hematology Branch of the National Heart, Lung and Blood Institute and the Director of the Trans-NIH Center for Human Immunology, Autoimmunity and Inflammation, USA for critically reviewing the manuscript and providing his valuable comments. His inputs have allowed us to bring inherent insights into the manuscript. We thank Narendra Meena for helping us with the collection of samples.

Conflicts of Interest: None.

Levi M, Toh CH, Thachil J et al. Guidelines for the diagnosis and management of Aplastic Anaemia, British Committee for Standards in Haematology. Br J Haematol. 2009;145:24–33.
Shallis RM, Ahmad R, Zeidan AM. Aplastic anemia: Etiology, molecular pathogenesis, and emerging concepts. Eur J Haematol. 2018 Dec;101(6):711-720. doi: 10.1111/ejh.13153. Epub 2018 Oct 10. PMID: 30055055.
Young N. Aplastic Anemia. N Engl J Med 2018; 379:1643-1656
Tischkowitz MD, Hodgson SV. Fanconi anemia. Journal of Medical Genetics 2003;40:1-10.
Auerbach AD. Fanconi anemia and its diagnosis. Mutat Res. 2009 Jul 31;668(1-2):4-10. doi: 10.1016/j.mrfmmm.2009.01.013. Epub 2009 Feb 28. PMID: 19622403; PMCID: PMC2742943.
Durrani J, Maciejewski JP. Idiopathic aplastic anemia vs hypocellular myelodysplastic syndrome. Hematology Am Soc Hematol Educ Program. 2019 Dec 6;2019(1):97-104. doi: 10.1182/hematology.2019000019. PMID: 31808900; PMCID: PMC6913491.
Savage SA, Viard M, O'hUigin C et al. Genome-wide Association Study Identifies HLA-DPB1 as a Significant Risk Factor for Severe Aplastic Anemia. Am J Hum Genet. 2020 Feb 6;106(2):264-271. doi: 10.1016/j.ajhg.2020.01.004. Epub 2020 Jan 30. PMID: 32004448; PMCID: PMC7010969.
Walne AJ, Dokal A, Plagnol V, Beswick R, Kirwan M, de la Fuente J, Vulliamy T, Dokal I. Exome sequencing identifies MPL as a causative gene in familial aplastic anemia. Haematologica. 2012; 97(4):524-8. doi: 10.3324/haematol.2011.052787.
Biswajit H, Pratim PP, Kumar ST et al. Aplastic anemia: a common hematological abnormality among peripheral pancytopenia. N Am J Med Sci. 2012;4(9):384-388. doi:10.4103/1947-2714.100980.
Mehta S, Krishnamohan M, Gulati S, Sharma N, Vashishtha P, Singh I. Detection of Mutations in TERT, the Genes for Telomerase Reverse Transcriptase, in Indian Patients of Aplastic Anaemia: A Pilot Study. J. Assoc. Physicians India. 2014; 62:13-17.
Meena N, Mathur P, Medicherla KM et al. A Bioinformatics Pipeline for Whole Exome Sequencing: Overview of the Processing and Steps from Raw Data to Downstream Analysis. Bio-101 2018; e2805. DOI: 10.21769/BioProtoc.2805.
G. Jun, M. Flickinger, K. N. Hetrick et al. Detecting and Estimating Contamination of Human DNA Samples in Sequencing and Array-Based Genotype Data, American journal of human genetics doi:10.1016/j.ajhg.2012.09.004 (volume 91 issue 5 pp.839 - 848)
Flores-Pérez C, Castillejos-López MJ, Chávez-Pacheco JL et al. The rs776746 variant of CYP3A5 is associated with intravenous midazolam plasma levels and higher clearance in critically ill Mexican paediatric patients. J Clin Pharm Ther. 2021 Jun;46(3):633-639. doi: 10.1111/jcpt.13388. Epub 2021 Feb 26. PMID: 33638195.
Chu XM, Hao HP, Wang GJ, et al. Influence of CYP3A5 genetic polymorphism on cyclosporine A metabolism and elimination in Chinese renal transplant recipients. Acta Pharmacol Sin. 2006; 27: 1504–1508. https://doi.org/10.1111/j.1745-7254.2006.00428.x.
Büscher AK, Beck BB, Melk A et al. German Pediatric Nephrology Association (GPN). Rapid Response to Cyclosporin A and Favorable Renal Outcome in Nongenetic Versus Genetic Steroid-Resistant Nephrotic Syndrome. Clin J Am Soc Nephrol. 2016;11(2):245-53. doi: 10.2215/CJN.07370715.
Yoshizato T, Dumitriu B, Hosokawa K, et al. Somatic Mutations and Clonal Hematopoiesis in Aplastic Anemia. N Engl J Med. 2015;373(1):35-47. doi:10.1056/NEJMoa1414799.
Heuser M, Schlarmann C, Dobbernack V, Panagiota V, Wiehlmann L, Walter C, Beier F, Ziegler P, Yun H, Kade S, Kirchner A, Huang L, Koenecke C, Eder M, Brümmendorf TH, Dugas M, Ganser A, Thol F. Genetic characterization of acquired aplastic anemia by targeted sequencing. Haematologica. 2014 Sep;99(9):e165-7. doi: 10.3324/haematol.2013.101642. Epub 2014 Jun 6. PMID: 24907358; PMCID: PMC4562551.
Singh I, Nunia V, Sharma R, Barupal J, Govindaraj P, Jain R, Gupta GN, Goyal PK. Mutational analysis of telomere complex genes in Indian population with acquired aplastic anemia. Leuk Res. 2015 Sep 7:S0145-2126(15)30370-2. doi: 10.1016/j.leukres.2015.08.018. Epub ahead of print. PMID: 26360549.
Zhang Y, Zhang Y, Ge H, Li N, Liu C, Wang T, Fu R, Shao Z. Identification of potential pathogenic genes for severe aplastic anemia by whole-exome sequencing. J Clin Lab Anal. 2022 May;36(5):e24438. doi: 10.1002/jcla.24438. Epub 2022 Apr 18. PMID: 35435273; PMCID: PMC9102512.
Bestach Y, Sieza Y, Attie M, Riccheri C, Verri V, Bolesina M, Bengió R, Larripa I, Belli C. Polymorphisms in TNF and IFNG are associated with clinical characteristics of aplastic anemia in Argentinean population. Leuk Lymphoma. 2015 Jun;56(6):1793-8. doi: 10.3109/10428194.2014.966707. Epub 2015 Jan 21. PMID: 25248876.
Mortazavi Y, Merk B, McIntosh J, Marsh JC, Schrezenmeier H, Rutherford TR; BIOMED II Pathophysiology and Treatment of Aplastic Anaemia Study Group. The spectrum of PIG-A gene mutations in aplastic anemia/paroxysmal nocturnal hemoglobinuria (AA/PNH): a high incidence of multiple mutations and evidence of a mutational hot spot. Blood. 2003 Apr 1;101(7):2833-41. doi: 10.1182/blood-2002-07-2095. Epub 2002 Nov 7. PMID: 12424196.
Boddu PC, Kadia TM. Molecular pathogenesis of acquired aplastic anemia. European Journal of Haematology. 2018; 102(2): 103-110. doi: 10.1111/ejh.13182
Kulasekararaj AG, Jiang J, Smith AE, et al. Somatic mutations identify a subgroup of aplastic anemia patients who progress to myelodysplastic syndrome. Blood. 2014; 124: 2698- 2704.
Cong YS, Wright WE, Shay JW. Human telomerase and its regulation. Microbiol Mol Biol Rev 2002;66:407-425
Yamaguchi H, Calado RT, Ly H, Kajigaya S, Baerlocher GM, Chanock SJ, Lansdorp PM, Young NS. Mutations in TERT, the gene for telomerase reverse transcriptase, in aplastic anemia. N Engl J Med. 2005 Apr 7;352(14):1413-24. doi: 10.1056/NEJMoa042980. PMID: 15814878.
Robinson, P.S., Coorens, T.H.H., Palles, C. et al. Increased somatic mutation burdens in normal human cells due to defective DNA polymerases. Nat Genet 53, 1434–1442 (2021). https://doi.org/10.1038/s41588-021-00930-y
Maciejewski JP, Balasubramanian SK. Clinical implications of somatic mutations in aplastic anemia and myelodysplastic syndrome in genomic age. Hematology Am Soc Hematol Educ Program. 2017 Dec 8;2017(1):66-72. doi: 10.1182/asheducation-2017.1.66. PMID: 29222238; PMCID: PMC6142555.
Steensma DP. Clinical consequences of clonal hematopoiesis of indeterminate potential. Hematology Am Soc Hematol Educ Program. 2018 Nov 30;2018(1):264-269. doi: 10.1182/asheducation-2018.1.264. PMID: 30504320; PMCID: PMC6245996.
Killick SB, Bown N, Cavenagh J, Dokal I, Foukaneli T, Hill A, Hillmen P, Ireland R, Kulasekararaj A, Mufti G, Snowden JA, Samarasinghe S, Wood A, Marsh JC; British Society for Standards in Haematology. Guidelines for the diagnosis and management of adult aplastic anaemia. Br J Haematol. 2016 Jan;172(2):187-207. doi: 10.1111/bjh.13853. Epub 2015 Nov 16. Erratum in: Br J Haematol. 2016 Nov;175(3):546. PMID: 26568159.
Chin, CH., Chen, SH., Wu, HH. et al. cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst Biol 8 (Suppl 4), S11 (2014). https://doi.org/10.1186/1752-0509-8-S4-S11
Kamat, G., Renukaradhya K Math, Goni, D., Balikai, G. ., Savanur, A. ., Mudennavar, N. ., & Palaksha Kanive Javaregowda. (2022). Use of Cyclosporine A and danazol in treatment of aplastic anemia: A real-world data from a teaching hospital in South India. Asian Journal of Medical Sciences, 13(10), 223–226. https://doi.org/10.3126/ajms.v13i10.45513
Jaime-Pérez, J.C., Colunga-Pedraza, P.R., Gómez-Ramírez, C.D. et al. Danazol as first-line therapy for aplastic anemia. Ann Hematol 90, 523–527 (2011). https://doi.org/10.1007/s00277-011-1163-x
Townsley DM, Dumitriu B, Liu D, Biancotto A, Weinstein B, Chen C, Hardy N, Mihalek AD, Lingala S, Kim YJ, Yao J, Jones E, Gochuico BR, Heller T, Wu CO, Calado RT, Scheinberg P, Young NS. Danazol Treatment for Telomere Diseases. N Engl J Med. 2016 May 19;374(20):1922-31. doi: 10.1056/NEJMoa1515319. PMID: 27192671; PMCID: PMC4968696.

No competing interests reported.
