DOI: 10.1055/s-0042-1749345
Long-Read Sequencing Identifies the First Retrotransposon Insertion and Resolves Structural Variants Causing Antithrombin Deficiency
Funding
This work was supported by the National Institute for Health Research England (NIHR) for the NIHR BioResource project (grant numbers RG65966 and RG94028), the PI18/00598, PI21/00174, and PMP21/00052 projects (Instituto de Salud Carlos III, FEDER & Next Generation), and the 21642/PDC/21 project (Fundación Séneca).
Abstract
The identification of inherited antithrombin deficiency (ATD) is critical to prevent potentially life-threatening thrombotic events. Causal variants in SERPINC1 are identified for up to 70% of cases, the majority being single-nucleotide variants and indels. The detection and characterization of structural variants (SVs) in ATD remain challenging due to the high number of repetitive elements in SERPINC1. Here, we performed long-read whole-genome sequencing on 10 familial and 9 singleton cases with type I ATD proven by functional and antigen assays, who were selected from a cohort of 340 patients with this rare disorder because genetic analyses were either negative, ambiguous, or not fully characterized. We developed an analysis workflow to identify disease-associated SVs. This approach resolved, independently of their size or type, all eight SVs detected by multiplex ligation-dependent probe amplification, and identified for the first time a complex rearrangement previously misclassified as a deletion. Remarkably, we identified the mechanism explaining ATD in 2 out of 11 cases with a previously unknown defect: the insertion of a novel 2.4 kb SINE-VNTR-Alu retroelement, which was characterized by de novo assembly and verified by specific polymerase chain reaction amplification and sequencing in the probands and affected relatives. The nucleotide-level resolution achieved for all SVs allowed breakpoint analysis, which revealed repetitive elements and microhomologies supporting a common replication-based mechanism for all the SVs. Our study underscores the utility of long-read sequencing technology as a complementary method to identify, characterize, and unveil the molecular mechanism of disease-causing SVs involved in ATD, and enlarges the catalogue of genetic disorders caused by retrotransposon insertions.
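To make the idea of breakpoint microhomology concrete, the following is a minimal sketch of how shared bases at the two junctions of a simple deletion could be measured once breakpoints are known at nucleotide resolution. It is an illustration only and is not taken from the authors' workflow; the function name `breakpoint_microhomology`, the 0-based half-open coordinate convention, and the toy sequences are all assumptions made for this example.

```python
def breakpoint_microhomology(ref: str, start: int, end: int) -> str:
    """Return the microhomology at the junctions of a deletion of ref[start:end].

    Hypothetical helper: coordinates are 0-based and half-open. The
    microhomology is the run of bases that is identical at both breakpoints,
    so the deletion could be placed anywhere within that run and still
    produce the same derived sequence.
    """
    if not 0 <= start < end <= len(ref):
        raise ValueError("expected 0 <= start < end <= len(ref)")

    # Bases shared immediately to the right of both breakpoints.
    right = 0
    while end + right < len(ref) and ref[start + right] == ref[end + right]:
        right += 1

    # Bases shared immediately to the left of both breakpoints.
    left = 0
    while start - 1 - left >= 0 and ref[start - 1 - left] == ref[end - 1 - left]:
        left += 1

    return ref[start - left:start + right]


if __name__ == "__main__":
    # Toy sequence: deleting ref[2:9] leaves "ACG" ambiguous at the junction.
    toy_ref = "TTACGGGGGACGCC"
    homology = breakpoint_microhomology(toy_ref, 2, 9)
    print(homology, len(homology))
```

A non-empty result means the exact breakpoint position is ambiguous within that run, which is the kind of signature commonly interpreted as consistent with replication-based or microhomology-mediated mechanisms. Real analyses would also need to handle insertions, inversions, and complex rearrangements, which this sketch does not attempt.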
Author Contributions
B.M.-B., W.H.O., J.C., and A.S.-J. designed the study. M.M.B., L.S., J.P., A.M., N.G., F.L.R., and V.V. helped with the study design. B.M.-B., M.M.B., J.P., and A.M. performed laboratory experiments and analyzed the experimental data. J.S. performed sample preparation and executed long-read sequencing. A.S.-J. developed the analysis workflow for long-read sequencing, applied this to data processing, and performed the computational and statistical analyses. B.M.-B. performed computational analyses and variant validation. J.J.L.C. and F.V. provided valuable insight into microarray and NGS data analysis. A.U., M.F., M.P., and P.M. recruited participants and collected the clinical data and samples. B.M.-B., W.H.O., J.C., and A.S.-J. wrote the manuscript. All authors read and approved the final version of the manuscript.
Data and Code Availability
The workflow developed for the detection of structural variants is publicly available at http://github.com/who-blackbird/magpie.
Patient Consent
All included subjects gave their written informed consent to enter the study.
Ethical Approval
This study was approved by the Ethics Committee of Morales Meseguer Hospital and the East of England Cambridge South National Institutional Review Board (13/EE/0325). The research conforms to the principles of the Declaration of Helsinki and its later amendments.
Publication History
Received: 16 September 2021
Accepted: 10 January 2022
Article published online: 28 June 2022
© 2022. The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Georg Thieme Verlag KG
Rüdigerstraße 14, 70469 Stuttgart, Germany