Criteria for the Interpretation of mRNA Analyses
The criteria of the German Consortium for Hereditary Breast and Ovarian Cancer for the evaluation of sequence variants with subsequent mRNA analysis are based on the guidelines of the ENIGMA Consortium (see also [7 ]). The respective threshold values of potential splice variants for an empirical predictive prognosis based on three commonly used predictive programmes are given in Appendix A 1. Appendix A 4 ([Fig. 2 ]) gives a schematic representation of the areas considered by the VUS Task Force when evaluating the splice variants. mRNA analysis is carried out using fresh blood samples, cultured lymphocytes, cultured lymphoblastoid cell lines, etc. and compared in parallel with at least 5 controls of the same type of material. A sequence variant is described as pathogenic if it has the following effect on mRNA transcription: one or more aberrant transcripts of the variant allele are detected, which lead to a stop codon or an in-frame deletion and result in the destruction of known functional domains. Sequencing of the full-length transcript of the variant allele or the presence of an intronic variant of a cis-acting polymorphism is considered sufficient (evidence of monoallelic expression) to determine the transcript amount (using semi-quantitative or quantitative methods). Variants which show a transcript pattern comparable to the mean value of controls are rated as neutral/not pathogenic due to the lack of aberrant mRNA. As regards the cDNA primer design, the physiological splice variants/naturally occurring isoforms must also be taken into account (see [8 ], [9 ]).
Fig. 2 Classification of variants which can affect splicing.
Caution: Certain BRCA1 and BRCA2 variants which are ± 1, 2 bp from the exon border and which are predicted or proven to lead to at least 20 – 30% naturally occurring in-frame RNA isoforms per allele could presumably result in some residual protein activity (see [9 ], [10 ], [11 ], [12 ]) and are therefore classified as VUS Class3, unless there is evidence to the contrary (see Overview, Appendix 5, [Table 5 ])
Table 5 Excerpt from https://enigmaconsortium.org/wp-content/uploads/2018/10/ENIGMA_Rules_2017-06-29-v2.5.1.pdf .
Table 6: BRCA1 and BRCA2 exon boundary variants predicted or known to lead to naturally occurring in-frame RNA isoforms that may rescue gene functionality. Variants at these positions should be considered Class3 Uncertain unless proven otherwise.*
Gene
Alternative splicing event
Variants implicated
Rationale
* This summary table does not yet capture the possibility of acceptor site changes leading to small in-frame deletions > 3 bp, e.g. due to NAG (NNN)n NAG sites. It is recommended that bioinformatic prediction analysis is carried out for variation in/near all donor and acceptor sites to assess the likelihood that a variant will or will not cause alternative splicing.
Note: It could be argued that nonsense or frameshift variants in BRCA1 exon 9, BRCA1 exon 10, or BRCA2 exon 12 may not be associated with high risk of cancer due to rescue by the expression of in-frame naturally occurring isoforms that bypass the premature termination codon and thus encode a functional protein. A review of multiple clinical and control datasets for the frequency of unique nonsense or frameshift variants – adjusted for exon size – does not provide strong support for this hypothesis at present (Spurdle, de la Hoya, unpublished data). Additional research is underway to further investigate the functional/clinical importance of germline nonsense or frameshift variants in these exons.
Moreover, further work is planned within ENIGMA (led by Paolo Radice) to document variants that have undergone splicing assays and are proven to be “leaky” variants, to provide a record of all spliceogenic variants for which additional research is necessary. This resource will identify variants that have already been classified using clinical data, as positive and negative controls for future quantitative mRNA studies.
BRCA1
Δ8p
c.442-1 (IVS7-1)
c.442-2 (IVS7-2)
BRCA1 exon 8 acceptor site is an experimentally validated tandem acceptor site (NAGNAG) subject to alternative splicing (Colombo et al., 2014). c.442-1,-2 variants are predicted to inactivate the 5′ acceptor site, but not the 3′ acceptor site, thus producing Δ8p transcripts.
Δ9,10
c.548-1 (IVS8-1)
c.548-2 (IVS8-2)
c.593 to non-G
c.593+1 (IVS9+1)
c.593+2 (IVS9+2)
c.594-1 (IVS9-1)
c.594-2 (IVS9-2)
c.670 to non-G
c.670+1 (IVS10+1)
c.670+2 (IVS10+2)
Carriers of variants at these positions are predicted to produce normal (or increased) levels of BRCA1 Δ(9,10), a major in-frame alternative splicing event (Colombo et al., 2014).
The BRCA1 variant c.594-2A>C (shown from ENIGMA research to co-occur in cis with c.641A>G), has been reported to demonstrate clinical characteristics inconsistent with a high risk of cancer expected for a pathogenic BRCA1 variant (Rosenthal et al., 2015). The haplotype of c.[594-2A>C; 641A>G] has been shown from mRNA analysis in human samples to produce high levels of Δ10 transcripts (70% of the overall expression, and has been designated as Class1 Not Pathogenic by the ENIGMA Consortium using multifactorial likelihood analysis that includes genetic (segregation, case-control analysis) and pathology data (de la Hoya et al., 2016).
Δ11q, Δ11
c.4096 to non-G
c.4096+1 (IVS11+1)
c.4097+2 (IVS11+2)
Data collected by the ENIGMA consortium demonstrates that the BRCA1 c.4096+1G>A variant, proven to result in the production of naturally occurring in-frame transcripts Δ11q (Bonatti et al., 2006) and also Δ11 (Radice, unpublished data), may not exhibit the clinical characteristics of a standard high-risk pathogenic BRCA1 variant (Spurdle, unpublished data).
Δ13p
c.4186-1 (IVS12-1)
c.4186-2 (IVS12-2)
BRCA1 exon 13 acceptor site is an experimentally validated tandem acceptor site (NAGNAG) subject to alternative splicing (Colombo et al., 2014). c.4186-1,-2 variants are predicted to inactivate the 5′ acceptor site, but not the 3′ acceptor site, thus producing Δ13p transcripts.
Δ14p
c.4358-1 (IVS13-1)
c.4358-2 (IVS13-2)
BRCA1 exon 14 acceptor site is an experimentally validated tandem acceptor site (NAGNAG) subject to alternative splicing (Colombo et al., 2014). c.4358-1,-2 variants are predicted to inactivate the 5′ acceptor site, but not the 3′ acceptor site, thus producing Δ14p transcripts.
BRCA2
Δ12
c.6842-1 (IVS11-1)
c.6842-2 (IVS11-2)
c.6937 to non-G
c.6937+1 (IVS12+1)
c.6937+2 (IVS12+2)
Carriers of these variants are predicted to produce exon 12 skipping. BRCA2 Δ12 is a naturally occurring in-frame splicing event (Fackenthal et al., 2016). BRCA2 exon 12 is functionally redundant (Li et al., 2009).
Approach of the VUS Task Force
The Consortium recommends routinely testing for 10 genes which are known (as per 8/19) to be associated with breast and/or ovarian cancer: ATM, BRCA1, BRCA2, BRIP1, CDH1, CHEK2, PALB2, RAD51C, RAD51D and TP53 (http://www.konsortium-familiaerer-brustkrebs.de/ ).
The Consortiumʼs panel of experts (VUS Task Force) holds monthly telephone conferences and, if necessary, meetings to reach a consensus on the classification of newly reported sequence variants, discuss any new evidence available for the re-evaluation of already known variants, and evaluate variants of unclear significance. In the event of a re-evaluation, the central database of the Consortium will inform all centres about the reclassification (recall system).
It should be expressly noted that new information can lead to changes in the classification of variants and that these classifications are regularly reviewed by the panel of experts. Similarly, new findings can lead to changes in the list of core genes for which the German Consortium for Hereditary Breast and Ovarian Cancer recommends that patients are tested. All such changes along with the inclusion of new findings into the classification are published on the homepage of the German Consortium for Hereditary Breast and Ovarian Cancer (
http://www.konsortium-familiaerer-brustkrebs.de/
).
Classification of Sequence Variants According to Their Functional Relevance
1. Class1 (functionally irrelevant/no loss of function) if one of the following criteria is met:
1.1 Allele frequency of variants in large population groups (e.g. Caucasians, Africans, or Asians) is ≥ 1% (minor allele frequency [MAF] ≥ 0.01). Caution: An allele frequency of ≥ 1% in subpopulations with a low-diversity gene pool (examples: Finnish population, founder mutations!) is not sufficient.
1.2 Variants with a calculated multifactorial probability of < 0.001 of being pathogenic.
Caution: This currently only applies to the high-risk genes BRCA1/2 (for an exemplary calculation, see [13 ]).
1.3 Variants in high-risk genes which occur in at least 10 individuals in appropriate cohorts of persons without disease ([Table 1 ]).
2. Class2 (probably no loss of function/functionally irrelevant) if one of the following criteria are met:
2.1 Allele frequency of variants in large population groups (e.g. Caucasians, Africans, or Asians) is 0.5 – 1% (MAF 0.005 – 0.01) Caution: An allele frequency of 0.5 – 1% in subpopulations with a low-diversity gene pool (examples: Finnish population, founder mutations!) is not sufficient.
2.2 Exonic variants (A) which result in substitution of an amino acid (missense variants) or small in-frame insertions/deletions (insertions/deletions of one or fewer amino acid [s]) and whose a priori probability of pathogenicity is ≤ 2% (A-GVGD analysis, http://priors.hci.utah.edu/PRIORS/ ); intronic variants (B) which are more than − 20 bp, + 10 bp from the exon border ; and synonymous variants (C) if these variants (A – C) will, according to bioinformatic prediction programmes (see Appendix A 1), in all probability not change the splicing mechanism. In non-BRCA1/2 genes, the above-mentioned variants must be present in large population groups with an allele frequency of 0.001 ≤ MAF < 0.01.
2.3 Synonymous substitutions or intronic variants which show no mRNA aberrations in the form of exon deletions/duplications or monoallelic expression of the wildtype (wt) transcript in “in-vitro” laboratory tests even if, according to bioinformatic prediction programmes (see Appendix A 1 for programmes and threshold values), in all probability they change the splicing mechanism.
2.4 Variants which occur in the same gene with a clearly pathogenic variant in trans (co-occurrence), if it has been verified that a homozygous or compound heterozygous genotype is associated with a known, clinically unambiguous phenotype.
2.5 Variants with a calculated multifactorial probability of pathogenicity of 0.001 – 0.049.
Caution: This currently only applies to the high-risk genes BRCA1/2 (for an exemplary calculation, see: [13 ]).
2.6 Exon variants which code for the same amino acid exchange as a sequence variant which has already been classified as Class1 but are based on a different nucleotide exchange if no aberrant splicing is predicted.
2.7 Missense variants for which information from functional analyses, etc., is available but not sufficient for multifactorial classification and which have been classified as Class2 by a panel of experts (e.g. ENIGMA).
3. Class3 (unclear functional relevance) if one of the following criteria is met: variants which cannot be unambiguously assigned to Class1, Class2, Class4, or Class5, e.g.:
3.1 Special cases which could be assigned to one of the other classes based on the evaluation criteria but are listed in Appendix A 5 among the characteristics of individual core genes or in Table 5, Appendix of BRCA1/2 classification criteria, Version 2.5.1, July 2017 (ENIGMA) ([Table 5 ]).
3.2 Variants where the data used for their evaluation is contradictory and for which further studies are still required.
3.3 Variants which are − 20 bp, + 10 bp from the exon border and which, based on bioinformatic predictive programmes (see Appendix A 1), probably change the splicing mechanism as long as no in-vitro mRNA analysis has been done yet ([Fig. 2 ], Schematic representation of variants in the vicinity of splice sites).
3.4 Exon duplications which have not been analysed further (e.g. break point analysis, cDNA analysis, etc.).
3.5 Variants with a calculated multifactorial probability of pathogenicity of 0.05 – 0.949.
Caution: This currently only applies to the high-risk genes BRCA1/2 (for an exemplary calculation, see [13 ]).
4. Class4 (probable loss of function/functionally relevant) if one of the following criteria is met:
4.1 Variants with a calculated multifactorial probability of pathogenicity of 0.95 – 0.99.
Caution: This currently only applies to the high-risk genes BRCA1/2 (for an exemplary calculation, see Goldgar et al., 2004 [13 ]).
4.2 Variants which code for a premature termination of protein biosynthesis (nonsense or frameshift variants) and do not necessitate the loss of known clinically relevant functional protein domains as long as the location of the stop codon is not downstream from the Nonsense-mediated decay-(NMD-)relevant site, 50 base pairs before the end of the penultimate exon.
4.3 Intronic variants in position ± 1,2 or G > non-G in the last position of the exon: if there is a positive splicing prediction (see Appendix A 1) and the first 6 bases in the intron are not GTRRGT and an aberrant in-vitro mRNA analysis is not yet available (i.e. has not [yet] been confirmed by a panel of experts or the pathomechanism of loss of function has been confirmed to be exon skipping or allele-specific transcript expression).
Exceptions:
A cryptic splice site (AG/GT) in the vicinity is activated and the (predicted) new exon is spliced in-frame (→ Class3)
The (predicted) skipped exon (or exons) is alternatively spliced in significant quantities (→ Class3) .
The (predicted) skipped exon (or exons) is spliced in-frame and contains no known functional domain (→ Class3)
4.4 Variants which code for the same amino acid exchange as pathogenic missense variants which have already been categorised as Class5 but are caused by another nucleotide exchange and for which there is no positive splicing prediction (see Appendix A 1).
4.5 In-frame deletions (even for just one amino acid) which lead to the loss of a missense variant already categorised as Class5 and which result in the interruption of known, functionally important domains.
4.6 Extensive in-frame deletions which lead to the interruption/loss of known, functionally important domains.
4.7 In-frame insertions verified by in-vitro mRNA analysis which result in the interruption of functionally important domains.
4.8 Variants which lead to mutations of the translation initiation codon (AUG, methionine) and for which there is no evidence (e.g. an alternative start codon in the immediate vicinity) which would support an alternative classification.
4.9 Variants for which information from functional analyses, clinical data, etc., is available but insufficient for a multifactorial classification and which are categorised as Class4 by a panel of experts (e.g. ENIGMA).
5. Class5 (loss of function/functionally relevant) if one of the following criteria are met:
5.1 Variants which code for a premature termination of protein biosynthesis (nonsense or frameshift variants) which prevents the expression of known, clinically relevant, functional protein domains.
5.2 Variants with a calculated multifactorial probability of pathogenicity of > 0.99.
Caution: This currently only applies to the high-risk genes BRCA1/2 (for an exemplary calculation, see Goldgar et al., 2004 [13 ]).
5.3 Splice variants for which a frameshift effect was established by in-vitro mRNA analysis, which leads to a premature termination of protein biosynthesis and prevents the expression of known, clinically relevant, functional protein domains and for which a wild-type transcript of the mutated allele has not been confirmed (monoallelic expression).
5.4 Splice variants for which in-vitro mRNA analysis detected an in-frame deletion/insertion which leads to the interruption or loss of a known, clinically relevant domain or functional inactivation through changes of the protein structure and for which a wild-type transcript of the mutated allele has not been verified (monoallelic expression).
5.5 Copy-number deletion variants which result in the interruption or loss of one or more exons with known, clinically relevant functional domains or lead to a reading frameshift, which results, according to the prediction, in the inactivation of known, clinically relevant, functional domains.
5.6 Copy-number duplication variants of any size, confirmed by laboratory analysis, which duplicate one or more exons and lead to a reading frameshift, which results, according to the prediction, in the inactivation of known, clinically relevant, functional domains.
Appendix
A 1. Splicing prediction programmes and their threshold values
The splicing prediction programmes MaxEntScan (MES), Splice Site Finder (SSF), and Human Splicing Finder (HSF) are considered relatively reliable and should therefore be used to evaluate possible effects on the splicing process. MaxEntScan results are considered non-normal for a deviation of delta of ≥ 15% [14 ], Human Splicing Finder for a delta of ≥ 4.1% [10 ] and Splice Site Finder for a deviation of delta of ≥ 5% [14 ]. An mRNA analysis should be done for evaluation in cases with non-normal prediction (at least two of the three programmes mentioned below). The precondition is that the physiological splice site is recognised by the respective prediction software based on the following threshold values.
The threshold values (calculated for BRCA1/2 [14 ]) are:
MES > 3
SSF > 60
HSF > 80
An approximation of these threshold values can also be for the other genes. Once specific threshold values have been defined, then the defined threshold values must be used.
A 2.
[Tables 1 ] and [2 ].
A 3. Evaluation Criteria Flow Chart
[Fig. 1 ].
A 4. Schematic Representation of Variants in the Vicinity of Splice Sites
[Fig. 2 ].
The use of predictive programmes to assess possible splicing effects is obligatory for all new mutations including stop mutations as “rescue” effects can occur through alternative transcripts.
A 5. Characteristics of Individual Genes
The above-mentioned general evaluation criteria should apply to all genes. However, some exceptions, variations and special features are present in specific variants and regions of individual genes, which, for the sake of clarity, are listed below.
A 5.1 BRCA1/2
The following should be categorised as Class3: truncating BRCA1 mutations after the amino acid position 1854 and truncating BRCA2 variants after amino acid position 3308 (they are not categorised as Class1, as structural mutations cannot be excluded). Exception: truncating variants after the polymorphic stop codon p.(Lys3326*) are classified as dispensable/neutral (Class1) [17 ] and ENIGMA: p.(Lys3326*) is a frequently detected polymorphism which is not associated with a higher risk, OR 1.3 – 1.5 depending on breast or ovarian cancer. This means that variants which lead to a stop downstream from p.(Lys3326*) will also not be associated with an increased risk of developing disease.
The following should be categorised as Class5 in BRCA1/2 : all truncating BRCA1 variants up to the last mutation unequivocally identified as pathogenic at amino acid position 1853 [18 ] and all truncating BRCA2 variants up to amino acid position 3308, c.9924C>G [19 ]. See ENIGMA BRCA1,2 functional domains, [Tables 3 ] and [4 ]. Caution: Be aware of the potential impact of NMD; the last 50 bp in the penultimate exon and variants in the last exon usually are usually not subject to NMD. The predictive value of RNA analysis of blood may be limited as it does not involve the target tissue.
Table 3 Excerpt from https://enigmaconsortium.org/wp-content/uploads/2018/10/ENIGMA_Rules_2017-06-29-v2.5.1.pdf .
Table 3: Catalogue of BRCA1 conserved domains/motifs and currently known, clinically important amino acid residues, and relevance for classification of BRCA1 in-frame and terminal exon sequence variants.
Domain/Motif
AA start
AA end
AA alterations with demonstrated clinical importancea
Classification of in-frame deletions targeting domain/motifs
References and summary interpretationa
RING
1
101
L22S (c.65T>C [p.Leu22Ser])
T37K (c.110C>A [p.Thr37Lys])
C39R (c.115T>C [p.Cys39Arg])
H41R (c.122A>G [p.His41Arg])
C44S (c.130T>A [p.Cys44Ser])
C44Y (c.131G>A [p.Cys44Tyr])
C61G (c.181T>G [p.Cys61Gly])
Class5 if at least one clinically relevant residue is removed. Otherwise Class3.
http://www.ncbi.nlm.nih.gov/protein/15988069 ; http://hci-exlovd.hci.utah.edu ; multifactorial analysis for H41R (c.122A>G [p.His41Arg]) (Whiley et al., 2014).
NES
81
99
None reported
Class3
Domain location description (Rodriguez and Henderson, 2000).
NLS1
503
508
None reported
Class3
Domain location description (Chen et al., 1996, Thakur et al., 1997).
NLS2
607
614
None reported
Class3
Domain location description (Chen et al., 1996, Thakur et al., 1997).
NLS3
651
656
None reported
Class3
Domain location description (Chen et al., 1996).
COILED-COIL
1391
1424
None reported
Class3
Domain location description (Hu et al., 2000).
BRCT DOMAINS
1650
1863
T1685A (c.5053A>G [p.Thr1685Ala])
T1685I (c.5054C>T [p.Thr1685Ile])
V1688del (c.5062_5064del [p.Val1688del])
R1699W (c.5095C>T [p.Arg1699Trp])
G1706E (c.5117G>A [p.Gly1706Glu])
A1708E (c.5123C>A [p.Ala1708Glu])
S1715R (c.5143A>C [p.Ser1715Arg])
G1738R (c.5212G>A [p.Gly1738Arg])
L1764P (c.5291T>C [p.Leu1764Pro])
I1766S (c.5297T>G [p.Ile1766Ser])
M1775K (c.5324T>A [p.Met1775Lys])
M1775R (c.5324T>G [p.Met1775Arg])
C1787S (c.5359T>A [p.Cys1787Ser])
G1788V (c.5363G>T [p.Gly1788Val])
V1838E (c.5513T>A [p.Val1838Glu])
Class5 if at least one clinically relevant residue is removed. Otherwise Class3.
Domain boundaries derived from X-ray crystallography data are aa1646-1863 (1T15, http://www.ncbi.nlm.nih.gov/Structure/mmdb/mmdbsrv.cgi?uid=27907 ), and ENIGMA functional assay data (Monteiro, unpublished).
Digestion data indicate aa1860-1863 are dispensable based on susceptibility to digestion (Lee et al., 2010), while pathogenic variant data indicate that 1855-1862 are dispensable (Hayes et al., 2000). Position 1854 is implicated as clinically important by the observation that Y1853X (c.5559C>G [p.Tyr1853Ter]) is a recognised high-risk pathogenic variant.
These combined data indicate that position 1854 or 1855 is the C-terminal border of the BRCT/BRCA1 relevant for the clinical interpretation of sequence variants in exon 24 of BRCA1. That is, a variant predicted to disrupt expression of protein sequence only downstream* of position 1855 would not be considered clinically important.
a Missense substitutions in specific functional domains that are designated as Class5 pathogenic based on multifactorial likelihood of the posterior probability of pathogenicity > 0.99 (listed in http://hci-exlovd.hci.utah.edu or individual references), and which have no/little effect on the mRNA transcript profile, unless the variant results in an aberrant transcript that encodes a discrete in-frame deletion considered informative for the definition of clinically important domains.
* Typo was corrected in version 2.5.1.
Note: The following pathogenic exonic variants known to alter mRNA splicing have been excluded from Table 3 above, as justified below:
Variant
mRNA Change
Predicted protein change
Reason for exclusion
BRCA1 R1495M (c.4484G>T [p.Arg1495Met])
r.[4358_4484del, 4358_4675del]
p.(Ala1453Glyfs Ter10) – predominant transcript
Predominant alternate transcript is out of frame. Loss of function is assumed due to loss of full-length transcript from variant allele (Houdayer et al., 2012, Colombo et al., 2013, Santos et al., 2014).
BRCA1 E1559K (c.4675G>A [p.Glu1559Lys])
r.[4665_4675del]
p.(Gln1366Alafs Ter13)
Alternate transcript is out-of-frame. Level of full-length transcript not assessed (Wappenschmidt et al., 2012).
BRCA1 A1623G (c.4868C>G [p.Ala1623Gly])
r.[4868_4986del]
p.(Ala1623Aspfs Ter16)
Alternate transcript is out of frame. Variant allele produces some full-length transcripts (Walker et al., 2010).
BRCA1 D1692N (c.5074G>A [p.Asp1692Asn])
r.[4987_5074del, 5074_5075ins5074+1_5074+153]
p.(Val1665Serfs Ter8) – predominant transcript
Predominant alternate transcript, based on minigene assay (Ahlborn et al., 2015), is out of frame.
Table 4 Excerpt from https://enigmaconsortium.org/wp-content/uploads/2018/10/ENIGMA_Rules_2017-06-29-v2.5.1.pdf .
Table 4: Catalogue of BRCA2 conserved domains/motifs and currently known clinically important amino acid residues, and relevance for classification of BRCA2 in-frame and terminal exon sequence variants.
Domain/Motif
AA start
AA end
AA alterations with demonstrated clinical importancea
Classification of in-frame deletions targeting domain/motifs
References and summary interpretationa
PALB2 Binding
10
40
None reported
Class3
Domain location description (Oliver et al., 2009, Xia et al., 2006)
BRC-1
1002
1036
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
BRC-2
1212
1246
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
BRC-3
1422
1453
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
BRC-4
1518
1549
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
BRC-5
1665
1696
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
BRC-6
1837
1871
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
BRC-7
1971
2005
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
BRC-8
2051
2085
None reported
Class3
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2
DBD (DNA/DSS1 binding domain – helical, OB1, OB2, OB3)
2481
3186
W2626C (c.7878G>C [p.Trp2626Cys])
I2627F (c.7879A>T [p.Ile2627Phe])
E2663V (c.7988A>T [p.Glu2663Val])
T2722R (c.8165C>G [p.Thr2722Arg])
D2723G (c.8168A>G [p.Asp2723Gly])
D2723H (c.8167G>C [p.Asp2723His])
G2748D (c.8243G>A [p.Gly2748Asp])
I2778_Q2829del (c.8332_8487del [p.Ile2778_Gln2829del])
R3052W (c.9154C>T [p.Arg3052Trp])
Class5 if at least one clinically relevant residue (or all of AA2778-2829) is removed.
Otherwise Class3.
http://www.ncbi.nlm.nih.gov/protein/NP_000050.2 ; http://hci-exlovd.hci.utah.edu .
Pathogenic variant c.8486G>A (also recorded as Gln2829Arg) results in a transcript encoding an in-frame exon 19 deletion only (Houdayer et al., 2012), indicating that genetic variation encompassing loss of this entire exon (AA2778-2829) should be considered clinically important. The clinical impact of alteration/deletion of individual amino acids in exon 19 is not yet established.
NLS1
3263
3269
None reported
Class3
Domain local description (Guidugli et al., 2014)
BRC-9 or TR2
3265
3330
None reported
Class3
Note: although amino acids 3270-3305 within this fragment are reported to bind RAD51-DNA filaments (Davies and Pellegrini, 2007), there is no sequence conservation with the BRC repeats located between aa1002 and aa2014. Domain boundaries are derived from x-ray crystallography data are aa3265-3330 (Esashi et al., 2005, Esashi et al., 2007).
Case-control and frequency data indicate that BRCA2 c.9976A>T (p.Lys3326Ter) does not confer a high risk of cancer (OR 1.3–1.5, dependent on breast or ovarian cancer subtype (Meeks et al., 2016), demonstrating that residues at and downstream of 3327 are likely dispensable.
Position 3308 is implicated as clinically important by the observation that a nonsense variant c.9924C>G (p.Tyr3308Ter) is recognized as a high-risk pathogenic variant with known functional relevance ([Vallee et al., 2016]; Bayes score 1122 : 1 from a single large kConFab family, Spurdle unpublished data). There is currently no publicly available clinical information to support pathogenicity of nonsense or frameshift variants located between positions 3309 and 3325.
These data combined suggest that the C-terminal border of the BRC-9 relevant to the clinical interpretation of sequence variants in exon 27 of BRCA2 lies between 3309 and 3325. That is, a variant predicted to disrupt expression only of protein sequence downstream of position 3325 would be considered unlikely to be clinically important. Further functional and clinical studies are underway to refine risk, if any, for predicted nonsense or frameshift variants downstream of position 3326.
NLS2
3381
3385
No
Class3
Domain location description (Guidugli et al., 2014).
This domain is considered unlikely clinically relevant since it lies downstream of position 3326.
a Missense substitutions in denoted functional domains that are designated as Class5 pathogenic based on multifactorial likelihood posterior probability of pathogenicity > 0.99, and for which there is no/little effect on mRNA transcript profile –
unless
the variant results in an aberrant transcript that encodes a discrete in-frame deletion considered informative to definition of clinically important domains. (Splicing aberrations are reported for BRCA2 c.7988A>T [p.Glu2663Val] and c.8168A>G [p.Asp2723Gly] (Walker et al., 2010), but these did not lead to complete loss of function of the full length transcript), and missense alterations showed abrogation of functional activity using multiple assays (Walker et al., 2010). An additional conserved region not commonly recognized as a BRCA2 domain/motif is located AA 1110-1183, but no pathogenic missense substitutions have been recorded for this region.
Note – The following pathogenic exonic variants known to alter mRNA splicing have been excluded from Table 4 above, as justified below:
Variant
mRNA Change
Predicted protein change
Reason for exclusion
BRCA2 R2659K (c.7976G>A [p.Arg2659Lys])
r.[7806_7976del]
p.(Ala2603_ Arg2659del)
Alternate transcript is in-frame but level of full length transcript not assessed (Farrugia et al., 2008)
BRCA2 R2659T (c.7976G>C [p.Arg2659Thr])
r.[7806_7976del]
p.(Ala2603_ Arg2659del)
Alternate transcript is in-frame but level of full length transcript not assessed (Farrugia et al., 2008)
BRCA2 P3039P (c.9117G>A [p.Pro3039Pro])
r.[8954_9117del]
p.(Val2985 Glyfs*4)
Allele-specific assay shows out-of-frame transcript (Houdayer et al., 2012)
Other special features are listed in the Appendix of the evaluation guidelines of the ENIGMA Consortium, which can be accessed via the following link: https://enigmaconsortium.org/wp-content/uploads/2018/10/ENIGMA_Rules_2017-06-29-v2.5.1.pdf , Tables 3, 4 and 6 ([Tables 3 ] to [5 ]).
A 5.2 ATM
The evaluation criteria for ATM are based on a combination of the following criteria:
The 5-class IARC system for the assessment of the pathogenicity of BRCA1 and BRCA2 variants.
The 3-class system to evaluate the pathogenicity of ATM variants [20 ] which includes in silico analyses such as Align-GVGD.
The ACMG guidelines on the classification of variants [3 ], [21 ].
Additional literature: [22 ], [23 ], [24 ], [25 ], [26 ], [27 ], [28 ].
Class1:
If the allele frequency is ≥ 1% (MAF ≥ 0.01) in large population groups (e.g. Caucasians, Africans, or Asians) or there is evidence of homozygous variant carriers in control populations. If this is the case, then the variant is always categorised as Class1. An allele frequency of ≥ 1% in subpopulations with a low-diversity gene pool (examples: Finnish population, founder mutations!) is not sufficient.
Class2:
If the allele frequency is ≥ 0.5–< 1% (MAF ≥ 0.005 – 0.099) in large population groups (e.g. Caucasians, Africans, or Asians), the variant is always categorised as Class2.
Missense variants which, according to in silico analysis (Align-GVGD, SIFT), are very probably neutral and/or outside the functionally critical domain (FATKIN).
Class3:
Class4:
Variants with an in-frame deletion which are within the functionally critical domain (FATKIN).
Missense variants which are within the functionally critical domain (FATKIN) and are, according to in silico analysis (Align-GVGD, SIFT), very probably harmful and described as functionally inactive.
Class5:
Truncating ATM variants up to the FATKIN domain.
Missense variants, in-frame deletion or splice mutations which reduce ATM protein expression to < 20% for the mutated allele [28 ], [29 ].
Variants associated with classic AT.
Splice variants: see BRCA1 and BRCA2 .
Functional domains: FATKIN with FAT; PI3K-related kinase; FATC
[Table 6 ].
Table 6 ATM , functional domains and relevance for the interpretation of the clinical importance of sequence variants – catalogue of clinically relevant functional domains and amino acids.
Region
AA start
AA end
AA alterations with demonstrated clinical importance (AT) characterising known functional domains
References and summary interpretation
Substrate binding
91
97
None reported
Domain location description [30 ]
Also contains p53- and BRCA1-binding domain
NLS
385
388
None reported
Domain location description [28 ], [31 ]
Leucine zipper
1218
1238
None reported
Domain location description [25 ], [28 ]
Proline rich
1373
1382
None reported
Domain location description [25 ], [28 ]
FATKIN
1893
3056
Yes, e.g.
p.(Val2424Gly)
p.(2546_2548del), in frame
p.(Asp2625Glu)
p.(Ala2626Pro)
p.(Val2716Ala)
p.(Ser2855_Val2856delinsArgIle)
AA alterations and in-frame deletions [26 ], [28 ], [29 ], [32 ], [33 ], [34 ], [35 ]
Domain location description [25 ], [27 ], [28 ], [36 ], [37 ]
Domains:
FAT: 1893-2612
KIN: 2612-3056 with ATP-binding: 2716-2730, substrate (nibrin and p53) binding: 2682-3012, FATC with TIP60 binding: 3034-3056
Domain location description
Additional literature: [22 ], [23 ], [24 ], [25 ], [26 ], [28 ], [33 ], [35 ], [38 ].
A 5.3 PALB2
Additional literature: [35 ], [38 ], [40 ], [41 ], [42 ], [43 ], [44 ], [45 ], [46 ].
[Table 7 ].
Table 7 PALB2 , functional domains and relevance for the interpretation of the clinical importance of sequence variants – catalogue of clinically relevant functional domains and amino acids.
Region
AA start
AA end
AA alterations with potential clinical importance
References and summary interpretation
BRCA1 interaction domain
9
43
Yes, e.g.
p.(Leu35Pro)
Also covers oligomerisation domain/covers coiled-coiled motif
Domain location description [47 ], [48 ], [49 ];
Amino acid alteration in VUS and functional analysis [50 ].
DNA-binding site
1
200
None reported
Domain location description [51 ]
RAD51 binding site
101
184
None reported
Domain location description [51 ], [52 ]
DNA-binding site
372
561
None reported
Covers also chromatin association motif (ChAM, 395-446)
Domain location description [51 ], [53 ]
MRG15 (MORF4L1) interaction domain
611
764
None reported
Domain location description [48 ]
WD40 repeat
853
1186
Yes, e.g.
p.(Thr1030Ile)
p.(Leu1143Pro)
BRCA2 (1019-1098), RAD51C, XRCC3 and/or RAD51 complex formation
Domain location description [51 ], [52 ], [54 ], [55 ], [56 ], [57 ].
Amino acid alterations [48 ], [54 ], [58 ]
A 5.4 CHEK2
In exons 11 – 15, highly homologous, functionally inactive sequences (pseudogenes) on various other chromosomes (2, 7, 10, 13, 15, 16, X, and Y) [59 ], [60 ] which can superimpose the relevant sequences > long-range PCR of exons 11 – 15 and bioinformatic filtering of pseudogene reads, where possible.
Functional domains: SQ/TQ-rich domain*, forkhead-associated (FHA)** domain, kinase domain***, nuclear localisation signal (NLS) [61 ], [62 ], [63 ].
Table 8 CHEK2 , functional domains and relevance for the interpretation of the clinical importance of sequence variants – catalogue of clinically relevant functional domains and amino acids.
Region
AA start
AA end
AA alterations with Potential Clinical Importance
References and summary interpretation
* Q/TQ consensus sites are sites phosphorylated by ATM/ATR [116 ]. e.g. phosphorylation of Thr-68 is important for CHEK2 activation and oligomerisation.
** The phosphorylated Thr-68 site of CHEK2 interacts with the FHA domain of another CHEK2 molecule and thus leads to the formation of CHEK2 oligomers [66 ].
*** [67 ]
§ The CHEK2 c.470T>C p.Ile157Thr variant, although still present with various classifications (VUS/likely pathogenic/pathogenic) in ClinVar and other databases has been reclassified (date: 28.08.2018) as Class2/likely benign by the German Consortium on HBOC on the following basis: It is frequently listed in large unaffected control cohorts (0.5%, gnomAD V. 2.1.1, non-cancer). The population frequency in Finnish Europeans is 2.5% (10 homozygous carriers). In addition, it is present in 47/7325 individuals (0.64%) in the FLOSSIES database (non-cancer female controls of European descent aged > 70 years). Although showing functionally impaired dimerisation and autophosphorylation [61 ], [62 ], [68 ], numerous large case-control studies show results indicating low or no increased breast cancer risk [64 ], [69 ]: breast cancer OR = 1.58 (1.42 – 1.75), colon cancer OR = 1.67 (1.24 – 2.26) [70 ], [71 ]. Additionally, it has been observed with a frequency of 2% in controls [2 ]. However, it may act as a polygenic risk allele.
SQ/TQ-rich
19
69
e.g. c.85C>T,p.Gln29*
[62 ], [65 ]
FHA
92 [115]
205 [175]
p.Arg117Gly, p.Arg145Trp, p.Gly167Arg
[61 ], [62 ], [63 ]
Kinase
212
501
c.1040A>C, p.Asp347Ala #; c.1100del; c.1164dup; p.Thr476Met, c.1169A>C, p.Tyr390Ser; c.1183G>T, p.Val395Phe#; c.1283C>T, p.Ser428Phe; c.1427C>T, p.Thr476Met
[61 ], [62 ], #ClinVar
NLS
515
538
e.g. c.1547delC, p.Ser516Leufs#; c.1555C>T, p.Arg519Ter#;
[63 ], #ClinVar
Numerous known missense variants in the FHA domain. Caution: Consult FLOSSIES database during evaluation! (e.g. c.470C>T; p.Ile157Thr: Class2 [see § footnote [Table 8 ]) or with unclear clinical relevance (e.g. c.434G>A, p.Arg145Gln; c.422A>C, p.Lys141Thr).
Missense variants in the kinase domain with unclear clinical relevance: e.g. c.1216C>T, p.Arg406Cys.
Similarly, in the NLS domain, only truncating variants have been classified as pathogenic to date ([Table 8 ]).
The investigation by Ow et al. [63 ] gives an overview of the identified mutations in the CHEK2 gene region. The publication by Roeb et al. (2012) includes a schematic overview of the functional domains as well as the results of functional analyses of the missense mutations located in these different CHEK2 domains [62 ].
A 5.5 TP53
IARC TP53 database; the functional analyses by Kato et al. (2003) and Monti et al. (2007, 2011) are reliable [72 ], [73 ], [74 ], [75 ], [76 ], [77 ].
Functional domains: oligomerisation domain, core domain (DNA-binding).
Possible dominant negative effect of missense variants and stop variants which affect the oligomerisation domain.
Mosaic mutations and clonal haematopoiesis are possible, therefore watch out for the variant allele fraction when carrying out NGS analysis; if necessary, carry out further analysis to confirm germline mutations.
[Table 9 ].
Table 9 p53 , functional domains and relevance for the interpretation of the clinical importance of sequence variants – catalogue of clinically relevant functional domains and amino acids.
Region
AA start
AA end
AA alterations with demonstrated clinical importance (including conflicting interpretations of pathogenicity but criteria provided)
References and summary interpretation
Transcription activation
1
55
p.(Val10Ile)
p.(Val31Ile)
p.(Pro47Ser)
Amino acid alterations (ClinVar)
Domain location description [78 ]
Also binding site for numerous proteins including HDM2 (amino acids 15-29; IARC)
Proline-rich domain
61
94
p.(Pro82Leu)
p.(Ala83Val)
Amino acid alterations (ClinVar)
Domain location description [78 ]
DNA-binding region
102
292
p.(Gly105Asp)
p.(Lys120Glu)
p.(Thr125Met)
p.(Ser127Phe)
p.(Asn131Tyr)
p.(Cys141Tyr)
p.(Pro151Ser)
p.(Pro151Thr)
p.(Pro152Leu)
p.(Arg156His)
p.(Arg158Cys)
p.(Arg158His)
p.(Tyr163Asp)
p.(Tyr163Cys)
p.(Arg175Leu)
p.(Arg175His)
p.(Cys176Tyr)
p.(His179Tyr)
p.(Arg181Cys)
p.(Arg181His)
p.(Ala189Val)
p.(His193Arg)
p.(His193Leu)
p.(Leu194Phe)
p.(Ile195Thr)
p.(Arg213Gln)
p.(Val.216Met)
p.(Tyr220Cys)
p.(Tyr220Ser)
p.(Ile232Thr)
p.(Tyr234Cys)
p.(Asn235Ser)
p.(Tyr236Asp)
p.(Met237Val)
p.(Met237Ile)
p.(Cys238Tyr)
p.(Ser241Phe)
p.(Cys242Tyr)
p.(Gly245Asp)
p.(Gly245Ser)
p.(Gly245Cys)
p.(Met246Val)
p.(Met246Leu)
p.(Met246Arg)
p.(Arg248Gln)
p.(Arg248Trp)
p.(Ile251Leu)
p.(Ile251Ser)
p.(Thr256Ala)
p.(Leu257Arg)
p.(Glu258Lys)
p.(Arg267Trp)
p.(Arg267Gln)
p.(Val272Leu)
p.(Arg273Hisv
p.(Arg273Cys)
p.(Cys275Tyr)
p.(Cys277Tyr)
p.(Arg280Thr)
p.(Asp281Val)
p.(Asp281Gly)
p.(Arg282Gly)
p.(Arg282Leu)
p.(Arg282Trp)
p.(Arg283His)
p.(Arg283Lys)
p.(Glu286Lys)
Amino acid alterations (ClinVar)
Domain location description (IARC)
Also binding site for numerous proteins including 53BP1 (IARC) and RAD51 amino acids 94-160 and 264-315 [79 ]
Oligomerisation region
325
356
p.(Gly325Val)
p.(Arg337Leu)
p.(Arg337Cys)
p.(Glu339Lys)
p.(Arg342Pro)
Amino acid alterations (ClinVar)
Domain location description (IARC)
Also binding site for numerous proteins
Covering main nuclear localisation signal (amino acids 316-322) [80 ], [81 ]
Basic (repression of DNA-binding region)
369
388
None reported
Domain location description (IARC)
Also binding site for numerous proteins including RAD54 [82 ]
A 5.6 RAD51D
Functional domains: N-terminal domains and ATP-binding domain with the highly conserved Walker A and B motifs [83 ], [84 ], [85 ].
[Table 10 ].
Table 10 RAD51D , functional domains and relevance for the interpretation of the clinical importance of sequence variants – catalogue of clinically relevant functional domains and amino acids.
Region
AA start
AA end
AA alterations with demonstrated clinical importance (including conflicting interpretations of pathogenicity)
References and summary interpretation
N-terminal region
1
83
None reported
N-terminal domain required for ssDNA-specific binding function [86 ]
Linker
60
78
None reported
Proper interaction with RAD51C and XRCC2 [85 ]
ATPase domain and RAD51B, RAD51C, and XRCC2 binding
99
274
p.(G112A) (disrupts binding of RAD51D to RAD51C [87 ])
p.(S207L) (disrupts RAD51D-XRCC2 interaction [85 ])
p.(A210V) (predicted to be potentially pathogenic [88 ], [89 ])
p.(R266C) [90 ], Meindl et al. (unpublished)
ATPase, AAA+ type
Walker A and B motifs crucial for HR. These motifs are also implicated in binding to RAD51C and XRCC2.
Additional literature: [91 ], [92 ], [93 ], [94 ]
A 5.7 RAD51C
References: [91 ], [95 ] – [101 ]
Functional domains: DNA repair/recombination protein RecA-like, ATP-binding domain
[Table 11 ].
Table 11 RAD51C , functional domains and relevance for the interpretation of the clinical importance of sequence variants – catalogue of clinically relevant functional domains and amino acids.
Region
AA start
AA end
AA alterations with demonstrated clinical importance (including conflicting interpretations of pathogenicity)
References and summary interpretation
N-terminal region
1
66
None reported
Homology-derived putative DNA-binding domain[91 ]
ATPase domain and RAD51B, XRCC3, and RAD51D binding
79
376
p.(Gly125Val)
p.(Cys135Tyr)
p.(Leu138Phe)
p.(Gly153Asp)
p.(Asp159Asn)
p.(Val169Ala)
p.(Leu219Ser)
p.(Arg258His)
p.(Gly264Ser)
Amino acid alterations and functional consequences [95 ], [96 ], [97 ], [98 ] Domain location description [91 ] ATPase domain includes Walker A nucleotide binding motif (amino acids 125-132) and Walker B nucleotide binding motif (amino acids 238-242) [91 ], [99 ]
Nuclear localisation signal
366
370
None reported
Domain location description [99 ]
A 5.8 BRIP1
[Table 12 ].
Table 12 BRIP1 , functional domains and relevance for the interpretation of the clinical importance of sequence variants – catalogue of clinically relevant functional domains and amino acids.
Region
AA start
AA end
AA alterations with demonstrated clinical importance (including conflicting interpretations of pathogenicity)
References and summary interpretation
DEAD/DEAH box helicase domain
17
441
Domain location description [102 ]
Helicase superfamily c-terminal domain
697
851
Domain location description [102 ]
The BRCA1 interacting region of BRIP1
976
1006
Phosphorylation of FANCJ at Ser-990 is important for its interaction with BRCA1
[103 ]
MLH1 interaction
Lysines 141 and 142 are required for direct interaction of FANCJ with MLH1
[104 ]
Nuclear localisation signal
158
175
None reported
Domain location description [102 ]
Additional literature: [1 ], [102 ], [105 ], [106 ], [107 ], [108 ], [109 ], [110 ]
A 5.9 CDH1
References:
Focuses predominantly on molecular genetics: [111 ]
Review of functional analyses: [112 ]
Review of the HDGC Consortium: [113 ]
Review of lobular breast cancer: [114 ]
Distribution of pathogenic variants at the CDH1 locus [111 ]
Known pathogenic CDH1 variants are distributed across the entire locus; it is therefore not possible to define a clinically relevant functional protein domain. The last known truncating pathogenic variant in the last exon is c.2506G>T (p.Glu836*) [115 ]. All truncating variants upstream must therefore be categorised at least as Class4.
The proposal put forwards by the ClinGen Consortium to categorise variants with a MAF > 0.2% as ACMG Class1, contrary to IARC guidelines, is currently being debated.
CDH1 Rule Specifications for the ACMG/AMP Variant Curation Guidelines ClinGen (https://www.clinicalgenome.org/site/assets/files/8816/clingen_cdh1_acmg_specifications_v1.pdf ).