Key words breast - decision analysis - biopsy - MR imaging
Introduction
Since its introduction in the 1980s, MR mammography or breast MRI has become established
as the most sensitive method for detecting breast cancer [1 ]
[2 ]
[3 ]
[4 ]. Even though the method was long overshadowed by X-ray mammography as an easy, cost-efficient,
and effective early detection method, it has become an indispensable part of most
areas of breast diagnostics. Recent studies have confirmed the method’s long suspected
added value in the early detection of breast cancer in women with dense breast tissue
[5 ]
[6 ]
[7 ]
[8 ]. X-ray mammography is not sufficiently sensitive in these women and the additional
benefit of tomosynthesis, ultrasound, and other methods is limited. In a prospective
randomized study, the use of breast MRI was able to lower the interval cancer rate
in women with very dense breast tissue to that of women with less dense breast tissue.
These results are extremely promising, and initial analyses have shown the method
to be cost-effective even though a long-term effect has, of course, not been able
to be shown yet [5 ]
[6 ]
[7 ].
In spite of significant international differences in the use of breast MRI, a survey
conducted by the EUSOBI (European Society of Breast Imaging) showed high acceptance
of the method and use beyond the recommendations and guidelines, which can no longer
be considered current [9 ]. The classic indications for breast MRI include the staging of breast cancer, particularly
in the case of premenopausal women, high breast density, invasive lobular phenotype,
and suspicion of multicentricity [4 ]
[10 ]. Traditionally, the indication of MRI as a problem-solving tool was not very specific.
Current evidence demonstrates that in case of positive or unclear findings on conventional
breast imaging, performing MRI can avoid unnecessary follow-up and biopsies thanks
to its high negative predictive value [11 ]
[12 ]
[13 ]
[14 ]. The method thus increases the specificity and positive predictive value in contrast
to earlier assumptions [11 ]
[12 ]. There is currently a high level of evidence for the use of MRI as an additional
screening method in women with an intermediate risk and the use of MRI to prevent
unnecessary biopsies [11 ]
[12 ]
[15 ]
[16 ]. This does not affect established indications such as the workup of implants or
clinical symptoms like pathological secretion without a conventional imaging correlate
or the follow-up of neoadjuvant therapies. It should be noted that the empirical evidence
for some of these indications is rather limited [3 ]
[4 ]. Consequently, a continuously increasing rate of breast MRI examinations has been
observed and often present a logistic and diagnostic challenge for clinics and departments.
In addition to a standardized examination technique that should include protocols
that are as short as possible but also multiparametric for quality as well as capacity
reasons, reporting of the examination must be highly structured. Structured reporting
has been a topic of interest for years and it has become an established part of most
radiology subspecialties. Its importance in breast MRI cannot be overstated. The BI-RADS
lexicon for X-ray mammography, breast ultrasound, and breast MRI findings was introduced
in radiology as the first structured reporting standard [17 ]. Numerous systems derived therefrom for other organs are in use today. The BI-RADS
lexicon continues to provide a tool for structured reporting that is used internationally.
In principle, it covers all aspects from the examination technique to the management
recommendation but does not include a clinical decision rule as has been standard,
for example, in prostate diagnosis and the PI-RADS system for years [18 ]
[19 ]
[20 ]. Without a clear clinical decision rule, diagnosis is subjective, dependent on experience,
and often difficult [21 ]
[22 ]
[23 ]. Breast MRI is therefore challenging to interpret. The Kaiser score solves this
problem. Using established BI-RADS criteria (Werner Alois Kaiser, after whom the score
is named, was not only an MRI pioneer but also played a major role in the creation
of the original MRI BI-RADS lexicon), a simple flowchart guides the interpreting physician
in two to three steps toward a risk category that can then be translated into an objective
diagnosis and management recommendation under consideration of symptoms [24 ]. The present article discusses the Kaiser score system and its application in clinical
practice.
Examination protocol
A strength of magnetic resonance imaging is its high soft-tissue contrast, which can
be varied by changing the sequence parameters or sequence types. The combination of
information from various contrasts/sequences is often referred to as multiparametric
[25 ]. At least T2-weighted and T1-weighted sequences before and after contrast administration
are interpreted in combination for breast MRI [26 ]. Functional pathophysiological information regarding contrast dynamics is supplemented
by morphological criteria and has high diagnostic significance in the synopsis. As
described in detail elsewhere, we recommend performing the examination in a prone
position using a dedicated multichannel surface coil (currently standard practice)
and an examination protocol in axial slice orientation on a 1.5 or 3 Tesla scanner
[26 ]
[27 ].
Contrast dynamics
The most important part of every breast MRI examination is the contrast dynamics.
While maintaining identical parameters, repetitive T1-weighted sequences are measured
before and after intravenous administration of MRI contrast agents. The first four
minutes after contrast administration are sufficient to determine the curve type [26 ]. Fat saturation is considered optional. We recommend only using this technique if
fat saturation can be consistently achieved with high quality on the available MRI
scanner. In our clinical practice, the Dixon method is used for this purpose. This
method is robust with respect to B0 inhomogeneities so that it also offers advantages
regarding motion artifacts. However, when using older scanners, the potential failure
of reconstruction or a fat-water mix-up due to phase shifts must be taken into consideration
[26 ]
[28 ].
The dynamics make it possible to reliably detect malignant findings. Hypervascularization
caused by hypoxia-induced angiogenesis is considered the pathophysiological basis
for this feature [29 ]. This process is essential for the development of breast cancer and starts in the
earliest tumor stage. Therefore, breast MRI is also capable of detecting ductal carcinoma
in situ (DCIS) with high sensitivity [30 ]
[31 ]. In comparison, non-enhancing cancers are rare. They are only seen in individual
case reports [32 ]. The radiological-pathological correlation in these cases typically indicates low-grade
tumors with a low proliferation rate. Therefore, the literature classifies such findings
as biologically insignificant [11 ]
[13 ]
[30 ]
[33 ]
[34 ]. Technical errors are usually the reason for false-negative breast MRI findings.
Typical causes like extravasation and incorrect contrast injection can be easily avoided
with careful quality management [26 ].
Benign breast lesions also regularly show pathological enhancement. Dynamics alone
are not capable of reliably differentiating benign from malignant findings [35 ]. Morphological criteria are therefore essential to ensure the high specificity of
breast MRI [24 ]
[26 ]. These include, for example, the margin and the presence of edema.
T2-weighted sequences
T2-weighted sequences without fat saturation are highly suited for evaluating morphological
criteria. If the time window allows, a short tau inversion recovery sequence (STIR)
can be additionally measured. Both methods are fluid-sensitive and show tissue containing
water with high signal intensity [36 ]. In addition to the differentiation between benign and malignant lesions, T2-weighted
sequences allow subtyping of the tissue, e. g., evaluation of tissue fibrosis. At
the same time, architectural distortions, post-therapeutic residues, cystic structures,
duct ectasia, and edema are effectively visualized [26 ].
Compared to STIR, the T2-weighted sequence without fat saturation has a higher spatial
resolution and a better signal-to-noise ratio. The T2 contrast should be high by selecting
a long echo time of approximately 170–200 ms. This sequence is unparalleled in the
visualization of architectural distortions, Cooper’s ligaments, mild duct ectasia,
cysts, and intracystic masses [26 ]
[36 ]. The STIR sequence also has advantages. Thanks to contrast defined by T1 and T2
weighting, it is particularly sensitive for fluid and is preferred by some colleagues
for lymph node analysis. As a result of the short inversion time, signal loss is often
seen in structures with a short T1 time such as oil cysts and duct ectasia with a
high foam cell content. Duct ectasia is typically associated with periductal mastitis
or non-puerperal mastitis.
Diffusion-weighted imaging (DWI)
The EUSOBI recommends the basic protocol for breast MRI in addition to a diffusion-weighted
sequence (DWI), which is already widely available internationally [9 ]
[37 ]. DWI visualizes the Brownian motion of water molecules in the extracellular space
and thus provides insight into the microstructure of the tissue [37 ]
[38 ]. Clinical DWI protocols should include only two b-values (0 or 50 and 800 s/mm2 ). Parametric apparent diffusion coefficient (ADC) maps are automatically created
from diffusion coefficients calculated on a voxel basis. In clinical practice a quantitative
ADC can thus be assigned to every pixel. Conclusions about the microstructure of the
examined tissue can be made on this basis like in the case of laboratory values. These
features of DWI explain the continued popularity of this technique [37 ]
[39 ].
Contrast-enhancing lesions on breast MRI
Contrast-enhancing lesions on breast MRI
The basis of every breast MRI diagnosis is the identification of contrast-enhancing
“findings”. These findings can also be referred to as lesions.
MRI BI-RADS designates any mass or non-mass enhancement in contrast dynamics that
cannot be assigned to background parenchymal enhancement as a lesion. The latter has
no disease value according to current knowledge.
If there is no contrast-enhancing lesion on breast MRI, the presence of a biologically
relevant cancer can be virtually ruled out. If a lesion is found, it must be further
categorized as mass or non-mass. Mass lesions are characterized by topographically
continuous and space-occupying properties. If the lesion is not space-occupying and
its growth is discontinuous, the lesion is a non-mass lesion.
In clinical practice, the definitive categorization of lesions as mass or non-mass
is not always possible. In addition, the specific diagnostic non-mass criteria of
MRI BI-RADS is insufficient for classifying lesions as benign or malignant. Therefore,
the KS does not include specific non-mass criteria.
Principle and application of the Kaiser score
Principle and application of the Kaiser score
The data on which the Kaiser score is based can be attributed to the MRI pioneer Werner
Alois Kaiser. He introduced contrast dynamics and the dedicated breast surface coil
and played a key role in the creation of the MRI BI-RADS lexicon with differentiation
of mass and non-mass lesions. In regular scientific audits, all MRI examinations at
his institute were evaluated based on standardized criteria that he had refined over
many years and were added to a continually growing database. Two of the authors of
this article participated in this work for years as his students (PATB, MD). The Kaiser
score that was derived from this database rightfully bears his name [26 ]. The protocol, the Kaiser score and examples are shown in [Fig. 1 ], [2 ], [3 ], [4 ].
The classification algorithm, which belongs to the family of classification trees,
tests all diagnostic criteria in the database and statistically identifies the one
with the greatest discriminatory power [24 ]. The thus homogenized database (probably benign and probably malignant findings)
is further homogenized by iterative hierarchical application of the remaining diagnostic
criteria. This segmentation of the database ends when no further improvement can be
statistically achieved. The Kaiser score corresponds to a classification tree with
three levels and five diagnostic criteria (see [Fig. 5 ]). On the basis of the lesion assessment based on the diagnostic criteria, the user
arrives at a final category ranked according to increasing probability of malignancy
[26 ]. The robustness of the original Kaiser score was ensured by the cross-validation
method. The following international validation studies were able to independently
demonstrate the value of the Kaiser score in various clinical scenarios and to define
limit values [18 ]
[39 ]
[40 ]
[41 ]
[42 ]
[43 ]
[44 ]. Interestingly, the diagnostic categories of the Kaiser score are not only probabilities
of malignancy but also represent specific imaging phenotypes of breast lesions. Therefore,
every category of the Kaiser score can be assigned to specific differential diagnoses
(see [Table 1 ] and [Fig. 2 ], [3 ], [6 ], [7 ]).
Table 1
Typical differential diagnoses of KS categories.
KS
Benign
Malignant
Comment
1
Fibroadenoma,
Fine granular enhancement of acinar breast tissue structures
Extremely unlikely
Definitely benign
2
Fibroadenoma
Papilloma (intraductal)
Extremely unlikely
Clinical management depends on symptoms and location (e. g., papilloma)
3
Regional hormonal or inflammatory enhancement
DCIS (possible)
Typical benign finding, usually non-mass; can correspond to DCIS in association with
suspicious microcalcification; type 2 curve usually missed upon visual inspection;
DCIS typically shows KS 5
4
Adenosis
Fibroadenoma (atypical)
Invasive cancer (unlikely, only very small lesions)
This category corresponds to the typical presentation of adenosis, typically multiple
and bilateral.
Caution: in the case of a very small size and insufficient image quality, a KS of
8 can be misinterpreted as a KS of 4.
Always perform a second critical evaluation when assigning a KS of 4 in patients with
BRCA-1 mutation (observe default category).
5
Benign proliferative changes
Risk lesions: Complex sclerosing lesions, atypical ductal hyperplasia
DCIS (typical)
Typical phenotype of DCIS; overlap with benign proliferative changes
6
Inflamed scar tissue (atypical)
Breast cancer recurrence in scar tissue, invasive carcinoma under neoadjuvant therapy
Rarest KS phenotype
7
Unlikely
Invasive cancer
Classic breast cancer, typically hormone receptor-positive
8
Less typical:
Adenosis
Papilloma
Fibroadenoma
Aggressive breast cancer, e. g., triple-negative cancer (typical), metastasis, lymphoma
Classic phenotype of highly aggressive breast cancer
Caution: Potenzial misinterpretation as KS 4 in the case of small size and/or suboptimal
image quality
9
Highly unlikely
Invasive cancer
Classic breast cancer, typically hormone receptor-positive
10
Highly unlikely
Invasive cancer
Classic breast cancer, typically her2/neu-positive; elevated risk of lymph node metastases
11
Highly unlikely
Invasive cancer
Fig. 1 Basic breast MRI examination protocol. The basic protocol is a full multiparametric
breast MRI exam. It starts with a T2-weighted sequence. Alternatively or in addition,
a Short-Tau-Inversion-Recovery sequence can be acquired (STIR). Dynamic contrast-enhanced
scanning starts with a precontrast T1-weighted measurement, followed by IV contrast
injection and an injection delay of 30 seconds. Subsequently, 4 identical T1-weighted
measurements are acquired. DWI is a recommended optional addition. TSE: Turbo-Spin
Echo/Fast Spin Echo; EPI: Echo Planar Imaging, DWI: Diffusion Weighted Imaging, GRE:
Gradient echo, CM: contrast medium, here gadolinium-based macrocyclic contrast media.
Fig. 2 The Kaiser score integrates the 4 BI-RADS criteria margins, curve type, internal
enhancement and edema into a flowchart. The flowchart asks the characteristics of
the respecvtive diagnostic criterion (e.g. “circumscribed” vs “irregular” margins).
If the diagnostic criterion does not show definite characteristics, the basic category
is chosen. The decisions guide through the flowchart to a specific score. These scores
range between 1 and 11 and reflect increasing probabilities of malignancy. The traffic
light colouring serves as an interpretation guidance (details given in text and tab.
2): green: not suspicious, biopsy primarily not recommended (BI-RADS 2/3). Yellow:
suspicious, biopsy recommended (BI-RADS 4a, 4b). Red: highly suspicious, biopsy necessary.
IF biopsy returns benign results, congruence between biopsy results and imaging has
to be carefully reviewed (BI-RADS 4c/5).
A free and interactive online application can be accessed at https://radiologie-weiterbildung.de/kaiser-score/ .
Fig. 3 Premenopausal patient with new focal lesion. The STIR sequence A reveals an isointense mass lesion without edema (dashed circle: area surrounding
the lesion). The lesion exhibits fast initial enhancement in the early dynamic series
(B , not fat-saturated). The white arrows mark three faint and root-like extensions of
the lesion. This finding is consistent with “spiculation”. Compared to the early dynamic
phase, the last dynamic phase C reveals a signal decline, consistent with “wash-out”. The internal enhancement is
“not homogeneous”. According to [Fig. 2 ], a KS of 9 is assigned. Percutaneous core biopsy revealed invasive ductal carcinoma
G2, hormone receptor-positive. Note: The KS is robust and compensates for observer-related
bias. In this example – for instance – the user may not acknowledge the presence of
“spiculations”. In this reading, a KS of 8 will be assigned, which is also to be considered
as “suspicious” (BI-RADS IVc/V).
Fig. 4 Perimenopausal patient with newly detected architectural distortion on mammography.
On sonography, multiple indeterminate lesions are evident. The non-fat-saturated T2-weighted
sequence reveals a heterogeneous, isointense mass lesion. There is evidence of architectural
distortion (arrows in A ) and perifocal edema (arrowhead in A ). Subtraction images of the early dynamic series B reveal mass lesions with strong enhancement. There is evidence of several subtle
spiculations. Both architectural distortion and spiculations are better delineated
in the late dynamic sequence (arrows in C ). Here, peripherally accentuated heterogeneous washout is also evident. Findings
are consistent with a KS of 11. Percutaneous core biopsy revealed invasive ductal
carcinoma G3, hormone receptor-positive. Two ipsilateral lymph node metastases were
confirmed histologically. This example highlights the robustness of the KS to observer-related
bias. If the reader is not sure whether spiculations are definitely present, the default
category “not spiculated” must be selected. In this case a KS of 8 will be assigned.
This will not change the overall KS assessment, as both KS 11 and 8 have to be considered
as suspicious (BI-RAVS IVc/V).
Fig. 5 Premenopausal patient with new onset of bloody secretion. The T2-weighted sequence
reveals presence of a retroareolar ductasia (A : T2 TSE, echo time 192 ms) with subsequent isointense intraductal lesion (arrow).
In the early dynamics (B), there is evidence of strong and homogeneous enhancement
with subsequent plateau (C). Findings translate to a KS of 2. Caveat: There is evidence
of delayed enhancement in the surrounding tissue. This pattern may mimic a “persistent
signal increase”. However, in equivocal cases, the default category applies, which
is “plateau” in this case. The combination of clinical presentation, intraductal location,
and generally benign breast MRI phenotype is highly suggestive of a benign papilloma.
Management was invasive and the lesion was completely resected. Histology revealed
an intraductal papilloma without atypia.
Fig. 6 Perimenopausal patient with new palpable lesion on the left breast. Breast MRI was
indicated for biopsy planning due to sonographically equivocal bilateral lesions.
The T2w-TSE sequence (A , TE 192 ms) reveals a circumscribed focal lesion (arrow). Slow initial enhancement
B is followed by a persistent signal increase in the late dynamic phase C . No spiculation. No edema. Finding translates to a KS of 1, corresponding to a benign
breast MRI phenotype (BI-RADS II). Without guidance of a structured assessment tool,
the bizarre internal structure may appear suspicious at first glance. The KS assessment
as clearly benign is supported by additional diagnostic criteria. In this case a cystic
compartment within the lesion is evident (arrowhead). This pattern is typical of benign
findings. According to the patient’s preference, a percutaneous core biopsy was performed.
Histologic workup revealed a regressively altered fibroadenoma B2 consistent with
the breast MRI phenotype. Upon the patient’s request, the lesion was subsequently
surgically removed.
Fig. 7 Postmenopausal patient with faint pleomorphic microcalcifications in two small clusters
within the left inner quadrants. The STIR sequence A reveals no abnormalities. After gadolinium administration, there is evidence of a
non-homogeneous regional non-mass. The non-mass reveals multiple grouped and ring-like
enhancements in the initial dynamic phase (B : subtraction image). In the late phase, signal intensity remains unchanged (plateau).
The lesion is not circumscribed. The finding translates to a KS of 5, corresponding
to a breast MRI phenotype of a malignant lesion, most likely DCIS. Histological verification
by vacuum-assisted biopsy revealed DCIS G3.
Practical application is based on the classification tree in [Fig. 2 ]. Alternatively, an interactive Web application is also available [45 ].
What differentiates the Kaiser score from other algorithms?
What differentiates the Kaiser score from other algorithms?
The Kaiser score is the only evidence-based, generally applicable, and thoroughly
independently validated decision rule for breast MRI. Other classification algorithms
are not based on representative samples, cannot be applied to all findings with strong
contrast, or have an insufficient level of accuracy. No other algorithm has been validated
on a sufficiently independent basis. A detailed description of other studies exceeds
the scope of this article. Therefore, we only reference the primary literature [45 ]
[46 ]
[47 ]
[48 ]
[49 ]
[50 ]. We would like to discuss an important aspect of clinical decision rules, particularly
in breast diagnosis. The accuracy of diagnostic tests is based on the number of correctly
classified cases that can be assessed by various methods. Even if such values (e. g.,
90 % accuracy) allow a satisfactory first assessment of the quality of a test, the
practical value is low. “Will I make the correct decision for patient × if I rely
on the algorithm” is a more accurate description of the concrete clinical problem.
For this purpose, an algorithm must have limit values at which the presence of a disease
is highly likely or unlikely. In breast cancer diagnosis, reliable tumor exclusion
is of primary relevance as a result of treatment being based on immunohistochemical
tumor classification. Such a “rule-out” criterion that rules out a malignant tumor
with high reliability has been validated multiple times in various settings for the
Kaiser score: a biopsy can be avoided at a Kaiser score < 5 [18 ]
[26 ]
[41 ]
[42 ]. The Kaiser score can be seamlessly integrated into the clinical context and can
be easily combined with clinical results and algorithms using Bayesian principles.
However, a recent multicenter study was not able to prove added value of the integration
of quantitative ADC values in the Kaiser score [39 ].
Level of evidence
The Kaiser score is the only evidence-based, generally applicable, and independently
validated decision rule for breast MRI.
Nonetheless, the literature includes alternative classification algorithms. However,
these cannot be used for broad clinical application: Typical limitations are samples
that are not representative, a lack of generalizability (e. g., only applicable for
masses), and insufficient accuracy. Most notably, none of these algorithms has been
independently validated which is a basic requirement for evidence-based clinical use.
A detailed overview of alternative classification algorithms is not the goal of this
article. Therefore, we only reference the primary literature [46 ]
[47 ]
[48 ]
[49 ]
[50 ]
[51 ].
Diagnostic criteria of the Kaiser score
Diagnostic criteria of the Kaiser score
The diagnostic criteria on which the Kaiser score is based correspond to those of
the BI-RADS lexicon [19 ]
[24 ]
[26 ]. A recent comprehensive German-language presentation of the BI-RADS lexicon [52 ] and a structured analysis of breast MRI [53 ] are included in the literature. Based on our experience using the Kaiser score for
many years and on numerous interdisciplinary discussions and discussions with colleagues
over the last 20 years, the criteria for findings must be clearly defined. However,
semantic criteria are always subject to a certain level of subjectivity, for example,
the effect of the subjective first impression on the objective description (“looks
like cancer...”). Consequently, structured description based on the BI-RADS lexicon
is considered difficult to reproduce [18 ]
[21 ]. In contrast, the Kaiser score is robust with respect to the subjective interpretation
of individual criteria. However, some basic definitions are needed.
Default category
This includes the term default category as defined in this article for the first time.
A diagnostic criterion can be “present” or “not present”. To minimize subjectivity,
a criterion should only be evaluated as positive if there is no doubt. If, for example,
the presence of spiculation is unclear, it should always be assessed as absent. Only
a critical evaluation can minimize the subjective effect of the first impression on
the examiner. Particularly with respect to breast imaging, there is always the fear
of overlooking lesions, resulting in a low threshold for assigning high BI-RADS categories
and thus in a low biopsy threshold [51 ]. Especially with regard to value-based health care, we define a default category
for every diagnostic criterion that is automatically applied in the case of unclear
findings or contradictory images.
Margin
The BI-RADS lexicon differentiates between a “circumscribed” and “non-circumscribed”
margin. Spiculation is considered a special type of “non-circumscribed” margin [19 ]. The machine learning algorithm on which the KS is based identified the independent
significance of “spiculation” [24 ]. Therefore, this feature is discussed separately in the following.
A lesion is considered circumscribed when it can be easily delimited on all sides
from the surrounding tissue. There are no zones in which the lesion infiltrates the
perifocal tissue. The margin can be seen in both the early dynamic phase and the T2-weighted
sequence. Partial volume effects can affect the interpretation of this feature. The
default category for the margin is “not circumscribed”.
Spiculation
In X-ray mammography, spiculation is considered a highly specific malignancy criterion.
On breast MRI, spiculations have a concave lateral margin and a pointed tip and protrude
in a “root-like” manner from contrast-enhancing lesions [40 ]. A singular spicule can be present and is referred to as the “root sign” by Werner
Kaiser. The classic finding with multiple spicules is thus a variant of this criterion
[40 ]. [Fig. 2 ], [3 ] show examples of spiculations.
The default category is “absent”. Spiculation is thus only categorized as present
in textbook cases.
Spiculations can be more evident on T2-weighted sequences than in the dynamic phase.
Therefore, this criterion can be evaluated on T2-weighted sequences and in the dynamic
phase. Spiculations can sometimes be confused with minor motion artifacts in the dynamic
phase. If there is suspicion of spiculations in the subtraction series, we always
verify the finding on the original images.
Practical tip: It is important not to combine the margin and spiculations with one
another. In fact, both criteria can be present independently of one another: Accordingly,
a spiculated lesion can have a “circumscribed” margin. Conversely, there can be “non-circumscribed”
and “non-spiculated” lesions.
Curve type
The BI-RADS lexicon defines three curve types: The spectrum ranges from persistent
signal increase between the early and late post-contrast image (type I curve) to plateau
(unchanged signal intensity: type II curve) and washout (type III curve). The latter
is characterized by signal loss between the late and early post-contrast image.
While a washout curve is suspicious for malignancy, a persistent signal increase tends
to indicate a benign lesion. The plateau type of enhancement is nonspecific and should
be considered suspicious in case of doubt. However, the curve type alone does not
have a sufficient level of diagnostic significance. An invasive carcinoma can show
persistent enhancement and benign adenosis is often displayed with washout [35 ]. It is the combination of the KS criteria that allows a reliable diagnosis.
The default category for the evaluation of curve type is “plateau”. Another curve
type should only be diagnosed in the case of clear signal change over time.
We prefer visual assessment of the curve type. This requires a standardized viewing
layout (hanging protocol). Alternatively, classic curve measurement using a region
of interest (ROI) can be performed. However, this requires the subjective identification
of the region with maximum contrast enhancement in the lesion [35 ]. This type of curve measurement is therefore time-intensive and prone to errors.
Therefore, we do not recommend this measurement. Alternatively, software tools with
color-coded overlays of the curve types can be used. These tools can support the diagnosis
of non-mass lesions [52 ]. However, even minor motion artifacts can be confused with suspicious curve types.
For this reason, we always verify the findings with the original images.
Practical tip: Exact identification of the instant of early enhancement is essential.
If the wait time is too short after intravenous contrast administration or in the
case of a delayed circulation time, the first measurement will be performed too early.
In this case, we use the second measurement after gadolinium administration as a reference
for early contrast enhancement. If this approach is not used, a lesion can be overlooked
in the worst case or the curve may appear less suspicious [53 ].
Internal enhancement
Internal enhancement can be “homogeneous” or “not homogeneous”. The default category
is “not homogeneous”.
Homogeneous lesions have either no or a structured internal architecture. Septa and
compartments are typical examples and are characteristic for these usually benign
findings. “Centrifugal” and “central” are subtypes of homogeneous internal enhancement
[26 ]. Such lesions are to be identified on non-contrast images. In the early dynamic
phase, only enhancement of a central compartment is seen. In the case of the “centrifugal”
phenotype, there is also enhancement of the lesion periphery over time. This does
not occur in the case of “central” internal enhancement. Histology usually shows fibroadenomas
with regressive changes.
All other lesions are considered “not homogeneous”. A structured internal architecture
is not present here. An unstructured enhancement pattern typical for cancer without
clear internal structures is typically seen. “Centripetal internal enhancement” is
a subtype. This is pathognomonic for a malignant lesion. Central fibrotic or necrotic
carcinomas are typically seen in the histological correlation. On breast MRI, only
enhancement of the periphery is initially seen. In contrast, enhancement of the center
of the lesion occurs only with a delay and usually incompletely.
Practical tip: Ring-like enhancement is often primarily evaluated as typical of malignancy.
However, the differential diagnosis also includes benign findings like abscesses and
simple cysts. Homogeneous ring-like enhancement is the rule in these cases, while
malignant lesions with ring-like enhancement have a different morphological pattern.
Heterogeneous contrast enhancement of the periphery is seen. Irregular or even nodular
thickening is possible there.
It is challenging to analyze the internal enhancement of a non-mass. The analogy to
microcalcification diagnosis is helpful here. The morphology of scattered (discontinuous)
lesions is decisive also in these cases. Monomorphic calcification shows a uniform
(homogeneous) morphology. The opposite is true for polymorphic microcalcification
(not homogeneous). Analogously, the internal enhancement on breast MRI can be classified
as homogeneous (= monomorphic) and not homogeneous (= polymorphic).
A lesion with homogeneous enhancement and central washout is typical for benign adenosis.
However, caution is advised in the case of very small areas of enhancement, which
can imitate homogeneous enhancement due to the small size.
Edema
Ipsilateral edema is a highly specific criterion for malignancy. It indicates an aggressive
breast cancer phenotype. Ipsilateral edema is associated with higher grading, lymphangitic
carcinomatosis, and a poor prognosis. To avoid false-positive findings, the criterion
may not be used after recent biopsy, surgery, or radiotherapy. Bilateral edema indicates
a systemic etiology (renal, cardiac) [54 ]
[55 ]
[56 ].
The default category is “not present”. Ipsilateral edema can be further classified
as perifocal, prepectoral, diffuse, or subcutaneous.
Further diagnostic criteria: unimportant?
Experienced diagnosticians know a number of additional criteria for the interpretation
of breast MRI (shown with maximum detail in [57 ]). The lack of inclusion in the Kaiser score does not mean that they cannot provide
additional diagnostic value. The Kaiser score is based on a machine learning algorithm
that categorizes semantic lesion features provided by radiologists according to various
imaging phenotypes [24 ]. In general, under such conditions, the greater the database, the more subcategories
can be defined. The Kaiser score can be used for any enhancement of the breast [18 ]
[26 ]. The classification algorithm of the Kaiser score lumps all enhancement together.
This has advantages and disadvantages: the Kaiser score is simple, easy to use, and
diagnostically accurate. However, there are special cases in which the clinical context
modifies the diagnosis (as discussed in the corresponding section of this article).
One example is intraductal enhancement. The presence of T1 hyperintense duct ectasia
argues strongly against a malignant origin and for a stasis of secretions with possible
periductal inflammation, while T2w intraductal fluid with high signal intensity in
association with a contrast-enhancing focal lesion is virtually pathognomonic for
the diagnosis of a papilloma (see [Fig. 4 ]). The Kaiser score reflects a probability of malignancy but can be combined under
consideration of additional criteria and clinical information for more specific diagnoses
(see [Table 2 ]). However, the results of the Kaiser score are sufficient for a general differentiation
between benign and malignant contrast-enhancing lesions.
Table 2
Management recommendation depending on KS and clinical situation.
Clinical situation
KS
BI-RADS and clinical management
Asymptomatic patient with low risk without specific conventional correlate
1–4
BI-RADS 2; follow-up can be considered
Any patient with a new lesion or without prior images
5–11
BI-RADS 4 (KS 5–7) or 5 (KS > 7); percutaneous biopsy needed
Asymptomatic risk patient without specific conventional correlate
3–4
BI-RADS 3; 6-month follow-up recommended. Low biopsy threshold in new KS 4 lesion
in patients with BRCA-1 mutation
Symptomatic patient with new finding on palpation
1–4
BI-RADS 3; 6-month follow-up recommended, possibly also biopsy
Symptomatic patient with ipsilateral nipple discharge
2, 4
BI-RADS 4a; when intraductal or retroareolar biopsy with suspected diagnosis of papilloma
recommended
Corresponding mammographic microcalcifications BI-RADS 4
3
BI-RADS 4; biopsy recommended, also follow-up with X-ray mammography or MRI possible
(maximum DCIS possible)
Known lesion with long-term stability documented on imaging
8
BI-RADS 2 (when stable ≥ 2 years); otherwise, BI-RADS 3 and follow-up
Post-therapeutic scar tissue < 5 years after intervention
5–7
BI-RADS 3; 6-month follow-up recommended. Biopsy can be considered but if the PPV
is low, progression in the short term is not likely.
Management recommendation based on the Kaiser score
Management recommendation based on the Kaiser score
The KS is not a fully automatic algorithm. Its use supports the interpreting physician
in that it translates the physician’s structured reporting of criteria into a probability
of malignancy in an evidence-based manner. From the KS result, the examiner can derive
a concrete clinical management recommendation that also includes the individual senological
situation of the patient. The combination of the clinical picture (symptoms, palpation
findings), conventional imaging (new lesion), and hormone status allows accurate personalized
diagnosis. For example, a new palpable lesion is considered suspicious for malignancy
in postmenopausal patients until proven otherwise, while the situation is more complex
in premenopausal women with the same constellation of findings. In the following we
discuss some typical scenarios with more details being provided in [Table 2 ].
For example, a lesion with a KS of 2 is to be evaluated as definitely benign ([Fig. 2 ]). However, the finding can still require invasive management. This is the case,
for example, in a symptomatic intraductal papilloma (see [Fig. 5 ]).
In high-risk situations, false-negative findings should be avoided. Caution is advised
when assigning a KS of 4 in high-risk situations. Insufficient image quality and/or
a small lesion size can result in misinterpretation of KS 8 findings as KS 4. It is
important to systematically apply the default category, thereby resulting in a KS
of 8 in case of doubt.
The long-term clinical course also affects the management recommendation. Large adenosis
or fibroadenomas are sometimes classified as KS 8. This corresponds to the KS of a
suspicious finding. If these findings remain unchanged for years, a biopsy is not
recommended.
The management recommendation after determination of the KS in breast MRI must also
take current diagnostic imaging into account. A lesion with a KS of 3 is to be evaluated
as benign. However, if suspicious pleomorphic microcalcifications are present on X-ray
mammography or tomosynthesis, this finding must be included in the management recommendation.
In this case, a lesion with unclear malignant potential (B3) must be suspected. We,
therefore, recommend invasive management.
In the case of a contrast-enhancing lesion within a cyst, the finding is “complex
cyst”. The differential diagnosis includes the entire family of papillary lesions
and ranges from papilloma (without atypia) to papillary DCIS and invasive carcinoma.
Thus, histological workup is indicated in every case regardless of the KS and invasive
management is necessary (see [Fig. 8 ]).
Fig. 8 Perimenopausal patient with new palpable lesion. On breast MRI a “complex cyst” is
diagnosed corresponding to a contrast-enhancing lesion within a cyst (arrow in A,
TSE sequence, TE 192 ms). The internal enhancement is heterogeneous. The curve type
is indistinct, so the basic category “plateau” applies. The margin is circumscribed.
The finding translates to a KS of 2. Although the KS is strongly suggestive of a benign
finding, management of the lesion has to be invasive. The finding was biopsied and
surgically removed. An intracystic papilloma without atypia was diagnosed upon histological
verification.
Summary and future developments
Summary and future developments
The KS is an easy-to-use, evidence-based decision rule for breast MRI. Based on its
structure, the KS supports an objective description, structured documentation, and
exact diagnosis of contrast-enhancing lesions on breast MRI. Under consideration of
clinically relevant information, it can support rational patient management. The KS
defines specific imaging phenotypes derived from the combination of four BI-RADS criteria.
The KS greatly simplifies and shortens the documentation of findings, which has a
positive effect on the clinical workflow.
Establishing the KS in the clinical routine is the most important task for the near
future. Integration in the BI-RADS lexicon as in the case of PI-RADS is the next step. A
clinical decision rule has been integrated in PI-RADS since version 2.0, a development
we would also like to see for the BI-RADS lexicon.