Key words
differentiated thyroid carcinoma - whole-body scan - radioiodine - prognosis - false-positive
Introduction
Differentiated thyroid carcinoma (DTC) comprising papillary (PTC) and follicular
carcinoma (FTC) accounts for the majority of all thyroid malignancies [1]. The American Thyroid Association (ATA)
Management Guidelines for Adult Patients with DTC [2] recommends initial treatment with surgery and, adjuvant therapies are
defined according to risk evaluation as well as the frequency and intensity of
follow-up. Radioiodine (RAI) is recommended as adjuvant DTC therapy for remnant
thyroid ablation or therapy for persistent and/or metastatic disease [2]
[3]
[4].
Post-treatment Whole-Body Scans (ptWBS) are a mandatory test within the context of
patients receiving RAI. The goals of the ptWBS are to identify iodine avidity for
structural disease, and to refine disease staging by identifying patients with
previously unknown distant metastasis [2].
Prior studies have suggested that about 8% of DTC cases are reclassified via
ptWBS exams as having distant metastasis [5].
Of note, ptWBS seems to be more sensitive to detecting new lesions versus diagnostic
Whole-Body Scans (DxWBS) [6]. Data on
sensitivity and specificity of the ptWBS exams are scarce and estimated at 65 and
55%, respectively, with a diagnostic accuracy for distant metastasis of
63% [7].
ptWBS is subject to diagnostic and interpretation problems like other exams [8]
[9]
[10]
[11]
[12]
[13] but is especially plagued by a high
false-positive rate. In low-risk DTC populations, the limited detection of distant
metastasis restricts its diagnostic utility [14]
[15]
[16]. Remarkably, Garger (2016) reported
73% of false-positive distant uptakes [17]. False-positive ptWBS is a potential source of unintended
psychological stress and wasteful financial health care expenditures.
Any area of the body that concentrates RAI may potentially be interpreted as a region
of thyroid cancer metastasis but functional expression of sodium/iodine
symporter (NIS) in non-thyroid tissues, iodine retention in body fluids
(i. e., cysts, bronchiectasis), technical contamination, infections, and
inflammatory processes may exhibit the same pattern of uptake and can cause
false-positive findings on ptWBS [13]
[14]
[15]
[17]
[18].
The ptWBS also can yield false-negative results. A negative ptWBS may misleadingly
suggest the absence of metastatic DTC. In those patients with documented distant
metastasis, the negative ptWBS uptake could indicate that an incorrect preparation
for RAI administration occurred (inadequate TSH levels, iodine contamination due to
the absence of low iodine diet, etc.) or that the tumor underwent a
dedifferentiation process [19]
[20]
[21].
The current DTC risk assessment strategies have incorporated clinical and
pathological criteria, along with ultrasensitive thyroglobulin (Tg), a marker with
high sensitivity and specificity for detection of persistent DTC [22]
[23]. In this context, there is uncertainty
regarding the contemporaneous diagnostic usefulness of ptWBS as an adjunct tool for
DTC staging.
Thus, the aim of this study was to evaluate the diagnostic value of ptWBS in a cohort
of DTC patients in a tertiary, university-based hospital.
Patients and Methods
Patients and study design
The subjects were from a cohort of DTC patients from the Thyroid Outpatient
Clinic of the Thyroid Unit, Endocrine Division of the Hospital de
Clínicas de Porto Alegre (HCPA), a tertiary care, university teaching
hospital in southern Brazil. We enrolled all consecutive patients from 2009 to
2015 with a histological diagnosis of DTC who received the first dose of RAI and
had a ptWBS image available. All subjects have given written informed
consent.
Ethical approval
The study was approved by our institution’s ethics committee (Ethics and
Research Committee of Hospital de Clinicas de Porto Alegre - CAAE
68434617.5.0000.5327/GPPG 17-0482). All methods were performed in
accordance with the relevant guidelines and regulations for research involving
human database.
Treatment protocol and follow-up
The DTC treatment protocol consisted of total thyroidectomy followed sometime by
administration of RAI as indicated as well as the use of suppressive
levothyroxine therapy according to current guidelines [24]
[25]. Decisions regarding cervical lymph
node dissection were at the discretion of the surgical team. The iodine
administration protocol used RAI activities prescribed at the attending
physician’s discretion. RAI was administered in a stimulated thyrotropin
(TSH) condition of endogenous hypothyroidism
(TSH>30 mUI/l) after withdrawing levothyroxine (at least
3–4 weeks) and assuming a low iodine diet for 2 weeks. Stimulated
postoperative thyroglobulin (sPOTg) was measured at this time. Serum levels of
anti-thyroglobulin antibody (TgAb) were measured in the same blood sample, and
patients with positive results were excluded from all analyses of sPOTg.
In the first evaluation, the following data were recorded for each patient:
demographics, tumor characteristics (e. g., histological features,
extension, and lymph node involvement) and treatment (e. g., surgery,
RAI, and other interventions). Each patient was classified using the
8th edition of the TNM/AJCC (TNM8) staging system (I, II,
III, or IV) [26]
[27]. Clinically negative N0 status was
determined by clinical examination of the neck or preoperative and postoperative
neck ultrasound (US) imaging. Alternatively, we used macroscopic examination
during surgery and pathological study of patients with lymph node resection.
The risk of persistent/recurrent disease was assessed based on the
proposed risk stratification system by the ATA 2009 risk guideline [28] with patients classified into three
risk groups: low (intrathyroidal tumors without any intermediate-high-risk
features), intermediate (tumors with microscopic invasion of perithyroidal
tissues or cervical lymph node metastasis or tumor with aggressive histology or
vascular invasion), and high (tumors with macroscopic tumor invasion, incomplete
tumor resection, distant metastasis, and thyroglobulinemia out of proportion to
what is seen on the ptWBS).
Post-treatment whole-body scan
Planar I-131 ptWBS was performed 7 to 10 days after RAI administration in both
anterior and posterior projections using a dual-detector gamma camera (Infinia,
GE Healthcare, Chicago, IL, USA) with high-energy, parallel-hole collimators.
Continuous acquisition mode was used at a table speed of
6 cm/min, 1024×256 matrix, and automated body contour
detection applied. The photopeak was centered at 364 keV with
a±10% window. Additional spot views were acquired at
5 min/view with a 256×256 matrix in the case of
unexpected iodine uptake and accumulations suspected to be contamination
(scanned after removing the contamination or the stained clothes).
Distant metastasis
All cases with distant uptake on ptWBS were reviewed by two independent
physicians (CFN and JMD). The gold standard for the presence of distant
metastasis was an evaluation that considered imaging tests (CT, bone
scintigraphy) at the ptWBS uptake sites. Those with distant uptake on ptWBS and
imaging tests confirming the presence of distant metastasis in that same
topography were classified as true-positives. If the distant uptake on ptWBS was
not confirmed by imaging exams, then it was classified as a false-positive
uptake.
Furthermore, all patients with a sPOTg>10 ng/ml were
screened for the presence of distant metastasis by imaging (neck US, CT,
and/or bone scintigraphy). Cases with no distant uptake on ptWBS but
with proven distant metastasis by other imaging tests were considered
false-negatives.
Laboratory analysis
Serum Tg measurements were conducted using immunoradiometric assays (from 2000 to
2002 via radioimmunoassay; 2002 to 2010 via electrochemiluminescence; and 2010
to the present via chemiluminescence). These assays had a functional sensitivity
of 0.2 ng/ml. The TgAb were measured using the passive
agglutination method from 2000 to 2010 and by chemiluminescence from 2010 until
the present. TSH levels were measured via a chemiluminescence assay from 2000 to
2006 (Immulite 2000 Siemens, Munich, Germany), electrochemiluminescence from
2006 to 2010 (Modular E Roche, Basel, Switzerland), chemiluminescence assay from
2010 to 2014 (Centaur XP Siemens, Munich, Germany), and electrochemiluminescence
from 2014 to the present (Cobas E602 ROCHE, Basel, Switzerland). The necessary
procedures for standardization and validation were performed after each new
assay had been implemented. These tests were all conducted in the HCPA central
laboratory.
Statistical analysis
The clinical and laboratory data are reported as the mean±standard
deviation (SD) values or as the median and percentiles 25 and 75
(P25–75) for continuous variables or as absolute numbers and percentages
for categorical variables. Comparative analyses of frequencies were performed
using the Pearson Chi-Square or Fisher’s Exact Test as appropriate.
These analyses were performed using the Statistical Package for Social Science
Professional software version 20.0 (IBM Corp., Armonk, NY, USA).
Sensitivity and specificity of ptWBS were calculated for the overall population
sample and specific subgroups. Agreement of ptWBS with the gold standard
(clinical and image classification) was accessed using Cohen’s Kappa
coefficient for two-level mutually exclusive categories using VassarStats.
Comparisons of predictive value for ptWBS diagnostic performance between
different DTC populations were calculated using the WinPepi version 11.65 with
the Weighted Generalized Score test of Kosinski.
Youden's index was calculated using sensitivity and specificity for
comparisons of the performance of the ptWBS between two different populations.
The Youden index is a measure of test performance and is calculated based on the
sensitivity and specificity of the test for a given population (Youden
index=sensitivity+specificity–1). Its value ranges from
0 through 1; zero means that the diagnostic test gives the same proportion of
positive results for groups with and without the disease (i. e., the
test is useless) and a value of 1 is when the test is considered perfect
(i. e., there are no false-positives or false-negatives) [29]. All tests were two-tailed, and a
p<0.05 was considered statistically significant.
Results
Clinical characteristics
We evaluated 268 patients with clinical and oncological features in [Table 1]. The mean age at diagnosis was
46±16 years; 220 (82%) were women, and PTC was diagnosed in 234
(87%) patients. The median tumor size was 2.7 cm (P25–75
1.3–3.5 cm). Of these, 107 (40%) had lymph node
metastasis, and 29 (11%) had distant metastasis. Staging indicated 212
(80%) as stage I, 41 (15%) as stage II, 2 (1%) as stage
III, and 10 (4%) as stage IV. All patients underwent total thyroidectomy
and received RAI therapy. The median administered RAI activity was
96±34 mCi.
Table 1 Characteristics of 268 patients with differentiated
thyroid carcinoma.
Age at diagnosis (years)
|
46±16
|
Female sex - n (%)
|
220 (82)
|
Histology - n (%)
|
|
Papillary
|
234 (87)
|
Follicular
|
34 (13)
|
Tumor size (cm)
|
2.7 (1.3–3.5)
|
Lymph node metastasis (N1) - n (%)
|
107 (40)
|
Distant metastasis - n (%)
|
29 (11)
|
TNM8 - n (%)
|
|
I
|
212 (80)
|
II
|
41 (15)
|
III
|
2 (1)
|
IV
|
10 (4)
|
ATA Risk - n (%)
|
|
Low
|
106 (40)
|
Intermediate
|
127 (48)
|
High
|
32 (12)
|
Radioactive iodine activity (mCi)
|
96±34
|
Data are expressed as the mean±SD, median (percentiles 2575) or
frequencies. TNM8: 8th edition of the tumor, node, metastasis
stage; ATA risk: American Thyroid Association risk.
Post-treatment Whole-Body Scan performance
Two-hundred and forty (90%) patients showed no uptake outside the thyroid
bed, and 28 (10%) patients displayed distant uptake on ptWBS. Of the 28
patients with distant uptake at ptWBS, distant disease was confirmed in 19
(68%, true-positives), and nine (32%) were false-positives. The
false-positive uptakes on ptWBS were as follows: six on the thorax, two on the
spine, and one on the pelvis. Further studies clarified the false-positive
uptakes and revealed two cases of osteoarthrosis (inflammatory), one renal cyst,
and one adrenal incidentaloma ([Table
2]). Of note, no cause could be defined for five false-positive ptWBS
uptakes (indeterminate etiology).
Table 2 False-positive patients on post-treatment whole-body
scan uptake.
Patient
|
TNM8
|
ATA risk
|
ptWBS Uptake
|
Confirmatory exam
|
Type of uptake
|
Etiology
|
sPOTg TgAb
|
Current status
|
Relapse site
|
Follow up (years)
|
1
|
I
|
Low
|
Thoracic spine
|
CT and bone scintigraphy
|
Focal
|
Spine surgery
|
97.2 NR
|
Biochemical incomplete
|
None
|
6.4
|
2
|
I
|
Intermediate
|
Thorax
|
CT
|
Focal
|
Adrenal incidentaloma
|
89.8 NR
|
Indeterminate
|
None
|
5.7
|
3
|
II
|
Intermediate
|
Thorax
|
CT
|
Diffuse
|
Indeterminate
|
34.2 NR
|
Structural incomplete
|
Cervical
|
1.9
|
4
|
NA
|
NA
|
Lumbar spine
|
CT and bone scintigraphy
|
Focal
|
Ostheoarthrosis
|
4.9 NR
|
Excellent
|
None
|
1.1
|
5
|
I
|
Intermediate
|
Thorax
|
CT
|
Focal and Diffuse
|
Ground glass pneumopathy
|
18.2 NR
|
Excellent
|
None
|
2.7
|
6
|
I
|
Low
|
Thorax
|
CT
|
Focal
|
Indeterminate
|
NA
|
Excellent
|
None
|
2.2
|
7
|
I
|
Intermediate
|
Thorax
|
CT
|
Diffuse
|
Indeterminate
|
26.0 NR
|
Indeterminate
|
None
|
3.9
|
8
|
I
|
Intermediate
|
Pelvis
|
CT and bone scintigraphy
|
Focal
|
Renal cyst
|
NA
|
Structural incomplete
|
Cervical
|
2.1
|
9
|
I
|
Intermediate
|
Thorax
|
CT
|
Diffuse
|
Indeterminate
|
0.1 NR
|
Excellent
|
None
|
4.8
|
TNM8: 8th edition of the tumor, node, metastasis stage; ATA
risk: American Thyroid Association risk; ptWBS Uptake: Post-treatment
whole-body scan uptake; sPOTg: Stimulated postoperative thyroglobulin;
TgAb: Anti-thyroglobulin antibody; CT: Computed tomography; NR:
Non-reactive; RA: Reactive; NA: Not available.
Of the 240 patients with no uptake outside the thyroid bed, nine (4%) had
distant metastasis and were considered false-negative on ptWBS. The patterns of
metastasis of the false-negative uptake included five patients with pulmonary,
two with bone, and two with multiple metastasis sites.
The overall sensitivity of ptWBS for detecting distant metastasis was 68%
with a specificity of 96%. The Youden index was 0.64 and Kappa agreement
with the gold standard of 0.66.
ptWBS Performance According to ATA Risk
We next explored the ptWBS performance according with the ATA risk. Of the 268
DTC patients, 106 (40%) were classified as ATA low-risk, 127
(48%) ATA intermediate-risk, and 32 (12%) ATA high-risk. Three
patients were excluded due to a lack of information for ATA risk
classification
In the low-risk group, only 3 patients (3%) showed distant ptWBS uptake
outside the thyroid bed. Of these, two were considered false-positives and one
was a true-positive (29 years-old, man, pT2 Nx Mx, with diffuse and focal ptWBS
lung uptake and thorax CT micronodules; who was classified as biochemical
incomplete response after three years of follow up). No false-negative ptWBS
occurred in this group. The sensitivity and specificity of ptWBS for distant
metastasis were 33 and 98%, respectively. The performance of ptWBS for
ATA low-risk patients had a Youden index 0.31 and a Kappa of 0.27.
The group classified as ATA intermediate-risk included 121 patients (95%)
with no ptWBS uptake outside thyroid bed, and 6 (5%) with distant
uptake. Of these 6, 5 were considered false-positives, and one was a
true-positive (25 years-old, woman, pT2 N1b Mx; with cervical, mediastinal and
lung (focal and diffuse) ptWBS uptake. Cervical and thorax CT were unremarkable,
stimulated Tg 66.6 ng/ml. After three years of follow up the
patients was classified as biochemical incomplete response). False-negative
ptWBS occurred in three (2%) intermediate-risk patients rendering a
ptWBS sensitivity of 25% and a specificity of 96% for this
subgroup. The performance of ptWBS included a Youden index of 0.21 and a Kappa
of 0.19 for ATA intermediate-risk patients.
Among the 32 ATA high-risk patients, 13 (40%) had distant metastasis (3
diagnosed before and 10 after total thyroidectomy but before ptWBS) and 19
(60%) had locally advanced disease. Of all high-risk patients, 14
(44%) showed no uptake outside thyroid bed on ptWBS, and 18
(56%) had distant uptake. All positive ptWBS were considered
true-positives. There were 4 (13%) false-negative exams in the negative
ptWBS uptakes. These figures resulted in a ptWBS sensitivity and specificity of
82 and 100%, respectively ([Table
3]/[Fig. 1]). The
performance of ptWBS for ATA high-risk patients showed a Youden’s index
of 0.82 and a Kappa of 0.74.
Fig. 1 Post-treatment Whole-Body Scan uptake, stratified by the
American Thyroid Association (ATA) risk.
Table 3 Performance of post-treatment whole-body scan for
diagnosis of distant metastasis between different groups of
differentiated thyroid carcinoma.
|
All (n=268)
|
American Thyroid Association Risk
(n=265)*
|
Stimulated Postoperative Thyroglobulin
(n=174)*
|
Low (n=106)
|
Intermediate (n=127)
|
High (n=32)
|
<1 ng/ml (n=28)
|
1–10 ng/ml (n=71)
|
>10 ng/ml (n=75)
|
Sensitivity
|
0.68
|
0.33
|
0.25
|
0.82
|
NA
|
0.67
|
0.69
|
Specificity
|
0.96
|
0.98
|
0.96
|
1.00
|
0.96
|
0.99
|
0.92
|
Youden's index
|
0.64
|
0.31
|
0.21
|
0.82
|
NA
|
0.66
|
0.61
|
Kappa
|
0.66
|
0.27
|
0.19
|
0.74
|
0
|
0.23
|
0.70
|
*p=0.535 for ATA low-risk vs. ATA
intermediate-risk; p=0.065 for ATA low-risk vs. ATA high-risk;
p≤0.0001 for ATA intermediate-risk vs. ATA high-risk;
p=0.95 for sPOTg 1–10 ng/ml vs
sPOTg>10 ng/ml; and p=0.69 for
sPOTg<10 ng/ml vs.
sPOTg>10 ng/ml.
The weighted generalized score was used to compare the diagnostic performance of
ptWBS among the different ATA risk DTC populations. For low-intermediate versus
high-risk, the Youden index was 0.26 versus 0.82 (p=0.003),
respectively. When we excluded ATA low-risk patients and compared ATA
intermediate-risk versus high-risk patients, the Youden index was 0.21 versus
0.82 (p≤0.0001), respectively.
ptWBS Performance according to sPOTg
As an additional analysis, we stratified the DTC cohort according to sPOTg into
three categories:<1.0 ng/ml, between
1–10 ng/ml, and>10 ng/ml. Here,
174 patients with negative TgAb values were included (94 were excluded due to a
positive TgAb result). Twenty-eight patients had a
sPOTg<1.0 ng/ml, 71 patients had a sPOTg
1–10 ng/ml, and 75 had a
sPOTg>10 ng/ml.
We compared the performance of ptWBS for patients with
sPOTg≤10.0 ng/ml versus
sPOTg>10 ng/ml: The sensitivity values were 67%
versus 69%, specificity was 98% versus 92%, and the
Youden index 0.65 versus 0.61 (p=0.69), respectively. Comparing only
patients with sPOTg 1–10 ng/ml versus
sPOTg>10 ng/ml the sensitivity was 67% versus
69%, specificity was 99% versus 92%, and the Youden
index was 0.66 versus 0.61 (p=0.95) ([Table 3]).
Discussion
Management of DTC patients is under continuous review and has changed substantially
in recent years. The ptWBS test has pitfalls that limit its performance to the
diagnosis of distant disease mainly due to false-positive uptakes in patients with a
low pre-test probability of metastasis, jeopardizing the current recommendation for
ptWBS in all DTC patients. Indeed, the ptWBS performance for the overall DTC
population studied had 68% sensitivity, 96% specificity, a Youden
index of 0.64, and a Kappa of 0.66. We also showed that the ptWBS contribution for
disease management varies according to ATA risk categories with poor performance for
ATA low and intermediate-risk DTC patients (Youden index of 0.26). In ATA high-risk
DTC patients, however, the ptWBS has a good performance to identify those with
distant metastasis (Youden index of 0.82).
Recent studies have raised questions on the contemporaneous usefulness of ptWBS as an
adjunct diagnostic tool for DTC patients. Garger et al. showed that the incidence of
ptWBS false-positives is around 73% with predominance for focal uptake. The
etiology of false-positive uptakes cannot be determined in one-third of patients
[17]
[30]. Accordingly, we analyzed ptWBS after the
first activity of RAI and found five false-positive results patterns. The most
frequent are diffuse uptake on the thorax. Of note, in 5/9 (56%)
patients we could not determine the etiology of the false-positive uptake. These
findings are concerning because they add undesirable psychological stress and waste
healthcare resources.
We further explored the performance of the ptWBS exam according to DTC patient
characteristics. When stratifying the DTC population according to the ATA risk, the
performance of ptWBS for ATA low-intermediate-risk was poor with a predominance of
false-positive uptakes over true-positives. This leads to a higher risk of
misdiagnosing patients. Fewer than 1% of low-intermediate-risk patients
could benefit from ptWBS. In addition, only two patients stratified as ATA low to
intermediate risk with true-positive ptWBS, met sPOTg values criteria for distant
metastasis screening (sPOTg>10 ng/ml). Thus, ptWBS did not
bring additional benefit to the follow-up strategy.
We also analyzed the ptWBS performance for the subgroup of ATA intermediate-risk
patients, and the performance of the exam was also poor (sensitivity 25% and
specificity 96%). These findings agree with recent data published by Agate
et al. that showed a low diagnostic value of ptWBS for low and intermediate risk DTC
patients [16].
On the other hand, our data illustrates that ptWBS has high sensitivity
(82%), specificity (100%), and good agreement (Kappa 0.74) for ATA
high-risk patients. It is useful in the diagnosis of distant metastasis in a
substantial portion of this subgroup of patients. These results suggest that if the
pre-test probability for distant metastasis is high, then the exam has a positive
predictive value of 100%.
The role of sPOTg has been well established in adult DTC populations. Thus, we
further stratified the DTC cohort according to sPOTg levels using a cut-off point of
10 ng/ml as suggested by a systematic review of sPOTg prognostic
value [23]. However, the sPOTg cut-off of
10 ng/ml was not adequate to identify subgroups of DTC patients with
better ptWBS performance.
Some limitations of our study should be acknowledged. The relatively low number of
high-risk patients raises for caution on definitive conclusions for this subgroup of
individuals. Also, the 2015 ATA Guidelines updated the ATA 2009 risk stratification
system, incorporating BRAFV600E tumor mutation status and detailed information on
the number and size of lymph node metastasis. Whereas the new system may provide an
improvement in the risk assessment, it also poses barriers regarding its
applicability. Unfortunately, the retrospective design of our study did not allow to
incorporate the ATA 2015 risk stratification system, given that we could not obtain
all required criteria to consistently classify our patients. Also, we were not able
to perform 18-FDG-PET/CT studies since only recently, this technique was
available (although still of restricted use) in our center. Another aspect that
merits consideration is that we did not use ptWBS associated with SPECT/CT.
SPECT/CT improves the diagnostic accuracy of ptWBS [7] and has been a useful tool to clarify
localization of ptWBS uptakes. However, in this study we exclusively evaluated the
performance of ptWBS considering that the SPECT/CT is not available in many
healthcare scenarios, a common place reality for many healthcare facilities in low
and middle income countries. Finally, the study was conducted in a single DTC
referral center. Thus, it would be of interest to replicate this study in different
contemporaneous cohorts.
In summary, our results suggest that we should reconsider the routine use of ptWBS
for all DTC patients. It remains useful in ATA high-risk DTC patients.
Author Contributions
CFN, RSS, ALM, and JMD contributed to the study conception, design, data analysis,
interpretation, and manuscript preparation. CFN, RSS, ABZ, FZ, ALM, and JMD were
responsible for data collection, data analysis, and manuscript preparation. All
authors read and approved the final manuscript version.
Availability of Materials and Data
Availability of Materials and Data
All medical records are restricted to maintain patient privacy and avoid unnecessary
exposure but can be assessed under request.