Key words thorax - CT - technical aspects - image manipulation/reconstruction
Introduction
The improvement of the spatial and temporal resolution of multidetector computed tomography (MDCT) resulted in a significant increase in the indication for the method and thus an increase in the cumulative radiation exposure [1 ]
[2 ]
[3 ]. In a European comparison, the greatest number of X-ray examinations per capita with the consequently highest radiation exposure per inhabitant and per year is performed in Germany (effective dose, ED = 1.9 mSv/a) [4 ]. Therefore, radiation exposure reduction has become a central topic in diagnostic radiology [5 ]. Since the thorax is the third most common region for CT examination in Germany and represents the second highest collective dose [4 ], even minimal dose reductions in the individual examinations affect the collective ED. While the use of lead and bismuth shields is controversial [6 ]
[7 ]
[8 ], the use of anatomy-based dose modulation is established and has been substantiated by a number of studies [9 ]
[10 ]. Adaptive z-collimation [11 ] can effectively reduce the over-ranging that becomes an issue with an increasing detector panel width. It was able to be shown in numerous phantom-based and follow-up studies that the image quality in chest CT can be improved by iterative reconstruction algorithms with acceptable reconstruction times [12 ]
[13 ] resulting in the potential for dose reduction [14 ]
[15 ]
[16 ]. In iterative reconstruction (IR), an output image is initially calculated from the measured projection data. From this image a comparison dataset is then calculated via a forward projection and is compared to the output data. Depending on the type of algorithm, this is performed separately in the image data room or with inclusion of the raw data room. If the results deviate from one another, a correction image is calculated. Data is then again synthesized from this correction image and compared to the measured data. This iteration loop is repeated until a defined termination criterion is fulfilled.
To establish IR at a reduced radiation dose for patients, the goal of this study was to compare the image quality of standard reconstruction (filtered back projection, FBP) at a full dose (FD) to the image quality of an image-based iterative reconstruction at a half dose (HD) in a single-phase and intraindividual study design (non-inferiority study) and to evaluate clinically relevant lesions in addition to the already published parameters.
Materials and Methods
Patients
52 consecutive patients with a clinical indication for contrast-enhanced chest CT were prospectively examined on a 128-row dual-source (DS) unit (Somatom Definition Flash, Siemens Healthcare, Forchheim, Germany). The most common indication was tumor staging (n = 42) followed by dyspnea of an unclear origin (n = 6) and the search for an infectious focus (n = 4). 38 men and 14 women with an average age of 62 ± 13 years and a body mass index of 26 ± 5 kg/m2 were included. The exclusion criteria were known intolerance to contrast agents, renal insufficiency (eGFR < 60 ml/min/1.73 m2 ), hyperthyroidism (TSH < 0.3 mU/l) and BMI > 40 kg/m2 . The last criterion was selected to rule out incomplete exposure by the smaller of the two tube-detector systems (field of measurement (FOM) = 33 cm). Written informed consent after clarification by the physician was available for all patients and the study was approved by the ethics commission.
CT technique
All examinations were performed with a tube voltage of 120 kV, a referenced tube current time product of 100 mAs with real-time tube current modulation (CareDose 4 D, Siemens Healthcare, Forchheim, Germany), a rotation time of 0.5 s, a pitch of 0.8, and a total collimation of 38.4 mm (64 × 0.6 mm) using a z-spring focus. The calculated tube current was divided equally between the two X-ray tubes. All patients received 0.4 g of iopromide (Ultravist 370® , Bayer Healthcare, Berlin, Germany) per kg bodyweight with an automatic 2-piston injector (Stellant D CT, Medrad Inc., Warrendale, PA, USA) and a flow rate of 3 ml/s followed by 50 ml of 0.9 % saline solution administered intravenously via a peripheral indwelling venous cannula. The scan delay was 50 s.
Dose calculation
The radiation exposure was estimated from the dose protocol on the basis of the volume CT dose index (CTDIvol ) and the dose length product (DLP). The effective dose was calculated from the DLP via the conversion factor for chest CT (k = 0.0145 mSv/mGy*cm) [17 ].
Image reconstruction
The separate and simultaneous reading out of the data from the two detector systems makes it possible to calculate an image dataset with a half dose (HD) as well as a full dose (FD) in a single examination without an additional radiation dose for the patient. The reference was the FD-FBP datasets to which the HD-IR and HD-BP images were compared ([Fig. 1 ]). Each raw dataset was calculated with a high-resolution (B70 / I70) and a soft (B40 / I40) convolution kernel with a slice thickness of 5 mm and 1 mm and a reconstruction increment of 5 mm and 0.7 mm, respectively. IR was performed using the IRIS algorithm (iterative reconstruction in image space, Siemens Healthcare, Forchheim, Germany). 5 iteration steps were performed for the calculation of each HD-IR dataset in this study. The duration of the thick-slice reconstructions was recorded for the individual datasets.
Fig. 1 Flowchart of image reconstruction: Only detector A is read out for half-dose image reconstructions. Iterative reconstructions repetitively compare image aspects such as edge information and image noise with the output image calculated via filtered back projection.
Image quality
After anonymization and removal of all acquisition and reconstruction parameters, all image series were evaluated on a 3 D workstation (Syngo MMWP VE36A, Siemens Healthcare, Erlangen, Germany). The objective image quality was measured at the height of the maximum extension of both heart chambers as a standard deviation of the attenuation values of round measurement areas in the left and right ventricle, the descending aorta, the autochthonous back musculature on the right and left of the spinal column and in the air ventral to the thorax ([Fig. 2 ]). The measurement areas were placed in an identical position in all 3 datasets via the copy function while avoiding artifacts. The subjective image quality was evaluated by two radiologists (14 and 4 years professional experience) in accordance with the European Guidelines on Quality Criteria for Computed Tomography [18 ]. The lung parenchyma was evaluated in the sharp convolution kernel with window values of 1700/–600 (width/center) and the mediastinal structures were evaluated in the soft convolution kernel at 400/40 (width/center). The complete image of the thoracic wall, thoracic aorta, vena cava, intrathoracic vessels, heart, and lung parenchyma and the sharp visualization of 11 mediastinal structures (front mediastinum, trachea, main bronchi, paratracheal tissue, lymph nodes, esophagus, heart, aortic contour, aortic wall, superior vena cava, inferior vena cava, other mediastinal vessels) and 9 pulmonary structures (costal pleura, mediastinal pleura, lung fissures, segmental pulmonary vessels, subsegmental vessels, segmental bronchi, subsegmental bronchi and bronchioles, secondary lung lobules, lung parenchyma) were evaluated on the basis of a dichotomous Likert scale (1 –completely or clearly visualized, 2 – incompletely or unclearly visualized) [18 ]. General image aspects such as subjective image noise and spatial resolution were evaluated on a 3-point scale (1 – too low, 2 – optimal, 3 – too high), and the general diagnostic applicability was evaluated using a 4-point Likert scale (1 – highly acceptable, 2 – minimally acceptable, 3 – acceptable with restrictions, 4 – unacceptable) [18 ]. The same scale was used for additional evaluation of pathological lesions with respect to contrast and edge sharpness. In the event of multiple lesions of one entity, the smallest was always used for evaluation. Moreover, image artifacts (spiral artifacts, streak artifacts, truncation artifacts, and a coarse image appearance) were also evaluated according to a 4-point scale (1 – no artifact, 2 – artifact with moderate effect on the visualization of a structure, 3 – artifact with negative effect on the visualization of a structure, 4 – artifact impairing diagnosis).
Fig. 2 Objective data analysis via voxel evaluation in the right and left ventricle, in the descending aorta, bilateral in the erector spinae and in air.
Statistics
After inclusion of the first 16 patients, a case number estimation for an equivalence hypothesis (non-inferiority) of the HD-IR data compared to the FD-FBP data was performed on the basis of a preliminary evaluation of the quantitative image quality with a statistical power of 80 % [19 ]. All results are given as mean ± standard deviation (SD) for continuous variables and as percentages and ranges for categorical variables. The equality of normally distributed, interval-scaled data was checked via one-way analysis of variance (ANOVA) and then compared pair-wise via post-hoc tests, via Tamhane’s T2 test in the case of inequality of the variances in Levene’s test and via the Bonferroni test in the case of variance equality. Since the ordinal-scaled data of the qualitative image analysis were not distributed normally, the non-parametric Friedman test and then a post-hoc test according to Schaich were used [20 ]
[21 ]. The agreement between the examiners was checked for the categorical parameters by the Cohen’s Kappa test. The significance level was defined as < 0.05. The total statistical evaluation was performed with the software PASW Statistics 18.0 (SPSS Inc., Chicago, IL).
Results
To rule out inferiority of the HD-IR data in the 90 % confidence interval with 80 % reliability, a case number of 50 patients at a maximum allowed image noise deviation of 10 % was calculated.
The average examination length (z-axis) was 38 ± 4 cm with an average examination time of 6.2 ± 0.6 s. The anatomically controlled tube current modulation resulted in 82 ± 36 effective mAs (range: 33 – 236 eff. mAs), corresponding to approx 41 eff. mAs per X-ray tube. Therefore, the median values were: FD-CTDI 6.2 mGy (range: 2.9 – 20.2 mGy), FD-DLP 245 mGy*cm (range: 111 – 858 mGy*cm) and FD-ED 3.6 mSv (range: 1.6 – 12.4 mSv). The reconstruction of a chest image series (5/5) took 12 ± 0.4 s, corresponding to an average reconstruction speed of 6.3 images/s, for the FBP algorithm and 55 ± 2.1 s, corresponding to 1.3 images/s, for IR. This results in a reconstruction speed that is 4.7 times faster for FBP than for IR. The selection of the filter kernel did not affect the reconstruction speeds.
Objective image quality
A balanced vascular contrast of approx. 130 HU was generated with the selected contrast agent protocol. The average density values did not show any significant differences in the various image reconstructions ([Table 1 ]). As expected, an increase in image noise in the intracorporeal measurement regions of approx. 40 % was observed in the HD-FBP reconstructions compared to HD-IR and FD-FBP. With an increase of 34.8, the noise measurement in air is lower compared to FD-FBP due to the limited density scale at –1024 HU and the associated shift of the distribution curve of the measured values. In contrast, the differences in the measured image noise in the full-dose and IR datasets were not significant (p = 0.25). The average image noise in HD-IR was –1.3 % less than in FD-FBP ([Table 2 ]).
Table 1
Mean size [pixels] and density values [HU] of the regions of measurement.
descending aorta
right ventricle
left ventricle
musculature right
musculature left
air
Field of measurement [pixels]
383
987
1244
171
205
2722
FD-FBP [HU]
137
129
128
55
58
–999
HD-IR [HU]
137
133
130
53
54
–998
HD-FBP [HU]
138
133
131
53
55
–999
p-value
0.99
0.62
0.32
0.54
0.43
0.39
Table 2
Image noise [HU] and deviation from FD-FBP as percentage.
field of measurement
FD-FBP
HD-IR
HD-FBP
descending aorta
20.9
19.5 (–6.7 %)
30.6 (+ 46.2 %)
right ventricle
20.5
19.4 (–5.1 %)
29.6 (+ 44.4 %)
left ventricle
21.6
21.2 (–1.9 %)
30.2 (+ 40.1 %)
right musculature
20.2
20.4 (+ 0.7 %)
28.9 (+ 43.2 %)
left musculature
20.7
20.4 (–1.4 %)
28.8 (+ 39 %)
air
10.2
10.8 (+ 6.7 %)
13.7 (+ 34.8 %)
Subjective image quality
Using the evaluated categorical parameters, both examiners achieved an average agreement of κ = 0.493. With κ = 0.592 the value for the FD dataset was highest followed by HD-IR (κ = 0.483) and HD-FBP (κ = 0.425). According to Landis and Koch, there is moderate agreement for all 3 datasets [22 ].
Image impression
The evaluation of the three datasets by both examiners resulted in statistically significantly better values for FD-FBP and HD-IR (p < 0.001) compared to HD-FBP with respect to the general diagnostic value. The differences between FD-FBP and HD-IR were not significant. Of the 52 datasets, 30 (58 %) were evaluated as highly acceptable, 21 (40 %) as minimally acceptable, and 1 (2 %) as acceptable with restrictions after image reconstruction with HD-FBP. In contrast, only one IR dataset and one FD dataset (2 %) were evaluated as minimally acceptable while the rest were evaluated as highly acceptable (n = 51; 98 %). The subjective image noise was evaluated as too high in 98 % (n = 51) of the HD-FBP datasets and in only 4 % (n = 2) of the FD-FBP and HD-IR datasets. Insufficient noise was not found. Only the differences between HD-FBP and FD-FBP/HD-IR were significant. The achieved spatial resolution was classified as optimal for all sectional image series of the 3 datasets.
Anatomy
It was possible to completely visualize the thoracic wall, thoracic aorta, vena cava, heart, lung parenchyma, and contrast-filled vessels in all three datasets of the patient collective. The FD-FBP and HD-IR datasets did not show any significant differences in the visualization of mediastinal structures (p = 0.865) and pulmonary structures (p = 0.141). A loss of quality was only seen in the depiction of the subsegmental bronchi and bronchioles in the HD-IR datasets in the individual comparison (p = 0.008). However, the sharp visualization of both the mediastinal and the pulmonary structures was significantly worse in HD-FBP than in HD-IR and FD-FBP (p ≤ 0.018). In the individual comparison, significant differences in the mediastinal vascular visualization (inferior vena cava, heart, aortic contour and wall) were described. The depiction of the lungs showed significant advantages for the comparison between FD-FBP and HD-FBP with regard to the majority of HR structures (subsegmental vessels, subsegmental bronchi and bronchioles, secondary lung lobules, and lung parenchyma) and for the comparison between HD-IR and HD-FBP with regard to the depiction of the lung parenchyma ([Table 3 ]).
Table 3
P-values of the statistically significant post-hoc tests. Non-significant p-values of the subgroups as well as anatomical structures and lesions without significant results in the Friedman test are not shown.
post-hoc
FD-FBP vs. HD-IR
FD-FBP vs. HD-FBP
HD-FBP vs. HD-IR
mediastinal anatomy
0.865
< 0.001
< 0.001
heart
0.012
0.005
aortic contour
0.002
0.005
aortic wall
0.008
0.012
inferior vena cava
0.039
pulmonary anatomy
0.141
< 0.001
0.018
subsegmental vessels
0.039
subsegmental bronchi and bronchioles
0.008
0.001
secondary lung lobules
0.039
lung parenchyma
0.05
0.05
contrast of the lesions
0.783
0.054
0.099
marginal definition of the lesions
0.855
0.001
0.002
pulmonary circular foci
0.022
0.045
soft tissue lesions
0.005
0.007
Lesions
Overall, 141 pathological changes, 93 in the lung parenchyma ([Fig. 3 ], [4 ]) and 48 in the scanned soft tissues ([Fig. 5 ], [6 ]), were documented in 52 patients. 43 (30 %) were classified as malignant and 98 (70 %) as benign. For the evaluation of the subjective image quality, these were classified on the basis of the different underlying morphologies as changes in the lung framework (n = 38), pulmonary circular foci (n = 55), vascular changes (n = 15), and soft tissue lesions (n = 33). The lesions were between 1 and 44 mm in size (average value 7 mm). While no significant differences in the contrast of the lesions with respect to the surrounding tissue could be shown in the three datasets in the individual comparison, the HD-FBP reconstruction showed statistically significant edge enhancement disadvantages with regard to intrapulmonary circular foci and soft tissue lesions. The statistically significant differences in the edge sharpness of lung framework changes in all three datasets in the Friedman test (p = 0.011) could not be confirmed in the pair-wise post-hoc tests ([Table 3 ]).
Fig. 3 46-year-old male (BMI = 28 kg/m²) with dyspnea. 1-mm slice thickness and sharp convolution kernel for FD-FBP A/a , HD-IR B/b und HD-FBP C/c . Centrilobular nodules, consolidations and thickening of the bronchial walls in sarcoidosis are visible in all 3 reconstructions.
Fig. 4 Top: 65-year-old male (BMI = 24 kg/m²) with fever and dyspnea. 5-mm slice thickness and high-resolution reconstruction kernel for FD-FBP A , HD-IR B and HD-FBP C . Pneumonia: consolidations (short arrow) and central parts of the ground-glass opacity are visible in all three reconstructions. However, peripheral expansion of the ground-glass opacity (long arrow) is hardly distinguishable from the background image noise in HD-FBP. Bottom: 56-year-old male (BMI = 17 kg/m²), follow-up for oral squamous cell carcinoma. Centrilobular emphysema of the lungs: Distinction between residual lung parenchyma and bullae (arrow) is limited in HD-FBP (C) due to the increased noise pattern and similar in HD-IR (B) and FD-FBP (A).
Fig. 5 49-year-old male (BMI = 24 kg/m2) with known bronchial carcinoma. 5-mm slice thickness and soft convolution kernel for reconstruction. Mediastinal lymph node metastasis (white arrow): The edge sharpness in FD-FBP A and HD-IR B is superior to that of HD-FBP C , while the lesion contrast is similar in all three datasets.
Fig. 6 65-year-old male (BMI = 27 kg/m2) for staging of cholangiocellular carcinoma. 5-mm slice thickness and soft convolution kernel for reconstruction. Pericardial effusion: The contour sharpness in FD-FBP a and HD-IR b is superior to that of HD-FBP c , while the contrast to the surrounding tissue is similar in all three datasets.
Artifacts
Streak artifacts occurring primarily at high-contrast tissue boundaries, e. g. the heart (n = 35), the erector spinae (n = 27), the ribs (n = 8), and around the head of the humerus (n = 6), were detected most frequently. They occurred significantly more frequently in HD-FBP than in FD-FBP (p < 0.001) and HD-IR (p < 0.001). They affected the image quality of the HD-FBP images in 29 cases. This was not the case in the remaining datasets (p = 0.492). Diagnosis-limiting artifacts were not observed. With respect to the image structure, a pixilated appearance of the HD-IR reconstructions with a moderate effect on the visualization of the structures was seen in 24 patients (46 %). This was uniformly distributed across the entire volume dataset and was not considered to negatively affect diagnosis. Distribution curves of the frequency spectrum of the image noise in the liver parenchyma calculated as an example show a reduced noise-power spectrum ([Fig. 7 ]) here as a verifiable correlate, as also published in the literature on the basis of a phantom model [23 ]. This did not occur in the FBP reconstructions. Spiral or truncation artifacts were not observed.
Fig. 7 Distribution of the noise power spectrum (NPS) in FD-FBP A , HD-IR B und HD-FBP C : Shift of the peak to the left (lower spatial frequencies on the x-axis) in HD-IR as a correlate for the coarse image appearance.
Discussion
Without negatively affecting attenuation values and contrast, a constant image noise can be achieved via iterative image reconstruction even at a half dose in relation to reconstructions via filtered back projection. In contrast to HD-FBP datasets, an image quality comparable to that of FD-FBP could be achieved with IR. In contrast to HD-FBP, streak artifacts were significantly reduced with IR, and in contrast to FD-FBP, the noise texture was changed to a coarse image appearance and disadvantages in the depiction of the subsegmental bronchi and bronchioles were found.
Due to the need for high-performance computers, iterative reconstructions were not compatible with the clinical routine for a long time. The steadily increasing capacity of processors and the development of new algorithms with shifting of the iterative correction loops from the raw data room to the image data room greatly increased the application area [24 ]
[25 ]. There are currently different algorithms for iterative reconstruction that allow correction of artifacts in some cases via access to the raw data [26 ]. Either an improvement in image quality at an equivalent dose or constant image quality at a lower dose could be shown for different issues and examination regions [27 ]
[28 ]
[29 ]. However, the image-based algorithm examined in the present study allows calculation of iterative datasets regardless of the raw data and can therefore be used regardless of the time of acquisition for already archived image data. In a phantom study, Ghetti et al. were able to show a reduction of both the nominal image noise and the noise power spectrum while maintaining the spatial resolution [23 ]. Pontana et al. reported advantages with respect to the subjective image appearance and the objective image quality at a 30 % dose reduction compared to the standard dose and FBP with a prototype of the IRIS algorithm in a follow-up study design using five iteration steps [14 ]. Hwang et al. had a comparable study approach and found the objective image quality at a half dose and using IRIS to be slightly worse compared to FD-FBP with respect to the edge sharpness of mediastinal structures [30 ], while the subjective image quality was increased [31 ]. The present study also shows slight advantages of HD-IR compared to the reference dose and FBP with respect to the anatomical structures on the basis of standardized evaluation criteria (European Commission on Radiological Protection). However, these are largely without statistical significance in contrast to HD-FBP. The additional evaluation of pathological changes makes it possible to check clinically relevant image aspects in the single-phase, intraindividual comparison with elimination of evaluation distortion due to recall bias, change in the orientation in the room, and morphological changes in the time course.
In detail, the image noise of IR at a half dose showed no statistical significance with respect to conventional image data at a full dose. In contrast, as expected, the FBP reconstructions followed the physical dependence of image noise on the radiation dose at a constant tube voltage by the factor √2. The iterative reconstruction algorithm did not affect the density values. Therefore, the nominal image contrast is maintained. The global evaluation of the datasets did not yield any statistically significant differences in the subjective image quality between FD-FBP and HD-IR. In contrast, HD-FBP was inferior in this regard. In the detailed examination of the anatomical structures, FD-FBP and HD-IR are equally superior to HD-FBP with respect to the depiction of mediastinal soft tissues and the general visualization of the lung. In contrast, the differences in the visualization of the HR structures are significantly smaller between the three datasets with advantages for HD-IR compared to HD-FBP and disadvantages compared to FD-FBP. In the comparison of FD-FBP and HD-IR, this is only significant in the evaluation of the subsegmental bronchi and bronchioles.
HD-IR reconstructions achieve image quality that is comparable to that of FD-FBP and superior to that of HD-FBP even with respect to the edge sharpness of intrapulmonary circular foci and soft tissue lesions as well as to the severity of beam-hardening artifacts.
Disadvantages of HD datasets compared to FD for the visualization of lung framework changes were above the level of statistical significance. However, almost half of all HD-IR datasets have an unfamiliar coarse pixel structure. This subjective impression can be objectively confirmed on the basis of histograms of the noise power spectrum.
Despite good image quality evaluations, limitations of the present study in addition to the unfamiliar image appearance and the longer reconstruction time compared to that of FBP must be taken into consideration. These relate in particular to the need to optimize the relationship between dose reduction and the effectiveness of the iterations. The software version we used does not allow deviation from the 5 predefined iteration loops. Additional iterations could result in a further reduction of the image noise while fewer iterations could result in a less unfamiliar image appearance. In the present study design only equal splitting of the tube current to the two X-ray tubes was examined. Step-wise deviations with unequal parts would also be conceivable here. The diagnostic evaluation in the present study was performed purely retrospectively. The effect on clinical evaluation and possible treatment decisions could not be evaluated. The patients were also not selected according to indication and pathologies to be expected. The percentage of inflammatory lesions, for example, was relatively low. The heterogeneous entity of the pathological lesions could differ greatly in a different collective. In addition, a bias regarding lesion evaluation could not be completely ruled out in the evaluation by two separate radiologists with different levels of professional experience. The latter limitations could be responsible for the minor difference in the results compared to the study populations and radiologists of Hwang et al.
The present image quality results achieved with iterative reconstructions of generated computed tomography scans of the thorax at a radiation dose of only 50 % compared to a conventional examination protocol provide the foundation for additional studies for optimizing dose reduction using iterative reconstruction algorithms and their prospective application in the clinical routine.