Keywords Likert - MRI - rectal cancer - TRG
Introduction
Magnetic resonance imaging (MRI) has an established role in baseline staging, post-treatment restaging, and surgical planning in rectal cancer. Preoperative chemoradiation is used to reduce tumor bulk and sterilize the surgical field. Apart from killing tumor cells, it also induces inflammation, fibrosis, submucosal edema, and mucinous change in the tumor.[1 ] The tumor response to treatment is categorized on histopathology using pathological tumor regression grade (pTRG), which correlates with patient outcomes.[2 ]
[3 ]
Few MRI studies have suggested that an MRI-based tumor regression grade (mrTRG) on the pre-operative MRI can be used to predict pTRG; however, the reported accuracy of mrTRG varies in different studies.[4 ]
[5 ]
[6 ]
[7 ] An objective mrTRG may be unable to account for findings like mucinous changes or thick fibrosis with minimal heterogeneity or irregularity. A subjective Likert scale may be able to assess these better. A study analyzing prostatic lesions and comparing Likert score with Prostate Imaging – Reporting and Data System category found the former to be more accurate.[8 ] Also, the ability to accurately differentiate among various TRG scores among patients with incomplete response is not that important clinically, as all these cases need the same management, namely surgery. On the other hand, the ability to predict pathologic complete response (pCR) confidently becomes more pertinent, given the potential for adopting a “Watch and Wait” program for these patients.[9 ]
[10 ]
[11 ]
[12 ]
The study aims to evaluate the ability of MRI in assessing treatment response comparing an objective mrTRG score and a subjective Likert score, with a focus on the ability to predict pCR.
Materials and Methods
Subjects
This is a retrospective study approved by the Institutional Review Board. One-hundred seventy consecutive cases of histopathologically proven rectal cancer were identified from the database of our colorectal disease management group over 5 years. We excluded patients operated more than 8 weeks after MRI (1 patient) and MRIs lacking adequate diagnostic quality (3 patients). All the remaining 166 patients had an MRI done after receiving neoadjuvant chemoradiation and prior to surgery. One-hundred forty-one of one-hundred sixty-six patients received presurgical chemoradiation followed by surgery. Twenty-five of one-hundred sixty-six patients that did not respond adequately to the chemoradiation on MRI and clinical examination received a 6-week course of chemotherapy, followed by a second post-treatment MRI and subsequent surgery. All 166 cases underwent definitive surgery within 8 weeks of their latest pre-surgical MRI. We included patients that had a baseline MRI (36/166) or post-treatment MRI (8/166) done outside our institute but of optimum diagnostic quality, with the Digital Imaging and Communications in Medicine (DICOM) images available to be uploaded on the institutional picture archiving and communication system (PACS). We also included patients without a baseline MRI and only the post-treatment MRI on PACS (56/166), since only the post-treatment MRIs were being assessed. Diffusion-weighted imaging (DWI) sequence was available in 151/166 cases.
Clinical and Histopathology Data
The demographic data, histopathology, and treatment details for the patient were taken from the institutional electronic medical records. The surgical specimens were reported by dedicated gastrointestinal oncopathologists at our institute. The detailed final histopathology, including pTRG, pathological T stage (pT), pathological N stage (pN) and presence of mucin (cellular and acellular), was documented in all patients. The method of Mandard et al[13 ] was used to evaluate pTRG. pTRG 1 indicated complete response; pTRG 2 indicated predominant fibrosis with rare residual cancer cells scattered through the fibrosis; pTRG 3 indicated presence of fibrosis and tumor cells, with fibrosis still being predominant; pTRG 4 indicated residual cancer outgrowing fibrosis; and pTRG 5 indicated absence of regressive changes.
Imaging and Image Analysis
MRI studies were performed on 1.5T (GE Healthcare, Milwaukee, Wisconsin, United States), 1.5T (Philips Medical Systems, Eindhoven, the Netherlands), or 3T (GE Healthcare, Milwaukee, Wisconsin, United Stats) machines in our institute using the institutional protocol for MRI rectum. This included large field of view (FOV) T1-weighted (T1W) and T2-weighted (T2W) axial sequences, T2W sagittal sequence of the pelvis, high-resolution small FOV thin oblique axial and oblique coronal T2W sequences along the plane of the rectal tumor, and axial DWI ([Supplementary Material Table 1 ] in Appendix). Intravenous contrast was not administered in accordance with the institutional protocol.
The images were reviewed on Centricity PACS (GE Healthcare, Milwaukee, Wisconsin, United States) workstation by two oncoradiologists in consensus, with 11 and 9 years of experience. The radiologists were blinded to the histopathology findings. An objective MR predicted tumor regression grade (mrTRG) was determined ([Table 1 ]).[14 ] A 5-point subjective Likert scale was also assigned with score of 1 to 5, Likert 1 meaning highly likely to be pCR and Likert 5 meaning highly unlikely to be pCR ([Table 2 ]). DWI and apparent diffusion coefficient (ADC) images were visually assessed as well. For patients without DWI (n =15), mrTRG was based purely on T2WI; presence of thick or irregular fibrosis was scored mrTRG2 while minimal fibrosis was scored mrTRG 1.
Table 1
Summary of MRI regression scale used in our study
mrTRG scale
mrTRG
1
Radiological complete response: no evidence of the treated tumor, or presence of T2 hypointense fibrosis without diffusion restriction
2
Good response (T2 hypointense fibrosis with restricted diffusion or minimal mucinous changes, without definite T2 intermediate signal intensity tumor)
3
Moderate response (visible T2 intermediate signal signifying tumor with variable fibrosis or mucinous changes)
4
Slight response (little areas of fibrosis or mucin; predominantly residual tumor)
5
No response or worsening disease
Abbreviations: MRI, magnetic resonance imaging; mrTRG, MRI-based tumor regression grade.
Table 2
Subjective Likert scale used in the study
Likert score
Subjective likelihood of complete response (pCR)
1
Highly likely
2
Likely
3
Equivocal
4
Unlikely
5
Highly unlikely
Abbreviation: pCR, pathologic complete response.
Statistical Analysis
The SPSS (the statistical package for social sciences), IBM Corp, released 2017, IBM SPSS Statistics for Windows, Version 25.0. Armonk, NY: IBM Corp was used for statistical analysis. Receiver operating characteristic curves were constructed to determine the ability of Likert scale, mrTRG and DWI to predict complete response, with postoperative histopathology being the gold standard. The optimal cutoff points on the scale of 1 to 5 were obtained for mrTRG and Likert scale with the greatest sum of sensitivity and specificity using the Youden Index. Further, the sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, positive predictive value (PPV), negative predictive value (NPV), and accuracy with 95% confidence intervals (CI) were calculated for mrTRG, Likert scale, and DWI sequence.
Results
One-hundred nine of one-hundred sixty-six (65.7%) patients were males, with a median age of 50 years (range: 29–71 years). Fifty-four of one-hundred sixty-six (33%) patients had pCR on pathology. The mrTRG, Likert, and pTRG distribution of the total number of patients is shown in [Fig. 1 ].
Fig. 1 Bar chart showing comparative analysis of Likert, magnetic resonance imaging-based tumor regression grade (mrTRG), and pathological tumor regression grade (pTRG).
The respective distribution of pTRG among patients assigned different Likert scores and pTRGs is given in [Table 3 ]. Six of nine cases (67%) that were Likert 1 showed pCR (pTRG 1), 2/9 (22.2%) showed pTRG 2, and 1/9 (11%) was pTRG 3. Out of the 35 cases that were Likert 2, 21/35 (60%) were pCR, while 7/35 (20%) were pTRG 2. If Likert scores 1 and 2 were considered together, then 27/44 (61%) cases were pCR, while 9/44 (20%) cases were pTRG 2. The percentage of pCR for different Likert scores is given in [Table 4 ].
Table 3
Case-wise correlation of Likert score and pTRG
pTRG
Likert
1
2
3
4
5
1
6 (67%)
21(60%)
21
4
3
2
2 (22%)
7 (20%)
18
15
3
3
1
0 (11%)
0
4
3 (20%)
0
12
5
9
4
8
10
9
5
1
3
1
Total
9
35
60
37
25
Abbreviation: pTRG, pathological tumor regression grade.
Table 4
Number of cases showing pCR when different subsets of Likert score are considered
Likert
No. of cases (n )
pCR
1
9
6 (67%)
1 + 2
44
27 (61%)
3 + 4 + 5
122
28 (23%)
Abbreviation: pCR, pathologic complete response.
With regard to mrTRG ([Table 5 ]), 8/9 (89%) cases of mrTRG 1 were pCR, 1/9 was pTRG 2, and none was pTRG 3-5. Among the 60 cases that were mrTRG 2, 29/60 (48%) were pCR, 15/60 (25%) were pTRG 2, and 16/60 (27%) were pTRG 3-5 ([Fig. 2 ]). If mrTRG 1 and 2 were considered together, then 37/69 (54%) cases were pCR and 16/69 (23%) were pTRG 2. The percentage of pCR for different mrTRG is given in [Table 6 ].
Table 5
Case-wise correlation of mrTRG and pTRG
pTRG
mrTRG
1
2
3
4
5
1
8 (89%)
29 (48%)
12
5
1
2
1 (11%)
15 (25%)
21
7
1
3
0
0 (0%)
0
11
4 (27%)
1
11
7
2
4
12
11
3
5
1
3
0
Total
9
60
57
33
7
Abbreviations: mrTRG, magnetic resonance imaging-based tumor regression grade; pTRG, pathological tumor regression grade.
Table 6
Number of cases showing pCR when different subsets of mrTRG are considered
mrTRG
No. of cases (n )
pCR
1
9
8 (89%)
1 + 2
69
37 (54%)
3 + 4 + 5
97
18 (19%)
Abbreviations: mrTRG, magnetic resonance imaging-based tumor regression grade; pCR, pathologic complete response.
Fig. 2 A 43-year-old male patient of moderately differentiated rectal carcinoma. Axial T2-weighted image of the pre-treatment magnetic resonance imaging (MRI) showed a T2 intermediate signal ulceroproliferative growth arising from the posterior wall of the lower and mid-rectum (arrow in A). The patient received neoadjuvant chemoradiation. The post-treatment MRI (B and C) showed decrease in the size of the rectal growth with T2 dark fibrosis (arrow in B) and restricted diffusion seen on diffusion weighted sequences (arrow in C). The patient was given a score of MRI-based tumor regression grade and Likert 3. He underwent abdominoperineal resection. Histopathology showed no residual viable tumor with pathologic complete response.
The most accurate cutoff point for the mrTRG to predict complete response was 2.5 (using Youden index), with a sensitivity of 69.2%, specificity of 69.6%, PPV of 85.6%, NPV of 46.4%, and accuracy of 69.3%. The most accurate cutoff for the Likert scale to predict complete response was 3.5, with a sensitivity of 47.5%, specificity of 89.1%, PPV of 91.9%, NPV of 39.4%, and accuracy of 59%. Thus, overall, mrTRG had a lower cutoff and was more accurate in predicting pCR compared to Likert score.
Ninety of one-hundred sixty-six cases showed no mucinous change, 46/166 showed mild mucinous changes, and 30/166 showed moderate to substantial mucinous changes. None of the latter 30 cases were labeled mrTRG 1, with one being labelled Likert 1. Eleven of thirty (36.67%) cases demonstrated complete response on pathology.
One-hundred fifty-one patients had DWI available. Sixty-seven of one-hundred fifty-one (44%) residual tumors demonstrated restricted diffusion, with 55/67 (82%) of these not having pCR. Of the 84/151 cases without restricted diffusion, 32/84 (38%) had pCR. The overall sensitivity of presence of restricted diffusivity on DWI to predict residual disease was 51.4%, with specificity of 72.7%, PPV of 82.1%, NPV of 38.1%, and accuracy of 57.6%.
Discussion
Our study demonstrates that an objective mrTRG to be more accurate than a subjective Likert score in predicting pTRG. Overall, both Likert and mrTRG scores tended to be higher leading to over-staging and under-prediction of complete response. Also, MRI was not very accurate in predicting pTRG, consistent with other studies.[15 ]
[16 ]
[17 ] This underlines the fact that mrTRG is not a parameter that can be reliably assessed in the post-neoadjuvant treatment setting, and need not be reported. A clinically more relevant parameter for assessing on MRI is the presence of CR or near complete response, identifying patients for potential “Watch and Wait” strategy. The incidence of pCR in our study was 33%, comparable with other studies that report an incidence of 8 to 38%.[18 ]
[19 ]
[20 ]
[21 ]
[22 ]
The MERCURY study elaborates the morphologic responses to chemoradiation in the form of fibrosis or desmoplastic reaction and mucinous change.[6 ] Fibrosis presents as T2 dark spicules or strands radiating from the rectal wall. According to the objective mrTRG criteria, T2 dark signal intensity areas represent fibrosis and intermediate signal intensity areas represent residual tumor. We found these criteria to be more accurate (70%) than the subjective Likert scale with an accuracy of 59%. Our study results showed concordant findings to a meta-analysis by Jang et al,[24 ] which showed an accuracy of 77 to 82% for mrTRG1 for diagnosis of pathological T1 or lower and an accuracy of 56 to 74% for mrTRG1-2 for diagnosis of pathological T1 or lower. We believe that this can be attributed to T2 shine through effect and ghosting artifact produced by pools of mucin.
The role of DWI in accurately assessing response is not clear. In a meta-analysis of 14 studies by Wu et al for assessing treatment response in rectal cancer, DWI did not improve sensitivity when coupled with conventional MRI.[30 ] In another meta-analysis of 33 studies by van der Paardt et al, the pooled sensitivity of studies that included DWI in their protocol was higher (83.6 vs. 50.4%) than studies that did not.[31 ] Our study also found limited sensitivity and accuracy of DWI in detecting residual disease. The basis of DWI lies in differential movement of water molecules in various tissues. In an experimental study by Hein et al[32 ] comparing the results of DWI in post-treatment setting with histopathology, they found that radiation-induced fibroinflammatory changes can also reduce the free movement of extracellular water molecules, thus decreasing ADC values. This may lead to misinterpretation as residual disease, potentially explaining why we observed pCR in 18% patients demonstrating restricted diffusion.
A subjective Likert scale largely depends on experience of the observer. Though it was comparatively more specific (89%) compared to mrTRG (69%) in our study, it showed a poor sensitivity of 47% and had an overall lower accuracy, making mrTRG a more useful method of assessment for an inexperienced learner.
The moderate accuracy of MRI in predicting treatment response emphasizes the need for additional methods of assessment such as endoscopy and per rectal examination in order to qualify a patient for a nonsurgical Watch and Wait regimen.[30 ]
[33 ] Addition of functional sequences like dynamic contrast enhanced MRI and MR perfusion has proven to be beneficial in literature[10 ]
[12 ]
[14 ] however, extended scan time continues to be a limitation.
Our study had various limitations such as a small cohort, retrospective analysis, and radiological assessment by only two radiologists without analysis of interobserver variation. Further randomized and prospective studies would be needed to decide on reliable parameters for prediction of pCR.
In conclusion, objective mrTRG was more accurate than a subjective Likert scale to predict complete response in our study.