Introduction
Video capsule endoscopy (VCE) plays a significant role in diagnosis and management
of small bowel diseases; its usefulness has been previously reported in numerous studies
[1]
[2]
[3]. The noninvasive nature of VCE has resulted in its application in examination of
other organs, including the esophagus [4], stomach [5], and colon [6]. For VCE, patients are required to swallow a video capsule. The VCE image readers
must then assess more than 80,000 images, an uncomfortable and time-consuming process.
VCE systems have been developed by several companies, notably Given imaging and Olympus.
These companies have introduced several features to the CE software in an effort to
reduce the time required to analyze VCE images and minimize the possibility of missing
lesions [7]
[8]
[9]. Recently, Olympus developed a new algorithm called Omni mode that aims to reduce
the number of redundant images and display all areas captured by VCE.
The aim of the current study was to demonstrate the non-inferiority of Omni mode against
a control (EC-10 system, Olympus, Tokyo) in terms of the number of true positives
(TPs) in all major and minor lesions, as well as its superiority for required reading
time. To the best of our knowledge, this is the first multicenter prospective study
to evaluate the performance of the newly developed Omni mode.
Patients and methods
Study design
This multicenter prospective study was approved by the ethics committee of each participating
institution and was registered with a registry approved by the International Committee
of Medical Journal Editors (UMIN ID000010976). Written informed consent was obtained
from all patients enrolled in the study. Patient enrollment and VCE were performed
at 6 hospitals in Japan: The Jikei University Third Hospital, Nagoya University Hospital,
Kansai Medical University Hirakata Hospital, Osaka City University Hospital, Shiga
Hospital, and Kyushu University Hospital. Sixty patients with suspected small bowel
diseases were prospectively enrolled in this study. All patients underwent VCE and
the resulting images were collected. The study utilized 7 expert VCE image readers
(K.O., K.I., M.N., T.M., T.W., T.T., and M.E.) who had previously performed more than
200 VCE examinations each and 3 judging committee members (N.H., N.O., and M.S.) who
had previously performed more than 200 CE examinations each. The judging committee
members excluded 20 VCE videos from the 60 collected, due to the absence of lesions
in the small intestine (the sample size calculation is described below). Videos that
were included in the study were evaluated by the judging committee members and each
lesion was defined and documented as “major” or “minor” by consensus. Clinically relevant
lesions that required further examination were predefined as major lesions. These
included lesions with a high potential for gastrointestinal bleeding or with uncertain
bleeding potential that required further examination and intervention such as biopsy
or confirmation of bleeding risk (such as ulcers, large angioectasia, massive bleeding
from uncertain origin, ulcer scars, ulcer scars with stenosis, large submucosal tumors,
large hemangiomas, or erosion with clots). Clinically irrelevant lesions (such as
red spots, lymphangiectasia, erosion, tattooing, and small lymphangiomas) were predefined
as minor lesions ([Fig. 1]). Two randomly allocated readers assessed the VCE images obtained using both Omni
mode and the control in each patient. All readers were blinded to the medical histories
of the patients. A total of 4 readings were performed per patient. The order of the
modalities was switched between the 2 readers and the interval between readings by
the same reader was 2 weeks. In addition, the number of readings performed by each
reader was balanced within each hospital and for each modality. Readers were allocated
to assess the images according to a random schedule generated by a biostatistician
(T.A.) and were blinded to the allocation. The number of correctly detected TPs and
reading times were then compared between the 2 modalities.
Fig. 1 Video capsule endoscopy images of predefined lesions. a Bleeding (predefined as a major lesion). b Angioectasia (predefined as a major lesion). c Angioectasia (predefined as a minor lesion). d Lymphangiectasia (predefined as a minor lesion).
VCE procedure and reading
The VCE procedures were performed using an EC-10 capsule and Endocapsule 10 system
(Olympus, Tokyo, Japan). VCE was performed after an 8-hour fasting period. Bowel preparation
and prokinetics were not defined in the protocol. The recorded digital information
was downloaded from the recorder to a computer and the images analyzed using proprietary
software. The first duodenal and cecal images in each video were marked by the judging
committee in advance. Each reader assessed the video using dual view at a rate of
10 frames per second. As the Omni mode has the ability to select and discard redundant
images, the passing of the images becomes faster than that of the control. The low
frame rate was considered appropriate for reading by Omni mode, based on the results
of the preliminary evaluation and in order to avoid missing lesions.
The VCE reading time was defined as the interval between the appearance of the first
duodenal image and the first cecal image and was recorded. The VCE reading results
were assessed and the number of TPs was tallied by the judging committee. A TP was
defined as the detection of a lesion by the reader that had also been identified by
the judging committee.
Omni mode
The conceptual scheme for the new Omni mode algorithm is presented in [Fig. 2]. Omni mode can select and discard redundant images and display all areas captured
by VCE.
Fig. 2 Scheme of the new Omni mode algorithm. Data courtesy of Olympus Corp. The new algorithm
compares the target image with reference image A and analyzes the areas of the target
image covered by reference image A. In the same manner, it analyzes the areas of the
target image covered by reference image B. If all the areas of the target image can
be covered by reference images A and B, the target image will not be displayed because
it is judged as a redundant image (Scene 1). However, if there are some areas not
covered by reference images A and B in the target image after the same comparison,
the image is displayed (Scene 2).
Sample size calculation and statistical analysis
The sample size was calculated such that it had an 80 % power in testing hypotheses
on the co-primary endpoint. With regards to the non-inferiority of the Omni mode to
the control in the number of TPs in all major and minor lesions, a sample size of
40 patients was required to achieve an 80 % power (alpha = 0.05), assuming that the
mean number of major and minor lesions following Poisson distribution was 8 (standard
deviation = 2.83) and the correlation coefficient between TPs within patients was
0.8. The non-inferiority margin for the TP ratio of the Omni mode compared with the
control was predefined as 0.9. The statistical power for showing the superiority of
the Omni mode over the control in reading time required was considered to be close
to 1 for this sample size.
The number of TPs identified using the 2 modalities was compared using the generalized
estimating equations (GEE) model with Poisson response and a log link function. The
covariance structure within patients in the GEE model was compound symmetry. The fixed
effects in the model were modality, time, and reader. Response variables included
major and minor lesions and their composites. The reading time between modalities
was compared using a general linear mixed-effects model with modality, time, and reader
as the fixed effects. The covariance structure within patients in the model was compound
symmetry. The Satterthwaite method was used for the adjustment of denominator degrees
of freedom in a test of the fixed effects.
All statistical analyses were performed using SAS software (Version 9.2, SAS institute
Inc., Cary, NC). A P value < 0.05 was considered statistically significant.
Results
The study adopted an intention-to-treat principle and 40 patients were included in
the analysis. The mean age of the patients was 68.7 ± 14.2 years and 19 patients (47.5 %)
were male. The reasons for performing VCE in each case are presented in [Table 1]. The primary basis for performing VCE was obscure gastrointestinal bleeding. The
types of predefined major and minor lesions are presented in [Table 2]. A total of 264 lesions were identified in all patients, and the number of major
and minor lesions identified was 71 and 193, respectively. A summary of the TPs and
false positives (FPs) is presented in [Table 3]. Of the 264 lesions, an average of 140 and 160.5 were detected using the Omni mode
and control, respectively. The estimated TP ratios and 95 % confidence intervals for
total, major, and minor lesions were 0.87 (0.80 – 0.95), 0.93 (0.83 – 1.04), and 0.83
(0.74 – 0.94), respectively. Thus, non-inferiority of the Omni mode to the control
was not demonstrated in this study. In the post hoc analysis, although non-inferiority
was not demonstrated, the detection rate of major lesions was not significantly different
when comparing the modalities. Detailed results are presented in [Table 4]. The mean numbers of FPs were 127.5 (3.19/patient) and 192.5 (4.81/patient) using
the Omni mode and control, respectively. Although the readers detected a high number
of true lesions using the control, they also detected a high number of FPs. We calculated
the proportion of readings with the detection of 1 or more major lesions from the
total cases in which they were present and found that the proportions were 90 % using
the Omni mode and 95 % using the control. All undetected minor lesions were subtle
lesions that did not influence the therapeutic decisions.
Table 1
Reasons for performing video capsule endoscopy.
|
Reason
|
Number of patients
|
|
OGIB
|
29
|
|
Crohn’s disease
|
3
|
|
Indeterminate enteritis
|
3
|
|
Others
|
5
|
|
Total
|
40
|
OGIB, obscure gastrointestinal bleeding.
Table 2
Types of predefined findings of major and minor lesions on video capsule endoscopy.
|
Predefined lesion
|
Major (n)
|
Minor (n)
|
|
Ulcer
|
33
|
|
|
Angioectasia
|
17
|
27
|
|
Bleeding
|
12
|
3
|
|
Ulcer scar
|
3
|
|
|
Ulcer with stenosis
|
2
|
|
|
SMT
|
1
|
1
|
|
Hemangioma
|
1
|
3
|
|
Erosion
|
1
|
47
|
|
Diverticulum
|
1
|
2
|
|
Lymphangiectasia/lymphangioma
|
|
87
|
|
Tattooing
|
|
11
|
|
Red spot
|
|
7
|
|
Edema
|
|
1
|
|
Lymphoid hyperplasia
|
|
1
|
|
Hemoclip
|
|
1
|
|
Foreign body
|
|
1
|
|
Polyp
|
|
1
|
|
Total
|
71
|
193
|
SMT, submucosal tumor.
Table 3
Summary of findings of true positives and false positives.
|
Endpoint
|
Omni mode
|
Control
|
|
Proportion of major lesions detected, average n/n (ratio)
|
59/71 (0.83)
|
63.5/71 (0.89)
|
|
Proportion of minor lesions detected, average n/n (ratio)
|
81/193 (0.42)
|
97/193 (0.50)
|
|
Proportion of all lesions detected, average n/n (ratio)
|
140/264 (0.53)
|
160.5/264 (0.61)
|
|
Sensitivity for major lesion detection – per patient basis, n/n (ratio)
|
51/56 (0.90)
|
53/56 (0.95)
|
|
Average number of false positives per patient, n
|
3.2
|
4.8
|
Table 4
Primary outcome in the GEE Poisson models[1].
|
Variable
|
Ratio estimate (CI)
|
|
Detected major lesions
|
0.93 (0.83 – 1.04)
|
|
Detected minor lesions
|
0.83 (0.74 – 0.94)
|
|
Total detected lesions
|
0.87 (0.80 – 0.95)
|
CI, confidence interval.
1 Outcome modeled using Poisson distribution with identity link. The ratio estimate
is Omni mode compared with control.
Regarding the reading time, this was significantly different across the modalities
(P < 0.001), with the Omni mode requiring 27.3 minutes and the control requiring 75.1
minutes, on average. The difference in reading time between modalities was calculated
as approximately 47.7 minutes ([Table 5]).
Table 5
Secondary outcome, general linear mixed model.
|
Variable
|
Estimate (CI)[1]
|
|
Reading time (minutes)
|
– 47.7 (– 52.0, – 43.4)
|
CI, confidence interval.
1 Estimate reported is a difference between the Omni mode with control as a reference.
The total number of displayed images per video of the small intestine was 13,444 ± 4341
using the Omni mode and 39,377 ± 13,568 using the control. Therefore, a 65 % reduction
in the number of displayed images was achieved using the Omni mode. In addition, after
calculating the software-associated false negatives for major lesions, we noted that
the Omni mode was able to identify all predefined major lesions, and no software-associated
false negatives were recorded for this modality.
Discussion
The non-inferiority of the Omni mode to a control was not demonstrated in the current
study. However, the reading time was found to be significantly lower on using the
Omni mode than on using the control.
In order to reduce the possibility of missing lesions and the reading time required,
several software modes have been developed including the Blood indicator [10]
[11]
[12], Blue mode [13]
[14], Flexible spectral Imaging Color Enhancement (FICE) [15], Automatic mode [7], and Quickview [8]
[16]. The Blood indicator selects red (blood) images [10]
[11]
[12] while Blue mode enhances lesions that are blue [14]
[15]. FICE is based on the arithmetic narrowing of the bandwidth of a conventional endoscopic
image using computerized spectral estimation technology to enhance the lesion, thus
simplifying detection [17]. In Automatic mode, the software reduces the total number of images by combining
similar images [7]; while in Quickview, the software reduces the number of recorded images by identifying
the most unusual images and presenting selected images [8]
[16]. The Omni mode is a new algorithm that selects and discards redundant images and
displays all areas captured by VCE. Several previous single-center pilot studies have
demonstrated the usefulness of these modes on a limited number of patients. The current
study included 7 expert VCE readers, 3 judging committee members, an expert biostatistician,
and a large sample size of patients. First, we attempted to confirm the non-inferiority
of the Omni mode against a control for detecting lesions in a calculated and specified
sample. On analysis, we were unable to demonstrate the non-inferiority of the Omni
mode; however, we found that the detection rate of major lesions was not significantly
different with the Omni mode when compared with the control. We suspect that our failure
in confirming the Omni mode’s non-inferiority resulted from an under-powered sample
size and heterogeneity in the definition of lesions. While we had entered the expected
number of major and minor lesions into the power analysis, the number of lesions in
each video was too small to determine non-inferiority. Moreover, among the minor lesions,
heterogeneity in the definition of lesions was recognized. For example, one reader
detected a small lymphangiectasia as a minor lesion, while another reader considered
the same lesion a normal finding. Low rates of TPs and high rates of FPs were observed
among the minor lesions. In a future study, we intend to calculate the sample size
using the number of major lesions, as these are considered abnormal.
The reading time was significantly lower when using the Omni mode than when using
the control. In the post hoc analysis, the rate of TPs among major lesions was not
significantly different between modalities. While these findings could not directly
confirm equivalence between the Omni mode and control, the findings suggest that the
Omni mode may reduce the reading time without decreasing the rate of TPs. A further
clinical trial that aims to assess more than 8 definitive major lesions is required
to support these findings.
The Omni mode could reduce the redundancy and sequentiality of the VCE video. However,
the readers in the current study did not have experience using the Omni mode. We evaluated
the types of false negatives identified in the Omni mode and the control and found
that types of missed lesions did not differ across the modalities. However, variations
in the number of false negatives identified with each modality appeared to be derived
from the reader’s abilities. Thus, reader training is required for the operation of
Omni mode and reading parameters, such as video frame rate and display of dual or
quad images, must be optimized.
A potential advantage of the Omni mode is its capacity to act as a filter and identify
patients who need a detailed, standard video reading. In other words, if the Omni
mode is “positive” it may be unnecessary to devote an extended period of time to the
video review using the standard video reading mode. On the other hand, considering
the performance of the Omni mode, it is possible that a “negative” capsule in Omni
mode could be false. Thus, if a capsule is identified as positive using Omni mode,
there is no need to review the entire video using standard mode. Interestingly, the
Omni mode was able to identify all predefined major lesions and software-associated
false negatives were not recognized. Therefore, the Omni mode could reduce VCE reading
time without increasing the lesion miss rate.
In conclusion, although the non-inferiority of the Omni mode to a control was not
demonstrated, the detection rate for major lesions was not significantly different
between the modalities. Furthermore, required reading time was low with the Omni mode.
Therefore, Omni mode may be appropriate only for assessment of major lesions using
VCE after optimization of the reading parameters and thorough reader training on the
use of this modality.