Introduction
The emergence of image-enhanced endoscopy (IEE) in recent years has improved the performance of endoscopy, particularly with the development of virtual chromoendoscopy [1]
[2], whose main advantages are its availability and simplicity, a demonstrated learning curve [3], and the accomplishment of a better characterization of the mucosal pattern. Moreover, it allows endoscopists to increasingly rely on imaging diagnosis, while decreasing the necessity to perform multiple biopsies [4].
Most of the studies have focused on narrow-band imaging (NBI) technology, with favorable results [5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]. For instance, Kikuste et al. showed, in a meta-analysis, a pooled sensitivity and specificity of 0.87 and 0.77 for gastric intestinal metaplasia (GIM), and 0.90 and 0.83 for dysplasia [13]. Nevertheless, this meta-analysis was performed some years ago, and there is less evidence regarding the diagnostic accuracy of other technologies. Furthermore, there is a lack of standardization of mucosal patterns for some preneoplastic conditions and, with the advent of artificial intelligence (AI), it is of paramount importance to clearly identify the descriptors that should be used in clinical practice, because this technology is pattern-learning based.
We aimed to analyze the current evidence regarding virtual chromoendoscopy for the detection of gastric preneoplastic conditions (atrophic gastritis/GIM), lesions (dysplasia), and early gastric cancer (EGC), and to identify the factors that influence its accuracy.
Methods
Search strategy
Two electronic databases (MEDLINE through PubMed, and Embase) were searched up to December 2018. The search query for PubMed was: ((chromoendoscop* OR nbi OR "narrow band imaging" OR "Narrow Band Imaging"[Mesh] OR fice OR "flexible spectral imaging color enhancement" OR confocal OR bli OR "blue laser imaging" OR lci OR "linked color imaging" OR afi OR "autofluorescence imaging" OR i-scan)) AND ((((((((gastric [ti] AND intestinal [ti] AND metaplasia [ti]) OR "gastric intestinal metaplasia" OR "intestinal metaplasia" OR (intestinal [ti] AND metaplasia [ti]))) OR ("gastric superficial neoplastic lesions" OR (gastric [ti] AND superficial [ti] AND neoplastic [ti] AND lesion* [ti]))) OR (“gastric precancerous lesions” OR “precancerous lesions” OR (gastric [ti] AND precancerous [ti] AND lesion*[ti]))) OR (“gastric preneoplastic lesions” OR (gastric [ti] AND preneoplastic [ti] AND lesion*[ti]))) OR "Stomach Neoplasms"[Mesh]) NOT (((esophagus) NOT (gastric AND esophagus)))). This query was adapted for the Embase database (Appendix 1s, see online-only Supplementary Material).
The protocol of this study is under revision in the PROSPERO platform (www.crd.york.ac.uk/prospero/, ID 154344). This systematic review was performed according to the PRISMA guideline for diagnostic test accuracy studies (Table 1s).
Study selection
Eligibility criteria
The inclusion criteria were: original articles whose primary or secondary outcome included accuracy/sensitivity/specificity of IEE for detection of premalignant conditions or EGC. Exclusion criteria were: case reports, meta-analyses, reviews, letters, comments, congress abstracts, guidelines, studies on animals, studies with fewer than 10 cases, studies published in languages other than English/Spanish/Portuguese/Italian, and studies with unavailable statistical data for true positive, true negative, false positive, and false negative determination.
After removing duplicates and overlapping publications, two authors (M.R.C., G.E.) independently screened the titles and abstracts, and irrelevant studies were excluded. The full text of the selected studies was evaluated by the same two authors, according to the inclusion and exclusion criteria. Disagreements were resolved through discussion with a third author (D.L.). This step was performed using the Covidence online platform (www.covidence.org).
Data extraction and quality evaluation
Data extraction was performed by M.R.C. and checked by G.E. The following variables were collected: (1) author; (2) publication year; (3) country; (4) study period; (5) study design; (6) participants’ characteristics (number of patients included; number of lesions/areas biopsied; age; sex); (7) endoscope system; (8) IEE technology assessed (NBI, AFI [autofluorescence imaging], TMI [TriModal Imaging], i-SCAN, FICE [flexible spectral imaging color enhancement], BLI [blue laser imaging], LCI [linked color imaging]); (9) mucosal and vascular pattern descriptors; (10) comparator; (11) outcome (atrophic gastritis/GIM/dysplasia/EGC); (12) analysis performed (per-patient, per-biopsy/lesions); and (13) diagnostic accuracy measures (sensitivity, specificity, negative predictive value [NPV], positive predictive value [PPV], true positive, true negative, false positive, false negative, diagnostic accuracy, positive likelihood ratio, odds ratio). The reference standard was histology. Risk of bias and applicability were assessed by M.R.C. using the Quality Assessment of Diagnostic Accuracy Studies, second version (QUADAS-2).
Data synthesis and statistical analysis
Each IEE technology and gastric condition (atrophic gastritis/GIM/dysplasia/EGC) were analyzed individually, considering per-patient or per-biopsy analysis. Accuracy measures were extracted from each study using a 2 × 2 contingency table. When possible, pooled measures (sensitivity, specificity, positive likelihood ratio, diagnostic odds ratio) with their respective 95 % confidence intervals (CIs), and summary receiver operating curve were calculated.
Heterogeneity was investigated with the Cochran’s Q test (P < 0.10 meaning statistically significant heterogeneity) and I
2 (values of I
2 0–25 %, 25 %–50 %, 50 %–75 %, and > 75 % represented absent, low, moderate, and high levels of heterogeneity, respectively). When heterogeneity was absent, a fixed-effect model was used for meta-analysis. Otherwise, measures were calculated using a random-effect model. Possible sources of heterogeneity were explored by performing subgroup analysis considering the use of white-light endoscopy (WLE) before IEE, the use of high magnification, the most used mucosal pattern descriptors, and the morphology of lesions assessed. Analyses were performed using Meta-DiSc software (version 1.4).
Results
In total, 1338 studies were identified and 44 were selected for inclusion ([Fig. 1]), with a total of 10175 patients and 10451 areas biopsied. Twenty-nine studies evaluated NBI, eight AFI/TMI (one of them included an arm with high magnification NBI [ME-NBI] that was additionally considered for the NBI analysis), one i-SCAN, one FICE, three BLI, and two LCI. [Table 1] summarizes the baseline characteristics of each of the studies included [9]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
[38]
[39]
[40]
[41]
[42]
[43]
[44]
[45]
[46]
[47]
[48]
[49]
[50]
[51]
[52]
[53]
[54]
[55]
[56]. The majority of them were performed in Eastern countries and had prospective recruitment. WLE was used before IEE in 33 studies, and high magnification was performed in 30 studies. There were 25 studies that compared results from IEE with WLE, and seven that compared different IEE techniques. The gold standard in all studies was histology.
Fig. 1 Flow diagram of the study selection process.
Table 1
Baseline characteristics of the included studies.
Study design
|
First author, year of publication
|
Country; study period
|
Patients, n[1]
|
Lesions/ biopsies, n[1]
|
Age, years; sex, male, n (%)
|
Equipment
|
Intervention assessed
|
Comparison
|
Outcome[2]
|
Analysis[2]
|
AG
|
GIM
|
DYS
|
ECG
|
Narrow-band imaging (NBI)
|
RCT
|
Ezoe, 2011 [28]
|
Japan; Jun 2008–May 2010
|
353
|
176
|
69; 278 (79)
|
GIF-Q240Z GIF-H260Z GIF-FQ260Z
|
WLE + ME-NBI
|
WLE, ME-NBI
|
–
|
–
|
–
|
X
|
PB
|
Ang, 2015 [18]
|
Singapore, Thailand, China, Malaysia, Australia; Jan 2012–Oct 2013
|
579
|
119
|
62.3; 355 (61)
|
GIF-290 GIF-190
|
NBI
|
WLE
|
–
|
X
|
–
|
X
|
PB
|
Gong, 2015 [40]
|
China; Jan 2013–Jan 2014
|
82
|
80
|
59; 58 (71)
|
GIF H260Z
|
WLE + ME-NBI
|
WLE + CLE
|
–
|
–
|
–
|
X
|
PB
|
Prospective
|
Bansal, 2008 [16]
|
USA; –
|
47
|
–
|
65; 46 (98)
|
GIF-240Z
|
WLE + ME-NBI
|
–
|
–
|
X
|
–
|
–
|
PP
|
Tahara, 2009 [14]
|
Japan; May 2007–Jul 2008
|
106
|
–
|
58.7; 64 (60)
|
GIF-H260Z
|
WLE + ME-NBI
|
–
|
X
|
X
|
–
|
–
|
PP
|
Ezoe, 2010 [27]
|
Japan; Mar 2006–Feb 2008
|
53
|
57
|
–; –
|
GIF-Q240Z GIF-H260Z
|
WLE + ME-NBI
|
ME-WLE
|
–
|
–
|
–
|
X
|
PB
|
Kato, 2010 [29]
|
Japan; Jan 2008–Jan 2009
|
111
|
201
|
66; 98 (88)
|
GIF-H260Z
|
WLE + ME-NBI
|
WLE
|
–
|
–
|
–
|
X
|
PP, PB
|
Rerknimitr, 2011 [9]
|
Thailand; Nov 2007–May 2009
|
38
|
228
|
60; 20 (53)
|
GIF-160Z
|
WLE + NBI
|
–
|
–
|
X
|
–
|
–
|
PB
|
Wang, 2012 [30]
|
China; Dec 2009–Nov 2010
|
76
|
82
|
64; 61 (80)
|
GIF-H260Z
|
ME-NBI
|
–
|
–
|
–
|
–
|
X
|
PB
|
An, 2012 [15]
|
Korea; Sep 2009–Apr 2010
|
47
|
93
|
55; 28 (60)
|
GIF-H260Z
|
WLE + ME-NBI
|
–
|
X
|
X
|
–
|
–
|
PB
|
Li, 2012 [31]
|
China; Dec 2009–Oct 2011
|
146
|
164
|
59; 80 (60)
|
GIF-H260Z
|
WLE + ME-NBI
|
–
|
–
|
–
|
-
|
X
|
PB
|
Liu, 2014 [17]
|
China; Nov 2011–Oct 2012
|
90
|
207
|
57.5; 49 (54)
|
GIF-H260Z
|
WLE + ME-NBI
|
–
|
–
|
X
|
–
|
X
|
PB
|
Yao, 2014 [38]
|
Japan; Oct 2009–Nov 2010
|
310
|
371
|
66; 183 (59)
|
GIF-Q240Z GIF-Q260Z
|
WLE + ME-NBI
|
–
|
–
|
–
|
–
|
X
|
PB
|
Kawai, 2014 [35]
|
Japan; Aug 2012–Dec 2013
|
225
|
52
|
65; 160 (71)
|
GIF-XP290N
|
WLE + NBI
|
WLE
|
–
|
–
|
–
|
X
|
PB
|
Yamada, 2014 [37]
[3]
|
Japan; Jun 2008–May 2010
|
362
|
353
|
69; 278 (79)
|
GIF-Q240Z GIF-H260Z GIF-FQ260Z
|
WLE + ME-NBI
|
WLE, ME-NBI
|
–
|
–
|
–
|
X
|
PB
|
Yu, 2015 [41]
|
China; Mar 2010–Jun 2012
|
3616
|
3675
|
56; 1910 (53)
|
GIF-H260Z
|
ME-NBI
|
WLE, ME-WLE
|
–
|
–
|
–
|
X
|
PB
|
Lage, 2016 [19]
|
Portugal; Jul 2013– Dec 2014
|
35
|
–
|
60; 25 (42)
|
GIF-HQ190
|
WLE + NBI
|
WLE
|
–
|
X
|
–
|
–
|
PP
|
Pimentel-Nunes, 2016 [20]
|
Portugal, Italy, USA, Romania, UK; Jan 2014–Mar 2015
|
238
|
1123
|
60; 100 (42)
|
GIF-H180 GIF-H190 GIF-260
|
WLE + NBI
|
WLE
|
–
|
X
|
X
|
–
|
PB
|
Kanemitsu, 2017 [21]
|
Japan; Jul 2014– Dec 2014
|
40
|
40
|
68; 24 (60)
|
GIF-Q240Z GIF-H260Z
|
WLE + ME-NBI
|
–
|
–
|
X
|
–
|
–
|
PP
|
Sha, 2017 [22]
|
China; Feb 2016–June 2016
|
132
|
–
|
53.5; 53 (40)
|
GIF-Q260
|
WLE + NBI
|
WLE, AA-NBI
|
–
|
X
|
–
|
–
|
PP
|
Tahara, 2017 [23]
|
Japan; Jan 2013–Mar 2016
|
125
|
166
|
67; 89 (71)
|
GIF-H260Z
|
ME-NBI
|
–
|
–
|
X
|
–
|
–
|
PB
|
Esposito, 2019 [25]
|
Italy, Portugal; Jan 2016–Sep 2017
|
250
|
–
|
55; 95 (38)
|
GIF-H180 GIF-HQ190
|
WLE + NBI
|
–
|
–
|
X
|
–
|
–
|
PP
|
Retrospective
|
Miwa, 2012 [32]
|
Japan; Aug 2006–Sep 2009
|
135
|
109
|
70; 77 (57)
|
GIF-Q240Z GIF-H260Z
|
ME-NBI
|
WLE
|
–
|
–
|
–
|
X
|
PB
|
Tsuji, 2012 [33]
|
Japan; Jul 2007– Dec 2010
|
137
|
137
|
68; 101 (74)
|
GIF-Q240Z GIF-H260Z
|
ME-NBI
|
–
|
–
|
–
|
–
|
X
|
PB
|
Maki, 2013 [34]
|
Japan; Jan 2006–Mar 2010
|
93
|
93
|
71; 73 (79)
|
GIF-Q240Z GIF-H260Z
|
ME-NBI
|
WLE
|
–
|
–
|
–
|
X
|
PB
|
Tao, 2014 [36]
|
China; Mar 2010–Jun 2012
|
508
|
643
|
63; 316 (62)
|
GIF-H260Z
|
ME-NBI
|
WLE, ME-AA-indigo carmine
|
–
|
–
|
–
|
X
|
PB
|
Fujiwara, 2015 [39]
|
Japan; Jan 2006–Aug 2013
|
99
|
103
|
–; 21 (21)
|
GIF-Q240Z GIF-H260Z
|
ME-NBI
|
ME-indigo carmine
|
–
|
–
|
–
|
X
|
PB
|
Nonaka, 2016 [42]
|
Japan; Oct 2009–Mar 2014
|
91
|
100
|
74; 84 (92)
|
–
|
ME-NBI
|
WLE
|
–
|
–
|
–
|
X
|
PB
|
Sobrino-Cossio, 2018 [24]
|
Mexico; Jan 2015–Dec 2016
|
338
|
776
|
64; 203 (60)
|
GIF-H180
|
WLE + NBI
|
WLE
|
–
|
X
|
–
|
–
|
PB
|
Autofluorescence imaging (AFI)/ trimodal imaging (TMI)
|
RCT
|
Lim, 2013 [26]
|
Singapore; Sep 2011– Jul 2012
|
20
|
125
|
62.4; 15 (75)
|
GIF-FQ260Z
|
WLE + AFI
|
WLE, ME-NBI, pCLE
|
–
|
X
|
–
|
–
|
PB
|
So, 2013 [44]
|
Singapore; Jul 2007– Nov 2007
|
64
|
146
|
61; 29 (45)
|
GIF-FQ260Z
|
TMI (WLE + AFI + ME-NBI)
|
WLE
|
X
|
X
|
–
|
–
|
PB, PP
|
Prospective
|
Kobayashi, 2001 [46]
|
Japan; –
|
52
|
54
|
–; –
|
GIF-Q40
|
WLE + LIFE-GI
|
–
|
–
|
–
|
–
|
X
|
PB
|
Kato, 2007 [47]
|
Japan; Mar 2006–Aug 2006
|
51
|
91
|
65.7; 41 (80)
|
XGIF−Q240FZ
|
WLE + AFI
|
WLE, AFI
|
–
|
–
|
X
|
–
|
PB
|
Kato, 2009 [48]
|
Japan; Apr 2007–Dec 2007
|
65
|
91
|
–; –
|
XGIF-Q240FZ
|
TMI (WLE + AFI + ME-NBI)
|
WLE, AFI
|
–
|
–
|
X
|
–
|
PB
|
Imaeda, 2014 [49]
|
Japan; Jan 2010–Dec 2011
|
182
|
74
|
79; 147 (81)
|
GIF H260Z
|
TMI (WLE + AFI + ME-NBI)
|
WLE, AFI
|
–
|
–
|
X
|
–
|
PB
|
Inoue, 2010 [43]
|
Japan; Nov 2006–Apr 2007
|
75
|
256
|
67; 48 (61)
|
EVISFQ260Z
|
WLE + AFI
|
–
|
X
|
X
|
–
|
–
|
PP
|
Shi, 2015 [45]
|
China; Feb 2012–Oct 2013
|
140
|
–
|
41; 62 (44)
|
GIF-FQ260Z
|
TMI (WLE + AFI + ME-NBI)
|
AFI
|
–
|
X
|
-
|
X
|
PP
|
i-SCAN
|
Prospective
|
Li, 2013 [50]
|
China; Jan 2012–Mar 2012
|
43
|
43
|
47.5; 32 (74)
|
EG-2990 Zi
|
WLE + ME-iSCAN
|
–
|
–
|
–
|
–
|
X
|
PB
|
Flexible spectral imaging color enhancement (FICE)
|
Prospective
|
Pittayanon, 2013 [51]
|
Thailand; Jan 2010–Dec 2011
|
60
|
120
|
63; 33 (55)
|
–
|
ME-FICE
|
ME-FICE + pCLE
|
–
|
X
|
–
|
–
|
PB
|
Blue laser imaging (BLI)
|
Prospective
|
Dohi, 2017 [52]
|
Japan; Nov 2012–Apr 2015
|
530
|
127
|
70; 95 (81)
|
EG-L590ZW
|
WLE + ME-BLI
|
WLE
|
–
|
–
|
–
|
X
|
PB
|
Chen, 2018 [53]
|
China; Jan 2017–May 2017
|
100
|
–
|
51; 54 (54)
|
EG-L590ZW
|
WLE + ME-BLI
|
WLE
|
–
|
X
|
–
|
–
|
PP
|
Chen, 2019 [54]
|
China; Jul 2017– Feb 2018
|
106
|
–
|
50; 56 (59)
|
EG-L590ZW
|
WLE + BLI
|
WLE, WLE + AA-BLI
|
–
|
X
|
–
|
–
|
PP
|
Linked color imaging (LCI)
|
Prospective
|
Ono, 2018 [56]
|
Japan; Jul 2016–May 2017
|
128
|
177
|
65; 54 (42)
|
EG-L590ZW EG-L600ZW EG-L580NW
|
WLE + LCI
|
WLE
|
–
|
X
|
–
|
–
|
PB, PP
|
Fukuda, 2019 [55]
|
Japan; Oct 2015–Jun 2016
|
48
|
48
|
73; 31 (65)
|
EG-L590ZW
|
WLE + LCI
|
–
|
–
|
X
|
–
|
–
|
PB
|
AG, atrophic gastritis; GIM, gastric intestinal metaplasia; DYS, dysplasia; EGC, early gastric cancer; RCT, randomized clinical trial; WLE, white-light endoscopy; ME, high magnification; PB, per-biopsy/lesion analysis; pCLE, probe-based confocal laser endomicroscopy; PP, per-patient analysis; AA, acetic acid; LIFE-GI, laser-induced fluorescence endoscopy in the gastrointestinal tract.
1 If an article analyzed other techniques or aims outside the scope of this meta-analysis, we considered only the “n” of the subset related to our outcomes.
2 If more than one outcome was analyzed (AG, GIM, DYS, EGC), or more than one analysis was performed (PB, PP), we included only those for which we could extract complete data.
3 Post-hoc analysis of an RCT.
[Table 2] summarizes the endoscopic descriptors used for each gastric condition. The most evaluated outcome was dysplasia/EGC. None of the studies reported a specific pattern for atrophic gastritis. Regarding GIM, the most used markers were the presence of tubulovillous pattern and “light-blue crest” (LBC) ([Fig. 2a]). Some discrepancies concerning the LBC concept were noticed: despite it originally being defined under high magnification [5], only half of the studies applied high magnification, and some of the non-magnification studies still used the LBC concept but with a different description. Regarding dysplastic lesions, the most used sign was the presence of irregular microsurface/microvascular pattern (independently of the presence of a demarcation line) ([Fig. 2b]).
Table 2
Pattern descriptors by condition and technology.
Use of high magnification
|
Atrophic gastritis
|
Gastric intestinal metaplasia
|
Dysplasia/early gastric cancer
|
Narrow-band imaging (NBI)
|
Yes
|
LBC: a fine blue-white line on the crest of the epithelial surface/gyri [15]
MTB: an enclosing white turbid band on the epithelial surface/gyri [15]
Oval or tubulovillous pit with clearly visible coiled or wavy vessels [14]
|
LBC: a fine blue-white line on the crest of the epithelial surface/gyri [15]
[17]
[21]
[26]
MTB: an enclosing white turbid band on the epithelial surface/gyri [15]
Ridge/tubulovillous mucosal pattern [16]
[17]
Oval or tubulovillous pit with clearly visible coiled or wavy vessels [14]
[23]
WOS: a white substance that renders the subepithelial vasculature of the intervening part surrounded by crypt openings opaque [21]
|
VS classification: irregular microsurface and/or microvascular pattern within a demarcation line [27]
[28]
[30]
[32]
[33]
[34]
[36]
[37]
[38]
[39]
[40]
[41]
[42]
Demarcated lesions with a disappearance of fine mucosal structure, microvascular dilation and microvascular heterogeneity in shape [29]
Obscure irregular microsurface or microvascular pattern / no microsurface pattern and sparse microvascular or avascular areas [31]
Demarcated lesions with disappearance of normal pit pattern and appearance of new vessels [17]
|
No
|
|
Ridge/tubulovillous mucosal pattern [9]
[18]
[19]
[20]
[24]
[25]
LBC with different definitions:
A fine, blue-white line on the crests of the epithelial surface [9]
[18]
[24]
Blue-whitish slightly raised areas [9]
[20]
LLC: a combination of linear dark and light areas that differed from the normal gastric epithelium [9]
Bluish-whitish areas with a regular mucosal pattern [22]
|
Irregular mucosal and vascular pattern [18]
[20]
VS classification [35]
WOS: a white material above the mucosa that could be either well defined (regular) or not (irregular) [20].
|
Autofluorescence imaging (AFI)
|
No
|
Homogeneous green appearance in the gastric body [43]
[44]
|
Homogeneous green areas with a regular pattern in the gastric body [26]
[43]
[44]
|
Dark red or deep red changes [45]
[46]
Area with a defined margin and with a difference in color compared with the surrounding mucosa [47]
[48]
[49]
|
i-SCAN
|
Yes
|
–
|
–
|
Surface pit pattern: irregular arrangement and size or destructive pattern [50]
|
Flexible spectral imaging color enhancement (FICE)
|
Yes
|
–
|
Villous mucosal pattern [51]
LBC: a fine blue-white line on the crest of the epithelial surface [51]
LLC: a combination of linear dark and light areas [51]
|
–
|
Blue laser imaging (BLI)
|
Yes
|
–
|
Bluish-whitish patchy areas with a regular mucosal pattern [53]
|
Irregular microsurface or microvascular pattern within a demarcation line [52]
|
No
|
|
Bluish-whitish patchy areas with a regular mucosal pattern [54]
|
|
Linked color imaging (LCI)
|
No
|
–
|
Focal and patchy lesion with a lavender color that was distinguished from the surrounding area [55], defined as “lavender-color sign” [56]
|
–
|
Summary
|
|
To be better defined
|
Tubulovillous pattern is the most consistent endoscopic marker and improves accuracy values
|
Irregular microsurface and/or microvascular pattern are consistently a sign of dysplasia/cancer
|
LBC, light-blue crest; MTB, marginal turbid band; LLC, large long crest; WOS, white opaque substance; VS classification, vessel plus surface classification, according to Yao et al. [57].
Fig. 2 Representative images of the principal markers of gastric intestinal metaplasia and early gastric cancer.
Some of the images were previously published in Endoscopy
[21] and in Endoscopy International Open
[4], and were used in this systematic review with the permission of the Editorial Board.
* Endoscopic description according to Uedo’s definition [5].
Quality assessment
The quality assessments of the included studies are shown in Table 2s and Fig. 1s. Each study was judged for risk of bias and applicability concerns, and was classified as “low risk” when considered “low” in all domains, “high risk” when one or more domains were considered “high,” and “unclear” when insufficient data were reported. Almost half of the studies (52 %) showed high risk of bias on patient selection, mainly because they included very select population/gastric area/lesions (e. g. enriched population, only one gastric area assessed, depressed lesions, lesions < 10 mm). Concerning other domains, the majority of studies presented low risk of bias, although 34 % of studies demonstrated unclear risk with regard to the reference standard (mostly because of uncertainty of blinding). Almost all studies showed low concerns relating to applicability. Subgroup analysis according to study quality was not possible owing to the low number of studies in each subgroup.
Diagnostic characteristics of different IEE technologies
[Table 3] and [Table 4] show pooled analysis for GIM and dysplasia/EGC. The detailed non-pooled measures from each study are shown for atrophic gastritis (Table 3s), GIM (Table 4s), and dysplasia/EGC (Table 5s) (owing to the low number of studies included in some groups, it was not possible to conduct a pooled analysis in all of them).
Table 3
Pooled analysis of results for gastric intestinal metaplasia.
Covariates
|
Number of studies
|
Sensitivity (95 % CI)
|
Specificity (95 % CI)
|
Positive LR (95 % CI)
|
DOR (95 % CI)
|
AUC (95 % CI)
|
Narrow-band imaging (NBI): per-patient analysis
|
Overall
|
6
|
0.79 (0.72–0.85)
|
0.91 (0.88–0.94)
|
12.37 (3.66–41.83)
|
65.65 (11.04–390.41)
|
0.9 (0.77–1.03)
|
High magnification (ME)
|
Overall
|
3
|
0.82 (0.67–0.92)
|
0.97 (0.92–0.99)
|
17.78 (6.69–47.25)
|
86.40 (26.71–279.5)
|
0.95 (0.83–1.07)
|
Previous WLE
|
3
|
0.82 (0.67–0.92)
|
0.97 (0.92–0.99)
|
17.78 (6.69–47.25)
|
86.40 (26.71–279.5)
|
0.95 (0.83–1.07)
|
Non-previous WLE
|
0
|
–
|
–
|
–
|
–
|
–
|
Villous pattern
|
2
|
0.75 (0.51–0.91)
|
0.97 (0.93–0.99)
|
21.41 (8.45–54.28)
|
78.00 (20.02–303.87)
|
–
|
Non-villous pattern
|
1
|
0.87 (0.68–0.92)
|
0.94 (0.70–1.00)
|
14.00 (2.09–93.95)
|
105.00 (9.93–1110.00)
|
–
|
Non-ME
|
Overall
|
3
|
0.78 (0.69–0.84)
|
0.89 (0.84–0.92)
|
8.11 (1.44–45.68)
|
45.85 (2.56–822.13)
|
0.67 (0.55–0.79)
|
Previous WLE
|
3
|
0.78 (0.69–0.84)
|
0.89 (0.84–0.92)
|
8.11 (1.44–45.68)
|
45.85 (2.56–822.13)
|
0.67 (0.55–0.79)
|
Non-previous WLE
|
0
|
–
|
–
|
–
|
–
|
–
|
Villous pattern +/ LBC
|
2
|
0.90 (0.79–0.96)
|
0.95 (0.91–0.97)
|
17.14 (9.63–30.53)
|
157.58 (56.28–441.25)
|
–
|
Non-villous pattern +/ LBC
|
1
|
0.67 (0.54–0.78)
|
0.68 (0.56–0.79)
|
2.10 (1.42–3.10)
|
4.29 (2.07–8.88)
|
–
|
Narrow-band imaging (NBI): per-biopsy analysis
|
Overall
|
9
|
0.84 (0.81–0.86)
|
0.95 (0.94–0.96)
|
12.71 (5.45–29.6)
|
72.51 (23.31–225.52)
|
0.92 (0.84–1.00)
|
High magnification (ME)
|
Overall
|
4
|
0.83 (0.77–0.88)
|
0.95 (0.92–0.96)
|
14.72 (5.48–39.53)
|
85.53 (24.18–302.55)
|
0.96 (0.88–1.05)
|
WLE
|
3
|
0.78 (0.71–0.85)
|
0.93 (0.89–0.96)
|
10.09 (4.25–23.94)
|
42.75 (22.67–80.61)
|
0.92 (0.87–0.97)
|
Non-WLE
|
1
|
0.96 (0.86–0.99)
|
0.98 (0.94–1.00)
|
56.11 (14.18–221.98)
|
1351.3 (184.87–9876.5)
|
–
|
LBC
|
2
|
0.81 (0.72–0.88)
|
0.89 (0.82–0.94)
|
8.27 (2.36–29.04)
|
39.20 (17.30–88.82)
|
–
|
Non-LBC
|
2
|
0.85 (0.76–0.92)
|
0.96 (0.94–0.98)
|
26.12 (6.12–111.95)
|
233.81 (9.12–5997.8)
|
–
|
Non-ME
|
Overall
|
5
|
0.84 (0.81–0.87)
|
0.95 (0.94–0.96)
|
11.05 (3.18–38.34)
|
60.25 (11.01–329.71)
|
0.87 (0.72–1.01)
|
WLE
|
4
|
0.84 (0.81–0.87)
|
0.95 (0.94–0.96)
|
10.24 (2.46–42.70)
|
46.60 (6.67–325.36)
|
0.79 (0.66–0.93)
|
Non-WLE
|
1
|
0.92 (0.82–0.98)
|
0.94 (0.85–0.98)
|
15.46 (5.96–40.12)
|
189.00 (44.96–794.43)
|
–
|
Villous pattern +/ LBC
|
3
|
0.88 (0.84–0.90)
|
0.97 (0.96–0.98)
|
29.03 (17.73–47.52)
|
224.28 (148.63–338.43)
|
0.97 (0.95–1)
|
Non-villous pattern +/ LBC
|
2
|
0.74 (0.66–0.80)
|
0.78 (0.72–0.83)
|
3.081 (1.45–6.55)
|
8.74 (2.19–34.82)
|
–
|
Autofluorescence imaging (AFI): per-patient analysis
|
Without ME-NBI
|
2
|
0.86 (0.77–0.92)
|
0.82 (0.74–0.88)
|
3.82 (1.00–14.59)
|
27.04 (11.05–66.19)
|
–
|
With ME-NBI (TMI)
|
2
|
0.80 (0.69–0.88)
|
0.77 (0.68–0.84)
|
3.03 (0.15–59.52)
|
7.50 (0.062–910.64)
|
–
|
Linked color imaging (LCI): per-biopsy analysis
|
Overall (without ME)
|
2
|
0.73 (0.65–0.81)
|
0.92 (0.87–0.96)
|
8.89 (2.49–31.68)
|
35.09 (16.20–75.98)
|
–
|
Blue laser imaging (BLI): per-patient analysis
|
Overall
|
2
|
0.78 (0.67–0.87)
|
0.83 (0.75–0.89)
|
7.48 (0.37–150.36)
|
32.12 (0.66–1569.3)
|
–
|
With ME
|
1
|
0.89 (0.74–0.97)
|
0.97 (0.89–0.99)
|
28.44 (7.23–111.82)
|
248 (43.09–1427.4)
|
–
|
Without ME
|
1
|
0.68 (0.52–0.82)
|
0.69 (0.57–0.80)
|
2.22 (1.46–3.38)
|
4.85 (2.09–11.26)
|
–
|
CI, confidence interval; LR, likelihood ratio; DOR, diagnostic odds ratio; AUC, area under the summary receiver operating curve; WLE, white light endoscopy; LBC, light-blue crest; TMI, trimodal imaging.
Table 4
Pooled analysis of results for dysplasia/early gastric cancer.
Covariates
|
Number of studies
|
Sensitivity (95 % CI)
|
Specificity (95 % CI)
|
Positive LR (95 % CI)
|
DOR (95 % CI)
|
AUC (95 % CI)
|
Narrow-band imaging (NBI) for early gastric cancer: per-biopsy analysis
|
Overall
|
19
|
0.87 (0.84–0.89)
|
0.97 (0.97–0.98)
|
14.03 (7.21–27.28)
|
107.38 (45.91–251.15)
|
0.95 (0.93–0.98)
|
High magnification (ME)
|
Overall
|
17
|
0.87 (0.84–0.89)
|
0.97 (0.97–0.98)
|
15.20 (7.31–31.58)
|
114.08 (46.30–281.08)
|
0.96 (0.93–0.98)
|
WLE
|
8
|
0.88 (0.88–0.92)
|
0.96 (0.95–0.97)
|
18.96 (10.21–35.20)
|
158.49 (67.49–372.20)
|
0.98 (0.96–1.00)
|
Non-WLE
|
9
|
0.86 (0.84–0.89)
|
0.98 (0.97–0.98)
|
12.12 (3.49–42.05)
|
80.78 (19.26–338.86)
|
0.94 (0.90–0.98)
|
VS classification
|
14
|
0.86 (0.83–0.88)
|
0.98 (0.97–0.98)
|
14.56 (6.01–35.30)
|
97.81 (34.51–277.22)
|
0.95 (0.91–0.98)
|
Non-VS classification
|
3
|
0.94 (0.88–0.98)
|
0.94 (0.92–0.96)
|
17.31 (5.30–56.57)
|
223.79 (73.91–677.64)
|
0.98 (0.97–1.00)
|
Non-ME
|
Overall
|
2
|
0.91 (0.59–1.00)
|
0.84 (0.78–0.90)
|
7.12 (1.88–26.92)
|
60.34 (9.26–393.14)
|
–
|
WLE
|
1
|
0.88 (0.47–1.00)
|
0.93 (0.81–0.99)
|
12.83 (4.17–39.46)
|
95.67 (8.67–1055.5)
|
–
|
Non-WLE
|
1
|
1.00 (0.29–1.00)
|
0.81 (0.73–0.88)
|
4.55 (2.69–7.69)
|
29.40 (1.47–589.72)
|
–
|
VS classification
|
1
|
0.88 (0.47–1.00)
|
0.93 (0.81–0.99)
|
12.83 (4.17–39.46)
|
95.67 (8.67–1055.5)
|
–
|
Non-VS classification
|
1
|
1.00 (0.29–1.00)
|
0.81 (0.73–0.88)
|
4.55 (2.69–7.69)
|
29.40 (1.47–589.72)
|
–
|
Depressed-type lesions
|
Overall
|
6
|
0.88 (0.80–0.93)
|
0.96 (0.93–0.97)
|
17.41 (10.69–28.36)
|
143.83 (38.80–533.23)
|
0.99 (0.98–1.00)
|
ME
|
5
|
0.88 (0.80–0.93)
|
0.96 (0.94–0.97)
|
18.08 (10.11–32.32)
|
160.53(32.98–781.36)
|
0.99 (0.98–1.00)
|
Non-ME
|
1
|
0.88 (0.47–1.00)
|
0.93 (0.81–0.99)
|
12.83 (4.17–39.46)
|
95.67 (8.67–1055.5)
|
–
|
ME-WLE
|
5
|
0.86 (0.76–0.92)
|
0.96 (0.94–0.97)
|
18.57 (9.96–34.65)
|
170.69 (26.55–1097.4)
|
0.99 (0.96–1.01)
|
ME-Non-WLE
|
0
|
–
|
–
|
–
|
–
|
–
|
ME-VS classification
|
4
|
0.87 (0.78–0.93)
|
0.96 (0.94–0.98)
|
17.55 (7.25–42.49)
|
151.80 (20.37–1131.18)
|
0.99 (0.96–1.01)
|
ME-Non-VS classification
|
1
|
0.93 (0.66–1.00)
|
0.95 (0.90–0.97)
|
17.36 (9.34–32.29)
|
230.10 (27.31–1939.00)
|
–
|
Elevated-type lesions
|
Overall
|
3
|
0.88 (0.82–0.92)
|
0.87 (0.80–0.92)
|
6.74 (0.96–47.45)
|
45.92 (3.85–547.59)
|
0.94 (0.85–1.03)
|
ME
|
3
|
0.88 (0.82–0.92)
|
0.87 (0.80–0.92)
|
6.74 (0.96–47.45)
|
45.92 (3.85–547.59)
|
0.94 (0.85–1.03)
|
Non-ME
|
0
|
–
|
–
|
–
|
–
|
–
|
ME-WLE
|
0
|
–
|
–
|
–
|
–
|
–
|
ME-Non-WLE
|
3
|
0.88 (0.82–0.92)
|
0.87 (0.80–0.92)
|
6.74 (0.96–47.45)
|
45.92 (3.85–547.59)
|
0.94 (0.85–1.03)
|
ME-VS classification
|
3
|
0.88 (0.82–0.92)
|
0.87 (0.80–0.92)
|
6.74 (0.96–47.45)
|
45.92 (3.85–547.59)
|
0.94 (0.85–1.03)
|
ME-Non-VS classification
|
0
|
0.88 (0.82–0.92)
|
0.87 (0.80–0.92)
|
6.74 (0.96–47.45)
|
45.92 (3.85–547.59)
|
0.94 (0.85–1.03)
|
Trimodal imaging (TMI) for dysplasia: per-biopsy analysis
|
Overall
|
2
|
0.93 (0.85–0.98)
|
0.98 (0.92–1.00)
|
35.24 (10.06–123.46)
|
565.81 (93.32–3430.6)
|
|
CI, confidence interval; LR, likelihood ratio; DOR, diagnostic odds ratio; AUC, area under the summary receiver operating curve; WLE, whitelight endoscopy; VS classification: vessel plus surface classification.
NBI
Thirty studies were selected for analysis, including a total of 8482 patients. It was only possible to perform pooled analysis for GIM and EGC.
Atrophic gastritis: Two studies evaluated the presence of atrophic gastritis [14]
[15]. The endoscopic criteria for atrophic gastritis were the same as those used for GIM; however, for atrophic gastritis, the sensitivities were significantly lower compared with those for GIM.
GIM: Fourteen studies reported their results regarding GIM [9]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
[24]
[25]
[26].
-
Per-patient analysis: the pooled sensitivity and specificity from the six studies included were 0.79 (95 %CI 0.72–0.85) and 0.91 (95 %CI 0.88–0.94), with moderate to high heterogeneity. In the three studies that used ME-NBI, heterogeneity was absent to moderate, and the pooled specificity was significantly higher compared with the non-ME-NBI subgroup (0.97 [95 %CI 0.92–0.99] vs. 0.89 [95 %CI 0.84–0.92]).
-
Per-biopsy analysis: from the nine studies included, the pooled sensitivity and specificity were 0.84 (95 %CI 0.81–0.86) and 0.95 (95 %CI 0.94–0.96), with high heterogeneity. Subgroup analysis according to ME-NBI use did not influence accuracy or heterogeneity.
Under high magnification, we found that using LBC as the only marker for GIM was associated with a lower specificity compared with the use of other patterns or a combination of endoscopic markers (0.89 [95 %CI 0.82–0.94] vs. 0.96 [95 %CI 0.94–0.98], high heterogeneity); however, only two studies were included in each subgroup. In the studies without high magnification, accuracy was significantly higher in the subgroup that used tubulovillous pattern with or without LBC on the per-biopsy analysis, and heterogeneity was absent to moderate (sensitivity and specificity of 0.88 [95 %CI 0.84–0.90] and 0.97 [05 %CI 0.96–0.98]) ([Fig. 3]).
Fig. 3 Narrow-band imaging pooled analysis for gastric intestinal metaplasia: accuracy of tubulovillous pattern ± light blue crest, without high magnification, on per-biopsy basis. CI, confidence interval; SROC, summary receiver operating curve; AUC, area under the curve; SE, standard error; Q*, Q index; df, degrees of freedom.
Dysplasia and EGC: Nineteen articles evaluated the detection of dysplasia/EGC. One article performed two separate analyses based on morphology without conditioning overlapping data, so we included both of these, meaning a total of 20 studies were included [17]
[18]
[20]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
[38]
[39]
[40]
[41]
[42]. In order to reduce heterogeneity, we only performed the analysis with those studies that aimed to discriminate cancerous (Vienna 4–5) vs. non-cancerous lesions (including Vienna 3). Studies regarding histological characterization of EGC were outside the scope of this meta-analysis.
-
Per-biopsy analysis: the pooled sensitivity and specificity from the 19 studies included were 0.87 (95 %CI 0.84–0.89) and 0.97 (95 %CI 0.97–0.98), respectively, with high heterogeneity. Specificity was significantly higher in the ME-NBI subgroup (17 studies) at 0.97 (95 %CI 0.97–0.98) vs. 0.84 (95 %CI 0.78–0.90) in the non-ME studies (2 studies; high heterogeneity). Morphology had a significant impact on the diagnostic accuracy, being higher in depressed-type lesions (sensitivity 0.88 [95 %CI 0.80–0.93], specificity 0.96 [95 %CI 0.93–0.97]; absent to moderate heterogeneity).
In the studies with ME-NBI, the specificity was significantly higher in the subgroup that used the “vessel plus surface” (VS) classification ([Fig. 4]) (specificity 0.98 [95 %CI 0.97–0.98] vs. 0.94 [95 %CI 0.92–0.96]), although the sensitivity was lower (0.86 [95 %CI 0.83–0.88] vs. 0.94 [95 %CI 0.88–0.98]), with high heterogeneity. The pooled specificity of the studies with WLE before ME-NBI was lower compared with those without WLE (0.96 [95 %CI 0.95–0.97] vs. 0.98 [95 %CI 0.97–0.98]; high heterogeneity).
Fig. 4 Narrow-band imaging pooled analysis for early gastric cancer: accuracy of the vessel plus surface classification under high magnification, on a per-biopsy basis. CI, confidence interval; SROC, summary receiver operating curve; AUC, area under the curve; SE, standard error; Q*, Q index; df, degrees of freedom. * WLE followed by NBI. † Elevated lesions. ‡ Depressed lesions.
AFI/TMI
Eight studies were selected, including 649 patients. One study evaluated the laser-induced fluorescence endoscopy in the gastrointestinal tract (LIFE-GI) system, which represents the technological predecessor of AFI, and has a lower image quality. Four studies evaluated AFI combined with ME-NBI, which is recognized as trimodal imaging (TMI).
Atrophic gastritis: The presence of atrophic gastritis in the corpus was assessed in two studies [43]
[44]: one evaluated AFI on a per-patient basis; the other evaluated TMI on a per-patient and per-biopsy basis. Although the results were variable, the studies showed a very low sensitivity and an intermediate specificity.
GIM: Four studies assessed GIM [26]
[43]
[44]
[45]. Although the overall accuracy was slightly higher for AFI compared with TMI, this was not statistically significant. One TMI study showed very poor specificity [44], and the authors speculated that the reason for this could be mostly down to their limited experience in interpreting ME-NBI images.
Dysplasia/EGC: Among the five included studies [45]
[46]
[47]
[48]
[49], pooled analysis was only possible for two of them. Both studies evaluated the presence of dysplasia with TMI, showing a sensitivity and specificity of 0.93 and 0.98, respectively. In general, studies with TMI obtained better results, chiefly improving the sensitivity.
i-SCAN
One study evaluated the accuracy of ME-i-SCAN in the diagnosis of cancerous lesions (43 patients/lesions) [50]. Although the sensitivity was very high (1.00), the specificity was only acceptable (0.77) and the PPV was poor (0.50). The authors concluded that the value of ME-i-SCAN in the diagnosis of cancerous lesions is limited, because microvascular assessment remains unsatisfactory.
FICE
One study [51] compared ME-FICE with ME-FICE + pCLE (probed-based confocal laser endomicroscopy) in 60 patients. Although ME-FICE had a high sensitivity and specificity for GIM (0.96 and 0.80, respectively), the addition of pCLE increased the specificity in 11 %. Therefore, the authors support the use of a combination of virtual chromoendoscopy with pCLE to characterize suspected GIM areas.
BLI
Three studies evaluated BLI, including 736 patients [52]
[53]
[54]. Pooled analysis was possible for two of them, reaching a sensitivity and specificity of 0.78 and 0.83, respectively, for GIM detection. Only one study evaluated BLI without high magnification, showing a low sensitivity and specificity (0.68 and 0.69, respectively), but possible reasons for this low performance level were not discussed in the study. In contrast, the two studies that applied BLI with high magnification obtained a high accuracy and the authors concluded that the results were similar to those obtained with ME-NBI in previous reports.
LCI
Two studies evaluated the accuracy of LCI for GIM, including 176 patients [55]
[56]. The pooled sensitivity and specificity were 0.73 and 0.92, respectively. Regarding the appearance of GIM, the authors speculated that the “lavender-color sign” in LCI corresponds to the “bluish-whitish area” observed in NBI as both probably have the same explanation, this being differences in the light reflectance of the brush border.
Discussion
Gastric cancer is the fifth most common cancer worldwide, with a high lethality rate [58], mainly owing to its late diagnosis. Screening programs have been applied for many years in high incidence populations, improving the survival rate. Although these programs can also be cost-effective in intermediate risk countries [59]
[60], they have still not been implemented, and current recommendations suggest the diagnosis and surveillance of individuals with extensive preneoplastic conditions through the use of IEE [59]
[61]
[62]
[63]. This approach allows the possibility of offering an endoscopic treatment instead of gastric surgery, so avoiding the associated morbidity and mortality [59]
[62]
[64]
[65].
According to the IEE technology being used, descriptors for preneoplastic conditions or EGC have been modified over time. For instance, with WLE, GIM was first described as “ash-colored nodular changes” by Kaminishi et al. [66] and as a “motley patchy erythema” by Nagata et al. [67]. With NBI, new markers were described such as “bluish-whitish areas” [8] and, when using ME-NBI, LBCs can be seen in these areas [5]. However, the accuracy of these technologies changes according to which endoscopic marker is analyzed, so it is necessary to clearly identify the descriptors that are associated with higher IEE diagnostic accuracy. Furthermore, current AI technologies have high false positive/negative rates [68]
[69], a possible explanation for which is a lack of standardization of patterns. Therefore, the results of this meta-analysis can also aid in the development of better AI technologies for detection and characterization of gastric preneoplastic conditions and neoplastic lesions.
Previous meta-analyses have assessed the performance of NBI for diagnosing GIM and dysplasia [13]
[70]
[71]; however, there are fewer studies evaluating other IEE technologies and the possible factors that influence their performance.
This meta-analysis confirms the high accuracy of NBI (with or without high magnification) for GIM and also for EGC (but for this outcome specificity was higher with high magnification). In the studies with ME-NBI, the use of the VS classification also seemed to improve specificity compared with other endoscopic criteria (0.98 vs. 0.94). Authors from the non-VS classification studies established the diagnosis according to irregularities on the microvascular/microsurface pattern, although two of them analyzed these patterns in demarcated or circumscribed lesions without specifying if they were really evaluating a demarcation line or not. Our meta-analysis supports the results from previous studies that suggest the VS classification is an effective criterion to diagnose intestinal-type EGC [72]. A slightly higher specificity (2 % increase) for EGC was found in the studies that did not use WLE before NBI. However, even if WLE does not add to the characterization, it is undoubtedly useful for detection and can provide additional clues to endoscopic diagnosis and prediction of deep submucosal invasion (e. g. morphological changes, redness, convergence of mucosal folds) [72].
Regarding GIM, the accuracy of non-ME-NBI was similar to that of ME-NBI, and the best diagnostic measures were even obtained using tubulovillous pattern without high magnification (sensitivity of 0.88, specificity of 0.97, absent-to-moderate heterogeneity). Under high magnification, the presence of LBCs as the only marker for GIM obtained lower specificity when compared with the use of other markers (0.89 vs. 0.96). LBC definition also differed between studies. Therefore, we consider that tubulovillous pattern is the most adequate marker to identify GIM. Moreover, tubulovillous pattern can be effectively detected even without high magnification, which makes it suitable for widespread adoption because of the limited availability of high magnification in some centers.
Regarding other technologies, pooled analysis was only possible for some of them owing to the low number of studies included. Although it was not possible to perform a comparison between IEE technologies, current evidence suggests that NBI is the most effective technology to detect gastric preneoplastic conditions and EGC [1]. The high false positive rate of AFI may improve with TMI, but none of these technologies were demonstrated to be superior to ME-NBI. The lack of standardization of FICE settings makes it difficult to perform comparative studies, and i-SCAN seems not to be adequate for vascular pattern assessment [50]. Although one study suggested that pCLE added to the specificity when compared with FICE alone [51], the diagnostic measures obtained are not superior to the pooled accuracy achieved with virtual chromoendoscopy, meaning its value for this outcome remains questionable.
The main difference between NBI and the above-mentioned technologies is that it is a narrowed-spectrum technology by filtering illumination light, which provides good visualization of the microsurface/microvascular pattern. BLI shares the same physical principle but, instead of having an optical filter, it combines two laser lights (blue laser imaging) or changes the light intensity of different LED lights (blue light imaging) to obtain the narrow-band light. As a result, BLI delivers images similar to NBI, and it is expected that results from the two technologies will be equivalent. Although some studies reported comparable results between BLI and NBI, they were performed under high magnification, and mostly in Eastern countries. A recent study also reported excellent results with BLI without high magnification to diagnose GIM [73]; nevertheless, more studies evaluating BLI without high magnification, especially in Western countries, are needed to reach definite conclusions.
None of the studies showed a specific pattern for atrophic gastritis, and the accuracy for detecting this condition was particularly low. However, GIM is a more reliable marker of gastric cancer risk and its endoscopic descriptors are more consistent and reproducible, therefore, unless new accurate atrophic gastritis classifications emerge, GIM seems to be the best endoscopic indicator for stratification of gastric cancer risk.
Some limitations have to be considered. First, most of the studies were carried out in Eastern countries, namely Japan, which is considered a high risk country for the incidence of gastric cancer; this may induce an enriched study population. Second, there was high heterogeneity among studies regarding descriptors, lesions/area assessed, and population, which could lead to a mis- or overdiagnosis. Some of these affected the quality of studies, especially for dysplasia/EGC, and this must to be taken into account in the assessment of the results. In spite of these assumptions, our initial analysis considering studies with low risk of bias obtained the same results in terms of accuracy and heterogeneity; moreover, the investigation of the possible influence of different covariates was not possible owing to the low number of studies.
In conclusion, our study confirms the high accuracy of NBI for GIM and EGC ([Table 5]). There is a necessity for mucosal pattern to be reassessed for atrophic gastritis; until new atrophic gastritis classifications emerge, GIM is the most effective marker to evaluate EGC risk. As the presence of the tubulovillous pattern is the most relevant pattern for detecting GIM, this should be used in current practice, and it can be effectively evaluated without using high magnification. This feature, along with the VS classification, seems to be consistent and usable for new IEE technologies, such as BLI (with the recently emerged multi-LED technology) and AI, looking toward improving the diagnosis of preneoplastic conditions and cancerous lesions.
Table 5
List of recommendations.
Image-enhanced endoscopy (IEE) technologies
|
Current evidence suggests narrow-band imaging (NBI) to be the most effective IEE technology to detect gastric intestinal metaplasia (GIM) and cancerous lesions
Owing to blue laser/light imaging (BLI) and NBI sharing the same physical principle, it is expected that results will be equivalent
|
Preneoplastic conditions and neoplastic lesions
|
There is a necessity to reassess descriptors for atrophic gastritis
Tubulovillous pattern is the most effective marker to detect GIM
Vessel plus surface (VS) classification may be useful to characterize cancerous lesions
Magnifying endoscopy can be helpful even though not required for GIM assessment
|